| Literature DB >> 27465245 |
Robert Stewart1, Katrina Davis2.
Abstract
PURPOSE: 'Big data' are accumulating in a multitude of domains and offer novel opportunities for research. The role of these resources in mental health investigations remains relatively unexplored, although a number of datasets are in use and supporting a range of projects. We sought to review big data resources and their use in mental health research to characterise applications to date and consider directions for innovation in future.Entities:
Keywords: Big data; Electronic health records; Epidemiology; Mental disorders
Mesh:
Year: 2016 PMID: 27465245 PMCID: PMC4977335 DOI: 10.1007/s00127-016-1266-8
Source DB: PubMed Journal: Soc Psychiatry Psychiatr Epidemiol ISSN: 0933-7954 Impact factor: 4.328
Resources arranged geographically
| Region/nation | Database | Mental health specific? | Description | Example publication |
|---|---|---|---|---|
| Middle East, Asia and Australasia | ||||
| Middle East | Clalit Health Services | No | National. Covers 55 % Israeli population | Hammerman et al. [ |
| Israeli Psychiatric Case register | Yes | National. Secondary care psychiatry. Since 1950 | Lichtenberg et al. [ | |
| Far East | Hong Kong Hospital Authority | No | Covers 95 % secondary care in HK | Cheung et al. [ |
| Seoul National University | No | Local secondary care | Park et al. [ | |
| Taiwan National Health Insurance Database | No | National. Covers 96 % Taiwan population | Chen et al. [ | |
| Australia | Mental Health National Outcomes and Casemix | Yes | National. Secondary care psychiatry. Since 2003 | Burgess et al. [ |
| Western Australia admin | No | Regional (3.7 m people). Mental health sub-group. Up to 50 years data | Lawrence et al. [ | |
| Multi-country (Asia) | Pan-Asian SNP Consortium (HUGO) | No | Research database | Ngamphiw et al. [ |
| Europe | ||||
| Western Europe | Asturias Cumulative Psychiatric Case Register (RACPAS) | Yes | Spain. Regional (1 m people). Secondary care psychiatry | Bobes et al. [ |
| Gmünder ErsatzKasse (GEK) | No | Germany. National. Large health insurer (6 % population, around 5 m people) | Sauer et al. [ | |
| German Research Network on Depression/DGPPN-BADO | Yes | BADO is national minimum data set for inpatient psychiatry. Depression network from 10 heterogeneous hospitals | von Wolff et al. [ | |
| Health Search Database | No | Italy. National. Primary care data (1.5 % population, around 1 m people) | Sultana et al. [ | |
| Marseille/French National Health Insurance Fund | No | Regional. Prescription data | Bocquier et al. [ | |
| Regensberg Hospital/DGPPN-BADO | Yes | Germany. Local. BADO is minimum data set from psychiatric inpatients | Frick et al. [ | |
| South Verona Community-Based Mental Health Service | Yes | Italy. Local. Secondary care psychiatry. 25 years+ of data | Donisi et al. [ | |
| Zurich/Swiss psychiatric case register | Yes | Regional. Secondary care psychiatry. 25 years+ of data | Lay et al. [ | |
| United Kingdom | Clinical Practice Research Data link (CPRD), formerly General Practice Research Database (GPRD) | No | National sample primary care providers. Some data open access (NIHR.ac.uk) | Margulis et al. [ |
| Clinical Record Interactive Search (CRIS) | Yes | Local secondary care psychiatry. South London and Maudsley Biomedical Research Centre (SLaM BRC) Case Register. 200,000+ people | Perera et al. [ | |
| Generation Scotland | No | Regional (Scotland). Research database. Family based cohort | Fernandez-Pujals et al. [ | |
| GRiST | Yes | Multiple locations, primary and secondary care psychiatry. Mental health risk assessment software | Buckingham [ | |
| Public Health England Mental Health Dementia and Neurology Intelligence Network | Yes | Regional (England). 22 ‘indicators’ from mixed administrative sources | Wilkinson et al. [ | |
| QResearch GP database | No | National sample primary care providers. 600 practices, around 12 m people | Coupland et al. [ | |
| The Health Improvement Network (THIN) | No | National sample primary care providers. 10 m people, broadly representative of population | Osborn et al. [ | |
| UK Biobank | No | National sample 500,000 volunteers. Research database | Smith et al. [ | |
| Secure Anonymised Information Linkage (SAIL) | No | Linked data from a range of healthcare sources covering Wales (population 3 m) | John et al. [ | |
| PsyCymru | Yes | An e-cohort of around 12,000 psychosis cases in Wales linked to SAIL data | Lloyd et al. [ | |
| Scandinavia | Danish Psychiatric Central Research Register | Yes | National. Secondary care psychiatry with extensive national linkage | Munk-Jorgensen and Ostergaard [ |
| deCODE Iceland | No | National opt-in commercial/research database | Thorgeirsson et al. [ | |
| Dutch National Survey in General Practice | No | National sample primary care providers | Maas et al. [ | |
| Finnish Hospital Discharge Register | No | National. Inpatients. Linked to other national registers | Haukka et al. [ | |
| Mid-Netherlands Psychiatric Care Register | Yes | Regional—Utrecht and surrounding areas, population 760 k. Secondary care psychiatry | Braam et al. [ | |
| Norwegian Patient Register | Yes | National. Secondary care psychiatry. Linked to other national registers | Evensen et al. [ | |
| Odense University Pharmaco-epidemiologic Database | No | Denmark. Local prescription database with linkage | Hansen et al. [ | |
| Eastern Europe | Hungarian National Health Insurance Fund | No | National. Prescription-with-indication database | Katona et al. [ |
| Multi-country (Europe) | European Observatory on Health Systems and Policies | No | Health services. Produces country-based reports | Dlouhý and Barták [ |
| European Prevention of Alzheimer’s Dementia (EPAD) project | Yes | A European Innovation Medicines Initiative | Ritchie et al. [ | |
| European Autism Interventions | Yes | A European Innovation Medicines Initiative | Murphy and Spooren [ | |
| Nordic population-based prescription database | No | Pharmaco-epidemiology using databases from five countries | Zoëga et al. [ | |
| PROTECT-EU | No | Pharmaco-vigilence using databases in three countries | Requena et al. [ | |
| Refinement | Yes | Mental health services. Population data and service inventory | Sfetcu et al. [ | |
| America | ||||
| Canada | Canadian Chronic Disease Surveillance System (CCDSS) | Yes | National. Will specifically monitor excess mortality in people with psychiatric diagnosis | Lesage et al. [ |
| Canadian Primary Care Sentinel Surveillance Network | No | National sample primary care providers | Wong et al. [ | |
| OntarioMD | No | Regional, primary care providers | Hwang et al. [ | |
| Ontario Mental Health Reporting System | Yes | Regional, based on interRAI MH dataset for psychiatric inpatients | Perlman et al. [ | |
| Saskatchewan Health Databases | No | Regional, multisource. 25 years+ of data | Meng et al. [ | |
| USA | 23andMe | No | National. Commercial genotyping database, self-report | Tung et al. [ |
| Agency for Healthcare Research and Quality (AHRQ) Healthcare Cost and Utilisation Project (HCUP) | No | National sample hospital care providers. Databases and software through Federal-State-Industry partnership | Smith et al. [ | |
| Alzheimer’s Disease Genetic Consortium | Yes | Distributed network of sample of healthcare providers | McDavid et al. [ | |
| CDC data surveillance systems, including national ambulatory care survey | No | National. A number of monitoring systems and surveys | Olfson et al. [ | |
| Data QUEST | No | Sample of 15 primary care providers in five states | Estiri et al. [ | |
| Electronic medical records and genomics network (eMERGE) | No | Distributed network of five leading academic medical centres for biobanking, includes Alzheimer’s cohorts | Kho et al. [ | |
| Group Health Research Institute (GHRI) | No | Healthcare management organization (HMO). HMO network member | Lin et al. [ | |
| Health Plan Employer Data and Information Set (HEDIS) | No | National. Set of performance measures used by most health plans in USA. Managed by National Committee for Quality Assurance (NCQA) | Clark et al. [ | |
| Informatics for integrating biology and the bedside (i2b2) | No | Local secondary care. Biobank affiliated with Harvard Medical Schools | Perlis et al. [ | |
| U.S. Food and Drug Administration (FDA) Mini-Sentinel, including Innovation in Medical Evidence Development and Surveillance (IMEDS) | No | National (currently sample) medication-based database, aiming to create active monitoring system | Raebel et al. [ | |
| Kaiser Permanente, including KP Research Program on Genes, Environment and Health (RGEH) | No | Regional sample. HMO based in Northern California, 3.4 m insured | Young et al. [ | |
| Mayo Clinic | No | Local secondary care provider. Based in Minnesota, also contributes to Olmsted County/Rochester projects | Sohn et al. [ | |
| MarketScan Research Database | No | National sample. Commercial claims and encounters database from mix of providers | Watkins et al. [ | |
| Medicaid & Medicare data | No | National sample. Government reimbursed healthcare activity. Data accessed through CMS.gov or a variety of platforms, including MarketScan and HEDIS | Medicaid Medical Directors Learning Network [ | |
| Mental Health Research Network at Health Care Systems Research Network, formerly HMO research network | No | National sample. Distributed network of up to 17 HMOs with virtual data warehouse. Potentially 11 m population in 11 states | Ahmedani et al. [ | |
| Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) | No | Local critically ill. ICU patients in Massachesets teaching hospitals | Ghassemi et al. [ | |
| National Prescription Audit (NPA) and National Disease and Therapeutic Index (NDTI) | No | National sample. Commercial medication-focused databases from IMS Institute for Healthcare Informatics | Alexander et al. [ | |
| New York Presbytarian | No | Local. Single hospital. 30 years+ of data | Melamed et al. [ | |
| Palo Alto Medical Foundation (PAMF) | No | Regional. Single HMO. HMO network member | Goyal et al. [ | |
| Partners Healthcare | No | Regional. Single HMO. Feeds into i2b2 | Castro et al. [ | |
| PharMetrics Patient-Centric Database, now merged with IMS databases | No | National sample. Pharmacy and encounter data 14 m people | Berger et al. [ | |
| Penn Longitudinal Database | Yes | Regional. Public mental health use (secondary care) in Philadelphia. Also part of collaborative perinatal project | Connolly Gibbons et al. [ | |
| Shared Health Information Network (SHRINE) | No | Multiple sites. Secondary care. Collaboration between Harvard and University of California hospitals | Kohane [ | |
| Scalable Partnering Network (SPAN) for Comparative Effectiveness Research (CER) | No | National sample. Project providing linkage between nine HMOs and two community partners | Toh et al. [ | |
| Stanford Translational Research Integrated Database Environment (STRIDE) | No | Local. Data from healthcare provider. Data on 2 m people since 1994 | Raj et al. [ | |
| Texas Department of Criminal Justice | No | Local database of prisoners | Baillargeon et al. [ | |
| University of Michigan Health System data warehouse | No | Local secondary healthcare provider. Uses Electronic Medical Record Search Engine (EMERSE) | Hanauer et al. [ | |
| Vanderbilt University Biorepository—BioVU | No | Local secondary care provider. Genomics, select health metrics and EHR | Crawford et al. [ | |
| Veterans Affairs Database | No | National specialist provider for veterans. Provides healthcare for aprox 14 m, has smaller biobank | Bauer et al. [ | |
| Multi-continent | ||||
| Aetionomy | Yes | Neurodegenerative diseases. Under European Innovative Medicines Initiative, aligned with EPAD in Europe and GAP in North America | Hofmann-Apitius et al. [ | |
| Asian Pharmacoepidemiology Network (AsPEN) | No | Eight cohorts in distributed network model: six countries, four continents, 200 m people | Pratt et al. [ | |
| Enhancing NeuroImaging Genetics through Meta-Analysis (ENIGMA) | Yes | Sets of research cohorts. 70 institutions taking part | Thompson et al. [ | |
| Global Burden of Disease (GBD)/WHO mental health survey | No | Estimates of morbidity for 187 countries | Whiteford et al. [ | |
| Genetic Consortium for Anorexia Nervosa | Yes | Up to 30 datasets for GWAS | Reichborn-Kjennerud et al. [ | |
| Health Care Quality Indicators (HCQI) for OECD countries | No | Comparative data on national health systems | Moran and Jacobs[ | |
| IMS Prescribing Insights database | No | Medication-based database. Presence in 30 countries | Wong et al. [ | |
| Psychiatric Genomic Consortium | Yes | Has a number of working groups for specific disorders and cross-disorder group | Cross-Disorder Group of the Psychiatric Genomics [ | |
| International Genomics of Alzheimer’s Project (I-GAP) | Yes | Including existing genetic consortia and other cohorts | Lambert et al. [ | |
| Sequenced Treatment Alternatives to Relieve Depression (STAR*D) | Yes | Whilst this international study was not “big data”, in terms of using hybrid EHR and manual methods, it develops techniques to be used for observational research in big data | Garriock et al. [ | |
| WHO Global Health Observatory Data Repository | No | Special topics covered, including mental health and suicide | WHO [ | |
Example topics in papers discussing mental illness epidemiology, treatment and outcome
| Disorder (% of papers) | Descriptive epidemiology and service use | Risk factors, comorbidities and genetics | Treatment and prognosis | Physical health, pregnancy, mortality |
|---|---|---|---|---|
| All disorders (10 %) | Manson [ | Roque et al. [ | Donisi et al. [ | Perini et al. [ |
| Severe mental illness (5 %) | Lyalina et al. [ | Kyaga et al. [ | Perlman et al. [ | Matheson et al. [ |
| Dementia (9 %) | Knopman et al. [ | Exalto et al. [ | van den Bussche et al. [ | Rait et al. [ |
| Substance use disorder (2 %) | Bonn-Miller et al. [ | Nesvåg [ | Mark et al. [ | |
| Schizophrenia (6 %) | Okkels et al. [ | Harper et al. [ | Stroup et al. [ | Gal et al. [ |
| Bipolar disorder (2 %) | Castro et al. [ | Schaefer et al. [ | Hayes et al. [ | Lee and Lin [ |
| Depressive disorders (11 %) | Hoffmann et al. [ | Hanauer et al. [ | Morkem et al. [ | Lin et al. [ |
| Anxiety and somatoform disorders (2 %) | Walters et al. [ | Lacourt et al. [ | Sandelin et al. [ | Frayne et al. [ |
| Eating disorders (1 %) | Micali et al. [ | Reichborn-Kjennerudet al. [ | ||
| Post-partum mental disorders (2 %) | Polachek et al. [ | Goyal et al. [ | ||
| Intellectual disabilities (1 %) | Sprung et al. [ | Alexander et al. [ | ||
| Autism/Autism Spectrum Disorder (ASD) (6 %) | Kohane [ | Hsu et al. [ | Wong et al. [ | |
| Other neuro-developmental disorders (4 %) | Surén et al. [ | Leivonen et al. [ | Hoffmannet al. [ |
Fig. 1The relative number of papers found reporting on different classes of medication (57 papers on medication in total)
Examples of other topics appearing in multiple papers
| Topic (% of papers) | Example papers |
|---|---|
| Medication prescription 6 % | Sultana et al. [ |
| Medication safety and adverse drug reactions 13 % | Chung et al. [ |
| Medication safety in older adults | Hwang et al. [ |
| Medication safety during pregnancy | Hviid et al. [ |
| Suicide and self-injury 5 % | Stewart et al. [ |
| Mental health admissions 4 % | Frick et al. [ |
| Patient characteristics 4 % | Koopmans et al. [ |
| Mental Health Services Quality 3 % | Moran and Jacobs [ |