| Literature DB >> 29186302 |
Jessica D Tenenbaum1, Krithika Bhuvaneshwar2, Jane P Gagliardi3, Kate Fultz Hollis4, Peilin Jia5, Liang Ma6, Radhakrishnan Nagarajan7, Gopalkumar Rakesh8, Vignesh Subbian9, Shyam Visweswaran10, Zhongming Zhao5, Leon Rozenblit11.
Abstract
Mental illness is increasingly recognized as both a significant cost to society and a significant area of opportunity for biological breakthrough. As -omics and imaging technologies enable researchers to probe molecular and physiological underpinnings of multiple diseases, opportunities arise to explore the biological basis for behavioral health and disease. From individual investigators to large international consortia, researchers have generated rich data sets in the area of mental health, including genomic, transcriptomic, metabolomic, proteomic, clinical and imaging resources. General data repositories such as the Gene Expression Omnibus (GEO) and Database of Genotypes and Phenotypes (dbGaP) and mental health (MH)-specific initiatives, such as the Psychiatric Genomics Consortium, MH Research Network and PsychENCODE represent a wealth of information yet to be gleaned. At the same time, novel approaches to integrate and analyze data sets are enabling important discoveries in the area of mental and behavioral health. This review will discuss and catalog into an organizing framework the increasingly diverse set of MH data resources available, using schizophrenia as a focus area, and will describe novel and integrative approaches to molecular biomarker discovery that make use of mental health data.Entities:
Keywords: biomarker discovery; mental health; open access; translational bioinformatics
Mesh:
Substances:
Year: 2019 PMID: 29186302 PMCID: PMC6585382 DOI: 10.1093/bib/bbx157
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Figure 1Overview of micro- and macro-level biomarkers. indels, small insertions/deletions; SV, structural variants.
Figure 2A framework for classification of data-related resources. Nodes denote resource types (Entities, Initiatives, Platforms and Data sets), and edges show the many-to-many relationships among them.
Figure 3Visual representation of data platform attributes. See Table 1 for abbreviations.
Open data resources for biomarker discovery in mental health, particularly in schizophrenia
| Resource | Type | URL | Notes |
|---|---|---|---|
| Enhancing Neuro Imaging Genetics Through Meta Analysis (ENIGMA) | O |
| The ENIGMA Network brings together researchers in imaging genomics to understand brain structure, function and disease, based on brain imaging and genetic data. Includes Schizophrenia Working Group (ENIGMA-SCZ) |
| NIMH | O |
| The institute within the NIH that focuses on mental health and disease. The NIMH is one of 27 institutes and centers within NIH, which is part of the US Department of Health and Human Services |
| Open Translational Science In Schizophrenia (OPTICS) | O |
| A time-limited proof of concept pilot project designed to provide a forum for translational science based on Janssen clinical trial data made available to qualified investigators |
| Stanley Medical Research Institute (SMRI) | O |
| A nonprofit organization supporting research on the causes of, and treatments for, SCZ and bipolar disorder |
| Mental Health Research Network (MHRN) | O |
| Consortium of 13 health system research centers dedicated to improving patient mental health through research, practice and policy. Supported by a cooperative agreement from the NIMH. The MHRN conducts pragmatic research in health systems serving over 12 million patients |
| Common Mind Consortium | I |
| Public–private partnership to generate and analyze large-scale genomic data across several brain regions from human subjects with neuropsychiatric disease and to make these data and the associated analytical results broadly available to qualified investigators |
| Human Connectome Project (HCP) | I |
| Large NIH-funded project for integrating genomics, behavior and brain imaging. Currently, high-resolution imaging data are available on 1200 individuals. Primary modalities measure brain activity (resting state fMRI and task-evoked fMRI), white matter integrity (diffusion imaging and T2 FLAIR) and oscillatory brain activity (EEG and) |
| NIMH Human Genetics Initiative | I |
| Intended to establish a national resource of clinical and diagnostic information and immortalized cell lines from individuals with SCZ, bipolar disorder or Alzheimer's disease and their relatives, available to qualified investigators for research on the genetic basis of these disorders |
| PsychENCODE | I |
| Funded by the NIMH with the goal of accelerating discovery of noncoding functional genomic elements in the human brain and elucidating their role in the molecular pathophysiology of psychiatric disorders |
| Stanley Neuropathology Consortium (SNC) | I |
| A collection of 60 brains, consisting of 15 each diagnosed with SCZ, bipolar disorder or major depression, and unaffected controls. Samples may be requested for research purposed. Associated data are available in the SNC Integrative Database (SNCID)—see below |
| Psychiatrics Genomics Consortium (PGC) | I |
| Founded in 2007, the PGC includes over 800 investigators from 38 countries with the goal of conducting meta- and mega-analyses of genomic data for psychiatric disorders. The initial focus was on autism, attention-deficit hyperactivity disorder, bipolar disorder, major depressive disorder and SCZ. More recently, the scope has expanded to other conditions and other types of genetic variation beyond SNVs |
| Neuroscience Information Framework (NIF) | I/P |
| An NIH-funded framework for identifying, locating, relating, accessing, integrating and analyzing information from the neuroscience research enterprise. NIF has come to refer to both this initiative and the set of tools and platforms that make up that framework including the registry of electronic resources and the discovery portal for searching those resources. NIF includes >4500 curated resources and access to > 100 databases |
| Allen Brain Atlas/Data Portal | I/P |
| The Allen Institute for Brain Science is dedicated to understanding how the human brain works in health and disease. The Allen Human Brain Atlas integrates anatomic and genomic information across the brain. Data modalities include MRI, DTI, histology and gene expression data derived from both microarray and |
| NIMH Repository and Genomics Resource (RGR) | P |
| Includes 100+ studies, including CommonMind, PsychENCODE. Formerly the Center for Collaborative Genomic Studies on Mental Disorders, the RGR was established in 1998 through the NIMH Human Genetics Initiative to leverage and increase the value of human genetic samples and data produced through NIMH-funded research. It contains a collection of > 150 000 well-characterized, high-quality patient and control samples from patients with a range of mental disorders. The RGR’s Biologic Core and a Data Management Core are external to NIH |
| Function Biomedical Informatics Research Network Data Repository (FBIRN DR) | P | fbirnbdr.nbirn.net: 8080 (BROKEN) | FBIRN was initially focused on assessing major sources of variation of fMRI data generated across different scanners. The FBIRN Phase 1 data set consists of a traveling subject study of five healthy subjects, each scanned on 10 different 1.5 to 4 T scanners. The FBIRN Phase 2 and Phase 3 data sets consist of subjects with SCZ or schizoaffective disorder along with healthy comparison subjects scanned at multiple sites. The BIRN Data Repository (BDR) includes imaging, clinical, cognitive and physiological data |
| OpenNeuro (previously OpenfMRI) | P |
| A neuroimaging repository to enable reproducible analysis and data sharing. Started in 2010, it initially focused only on task-based MRI, but is now open to all forms of neuroimaging data, reflected in the name transition from OpenfMRI to OpenNeuro. Data are anonymized before distribution to protect the confidentiality of participants and distributed using a Public Domain license |
| Research Domain Criteria Database (RDoC DB) | P |
| A data repository for the harmonization and sharing of research data related to the RDoC initiative and mental health research more generally. The actual platform uses software designed to host the NIH’s National Database for Autism Research (NDAR) |
| SchizConnect | P |
| Federated access to several neuroimaging databases with images acquired on SCZ subjects. Data sources include FBIRN, NUSDAST, COINS and MCIC (maintained by the Mental Illness and Neuroscience Discovery Institute, now the Mind Research Network). More than 1100 subjects with >1000 have imaging data, including resting state fMRI, task-related fMRI, structural and diffusion imaging |
| SNCID | P |
| Web-based tool for exploring neuropathological traits, gene expression and associated biological processes in psychiatric disorders generated by the SNC within the SMRI |
| Australian Schizophrenia Research Bank | P |
| A research database and storage facility that links clinical and neuropsychological information, blood samples and structural and fMRI brain scans from people with SCZ and healthy nonpsychiatric controls, and currently has data on ∼900 cases and 900 controls |
| Internet Brain Volume Database (IBVD) | P |
| Centered around publications as the central data structure, IBVD is a Web-based searchable database of brain neuroanatomic volumetric observations that enables electronic access to the results in the published literature |
| dbGap | P |
| Developed by the NIH’s NCBI to archive and distribute the data and results from studies that have investigated the interaction of genotype and phenotype. While the focus is on genomic data, other data types are included as well, for example metabolomic data and laboratory values |
| Metabolights | P |
| A database for Metabolomics experiments and derived information. Metabolights is the slightly more established European counterpart to the NIH’s MW and the recommended metabolomics repository for a number of top journals |
| DataMed | P |
| Data search engine portal to enable users to search for data across different repositories developed for the NIH BD2K DDI by the bioCADDIE project team. The initial prototype release (v2.0) features a set of data repositories selected by the bioCADDIE team, with a form to suggest additional repositories for inclusion |
| Metabolomics Workbench (MW) | P |
| A repository for metabolomics data and metadata, MW provides analysis tools and access to metabolite standards, protocols, tutorials and training |
| PRIDE | P |
| A centralized, standards compliant, public data repository for proteomics data, including protein and peptide identifications, posttranslational modifications and supporting spectral evidence. Most of the data sets related to mental health disorders in PRIDE are derived from animal models |
| Synapse | P |
| Sage Bionetworks’ software platform for data sharing and provenance tracking. Synapse enables researchers to carry out, track and communicate research in real time and enables co-location of scientific content (data, code, results) and narrative descriptions of that work. The platform is agnostic regarding biomedical domain or data type and hosts a number of different file types and projects funded by a number of different sources |
| GEO | P |
| An international public repository developed by the NIH NCBI that archives and freely distributes microarray, next-generation sequencing and other high-throughput functional genomics data submitted by the research community |
| AE | P |
| The European counterpart to GEO. AE is an archive of functional genomics data from high-throughput functional genomics experiments. A subset of experiments is imported from GEO, while others are submitted directly |
| GEMMA | P |
| Gemma is a website, database and a set of tools for the meta-analysis, re-use and sharing of genomics data, currently primarily targeted at the analysis of gene expression profiles |
| OmicsDI | P |
| Enables data set discovery across omics data resources spanning eight international repositories, including both open and controlled access data resources. The resource provides key metadata for each data set and uses this metadata to enable search capabilities and identification of related data sets. OmicsDI helps researchers to idenitfy groups of related, multi-omics data sets across repositories |
Note: Type: O, organizational entity; I, initiative; P, platform.
Figure 4Overview of landscape of organizational entities, initiatives and data sharing platforms.
SCZ data sets in dbGaP
| Data set ID | Name | # Participants | Platform | Publication (PMIDs) | Citations | Data type |
|---|---|---|---|---|---|---|
| phs000979.v1.p1 (PRJNA293910) | Gene Expression in Postmortem DLPFC and Hippocampus from Schizophrenia and Mood Disorders | 914 | HumanHap650Yv3.0, Human1M-Duov3_B, Human HT-12 Expression Bead Ch | 28070120 | [ | SNP array, mRNA expression |
| phs000473.v2.p2 (PRJNA157243, PRJNA94281) | Sweden-Schizophrenia Population-Based Case-Control Exome Sequencing | 12 380 | SureSelect Human All Exon v.1 Kit, SureSelect Human All Exon v. | 22641211 | [ | WES |
| phs000738.v1.p1 | Exome Sequencing in Schizophrenia Families | 216 | SeqCap EZ Human Exome Library v2.0 | 23911319, 24317315 | [ | WES |
| phs000687.v1.p1 | Bulgarian Schizophrenia Trio Sequencing Study | 1826 | SureSelect Human All Exon v.2 Kit, SureSelect Human All Exon v3-50Mb, SeqCap EZ Human Exome Library v2.0 | 23040492, 22083728, 24463507 | [ | WES, SNP Genotype |
| phs000608.v1.p1 | Whole-Genome Profiling to Detect Schizophrenia Methylation Markers | 1459 | MBD-seq | 23244307 | [ | Methylation |
| phs000448.v1.p1 | Genetics of Schizophrenia in an Ashkenazi Jewish Case-Control Cohort | 3096 | HumanOmni1-Quad_v1-0_B | [ | SNP array | |
| phs000021.v3.p2 | Genome-Wide Association Study of Schizophrenia | 5064 | AFFY_6.0 | 16400611 | [ | SNP array |
| phs000167.v1.p1 | Molecular Genetics of Schizophrenia-nonGAIN Sample (MGS nonGAIN) | 3029 | AFFY_6.0 | 16400611 | [ | SNP array |
Keywords and counts for integrative biomarker studies in schizophrenia published before May 2017
| Keywords |
|
|---|---|
| schizophrenia [TIAB] AND GWAS AND expression | 285 |
| schizophrenia [TIAB] AND SNP AND expression | 242 |
| schizophrenia [TIAB] AND GWAS AND network | 140 |
| schizophrenia [TIAB] AND SNP AND network | 75 |
| schizophrenia [TIAB] AND GWAS AND methylation | 36 |
| schizophrenia [TIAB] AND GWAS AND eQTL | 35 |
| schizophrenia [TIAB] AND SNP AND integrative | 32 |
| schizophrenia [TIAB] AND GWAS AND quantitative traits | 26 |
| schizophrenia [TIAB] AND GWAS AND transcriptome | 26 |
| schizophrenia [TIAB] AND SNP AND methylation | 20 |
| schizophrenia [TIAB] AND SNP AND eQTL | 19 |
| schizophrenia [TIAB] AND SNP AND quantitative traits | 11 |
| schizophrenia [TIAB] AND SNP AND transcriptome | 10 |
| schizophrenia [TIAB] AND GWAS AND integrative | 5 |
| schizophrenia [TIAB] AND SNP AND transcriptome | 4 |
| schizophrenia [TIAB] AND genotyping AND transcriptome | 3 |
| schizophrenia [TIAB] AND SNP AND ATAC-seq | 1 |
Figure 5Publication summary of SCZ integrative studies.
Note: Publications in 2017 were estimated based on the data between January and May in 2017.