| Literature DB >> 32791514 |
Timothy Reynolds1,2, Emma C Johnson3, Spencer B Huggett4, Jason A Bubier1, Rohan H C Palmer4, Arpana Agrawal3, Erich J Baker2, Elissa J Chesler5.
Abstract
Genome-wide association studies and other discovery genetics methods provide a means to identify previously unknown biological mechanisms underlying behavioral disorders that may point to new therapeutic avenues, augment diagnostic tools, and yield a deeper understanding of the biology of psychiatric conditions. Recent advances in psychiatric genetics have been made possible through large-scale collaborative efforts. These studies have begun to unearth many novel genetic variants associated with psychiatric disorders and behavioral traits in human populations. Significant challenges remain in characterizing the resulting disease-associated genetic variants and prioritizing functional follow-up to make them useful for mechanistic understanding and development of therapeutics. Model organism research has generated extensive genomic data that can provide insight into the neurobiological mechanisms of variant action, but a cohesive effort must be made to establish which aspects of the biological modulation of behavioral traits are evolutionarily conserved across species. Scalable computing, new data integration strategies, and advanced analysis methods outlined in this review provide a framework to efficiently harness model organism data in support of clinically relevant psychiatric phenotypes.Entities:
Year: 2020 PMID: 32791514 PMCID: PMC7688940 DOI: 10.1038/s41386-020-00795-5
Source DB: PubMed Journal: Neuropsychopharmacology ISSN: 0893-133X Impact factor: 7.853
Genes identified in both human GWAS and model organism genetic studies.
| Human gene | Model organism | Year of publication | Model organism publication (PMID) | Trait | Date | Human publication (PMID) | Trait |
|---|---|---|---|---|---|---|---|
| MPDZ | Mouse | 2002 | 11978849 | Alcohol withdrawal | 2009 | 19175764 | Alcoholism |
| MC1R | Mouse | 2003 | 12663858 | Analgesia | 2003 | 12663858 | Analgesia |
| OPRM1 | Mouse/Rat | 1994/1998 | 7982048/9512064 | Pain genetic variation/alcohol intake | 1998 | 9756053/9689128 | Alcohol dependence/opioid binding, addiction |
| GAD1 | Mouse | 1994 | 8974318 | Alcohol withdrawal | 2007 | 17034009 | Alcoholism |
| CHRM5 | Mouse | 2002 | 11900778 | Increased drinking | 2004 | 15292665 | Schizophrenia |
| GABRB2 | Mouse | 2003 | 12490572 | Action of alcohol | 1999 | 10195814 | Alcohol dependence |
| ALDH2 | Rat | 1991 | 2053491 | Alcohol drinking behavior | 1982 | 7180842 | Alcohol metabolism Caucasian/Asian |
| ALDH1 | Mouse | 1996 | 4015840 | Alcohol metabolism inbred | 1983 | 6354999 | Alcohol metabolism Caucasian/Asian |
| FAM53b | Mouse | 2016 | 26581503 | Cocaine | 2014 | 23958962 | Cocaine dependence |
| PPP1R1B | Mouse | 1998 | 9694658 | Drugs of abuse | 2006 | 16237383 | Amphetamine experience |
| CSNK1e | Mouse | 1999/2005 | 10591541/16104378 | Amphetamine/cocaine-induced stimulation | 2006 | 16237383 | Amphetamine experience |
| COMT | Mouse | 1975/1998 | 1185192/9707588 | Differential seizure susceptibility/KO social behavior | 2003 | 12716966 | Methamphetamine brain response variation |
| DBH | Mouse | 1991/1999/ 2000 | 1684202/10594079/ 10777779 | Altered norepinephrine and serotonin/seizure/alcohol | 2000 | 10673769 | Cocaine-induced paranoia |
| DRD1 | Mouse | 1994 | 8001143 | Cocaine behavior | 1997 | 9154217 | Addictive behavior |
| DRD4 | Mouse | 1997 | 9323127 | Supersensitive cocaine | 1993 | 8216280/8268330 | Alcoholism/delusional behavior |
| DBH | Mouse | 1992/2000 | 1542654 /11093800 | Ethanol induced | 2000 | 10975602 | Smoking cessation |
| DDC | Mouse/Fly | 1986/2006 | 3703899 /16783013 | Drug studies locomotor behavior | 2005 | 15879433 | Nicotine dependence |
| HTR3A | Mouse | 2001 | 11685380 | Conditioned place preference | 2001 | 11207027 | Schizophrenia and bipolar |
| HTR5A | Mouse | 1999 | 10197537 | Activity/lsd | 2009 | 19328558 | Bipolar |
| ARRB2 | Mouse | 1999 | 10617462 | Morphine analgesia | 2006 | 16894395 | ADHD |
| GRIN3A | Mouse | 2005 | 15866554 | PPI | 2009/ 2011 | 20016182/ 20084518 | Alzheimer/nicotine |
| NRXN1 | Mouse | 2009 | 19822762 | PPI, learning, grooming | 2005 | 16451640 | COGA |
| HP1BP3 | Mouse | 2016 | 27460150 | Cognitive aging | |||
| DAT1 | Mouse | 1998 | 10195128 | Cocaine IVSA | 2001 | 11449401 | ADHD |
| GRIN2B | Mouse | 1996 | 8789948 | Abnormal startle | 2000 | 10945659 | ADHD, ODD and conduct disorder. |
| CHRNA3 | Mouse | 1999 | 10318955 | Megacystis-microcolon-intestinal hypoperistalsis | 1998 | 9758605 | Epilepsy |
| CHRNB4 | Mouse | 2004 | 14996991 | Seizure | 1998 | 9758605 | Epilepsy |
| CHRNA6 | Mouse | 2002 | 11927835 | Nicotine | 2002 | 12195439 | Epilepsy |
| CHRM1 | Mouse | 2001 | 11752469 | Hyperactivity | 2003 | 14504414 | Psychiatric symptomology |
| CHRM2 | Mouse | 1999 | 9990086 | Impaired drug response | 2002 | 12116189 | Depression |
| CYP2A6 | Mouse | 1989 | 2733794 | Altered metabolism | 1998 | 9655391 | Nicotine metabolism |
| CYP2B6 | Mouse | 2010 | 19923441 | Nicotine pharmacokinetics | 1992 | 1736885 | Drug metabolism |
| NTRK2 | Mouse | 1993 | 8402890 | Neonatal death | 2005 | 15838534 | Eating disorder |
| SHC3 | Mouse | 2005 | 15716419 | Spatial memory | 2007 | 17179996 | Nicotine |
| DNM1 | Mouse | 2007 | 17463283 | Abnormal motor capabilities/coordination/ movement | 2008 | 18806795 | Exercise-induced collapse |
| TAS2R38 | Mouse | 2014 | mousephenotypes.org | Limb grasping | 2005 | 15466815 | Taste |
| APBB1 | Mouse | 2004 | 14689444 | Abnormal spatial learning | 1998 | 10079843 | Alzheimer disease |
| NRG3 | Mouse | 2016 | 27606322 | Abnormal behavior | 2008 | 18708184 | Schizophrenia |
| DRD2 | Mouse | 1995 | 7566118 | Impaired coordination | 1991 | 1832466 | Neuropsychiatric disorders |
Fig. 1Multispecies genomic and epigenomic data integration.
Genetic variation, gene regulation, and homology datasets are retrieved from a variety of publicly available resources and data repositories. Human (VH) and mouse (VM) variants are connected to the gene (GM, GH) that either contains a coding variant or is regulated by a noncoding variant. Epigenetic markers and regulatory features (RM, RH) are retrieved from ENCODE and Ensembl, then overlapped with genetic variation data from Ensembl and NCBI in order to identify regulatory variants (VM, VH). Regulatory variants (VM, VH) are overlapped with gene-regulatory datasets in the form of eQTLs (EM, EH; processed from GTEx, GeneNetwork, and specific mouse populations) and chromatin interaction studies (e.g., ChIA-PET experiments from ENCODE and gene-promoter interactions from the Eukaryotic Promoter Database). Association of regulatory variants and gene-regulatory information allows for the identification of putative gene targets. These datasets are harmonized within-species for mice (VM, EM, GM, RM) and humans (VH, EH, GH), then related across species through orthologous gene targets (OM, OH) derived from homology resources like the Alliance for Genome Resources.
Tools for Functional Genomics Data Integration.
| Tool Name | Description | Strategy |
|---|---|---|
| AnnoPred | Estimates PRS using genome-wide variants that are differentially weighted based on the integration of evidence across GWAS summary statistics and multiple annotation resources for different tissue types, genomic features, and the functional assessment of SNPs. | Bayesian framework integration |
| DIAMOnD | This tool identifies potential variant-to-gene associations based on module inclusion. Uses an algorithm for detecting disease modules based on network connectivity. | Algorithm for network module analysis |
| ENCODE Screen | Useful for discovering the potential regulatory role of genetic variants using | Database |
| FOCUS | Used to determine gene–trait associations from transcriptome-wide annotation studies using LD among SNPs and eQTL weights embedded in a probabilistic model. | Probabilistic systems framework |
| FUMA | Online tool to visualize and aggregate positional, eQTL, and chromatin interaction maps to perform enrichment analysis of human GWAS data. Can be used to associate genetic variants to target genes based on eQTL and chromatin interaction studies. | Tools pipeline and visualization |
| GeneNetwork | Set of variant, expression, and eQTL multispecies tissue specific datasets used to link genetic maps to disease and phenotypes of interest. | Database, statistical and probabilistic tools |
| GeneWeaver | Multispecies data integration tools that allow users to identify putative genes of interest based on shared or unique genetic or variant data of interest. Tools available to map, manage, and analyze large datasets. | Bipartite, k-partite, combinatorics, network analysis |
| H-MAGMA | A modified version of MAGMA that extends gene-to-variant mapping by including long-range loci interactions predicted by Hi-C. | Statistical multiple regression models |
| Harmonizome | Online resource for data integration from existing genomic resources. | Association matrix, machine learning |
| HumanBase | Online tools for tissue specific gene and network interactions. | Association network, machine learning. |
| KnowEng | Integrative analysis following formatted pipelines for knowledge discovery. | Knowledge network, machine learning |
| LDPred-funct | Used to derive polygenic scores using multiple genetic variants. LDpred-funct estimates polygenic effects by employing a model that accounts for LD and identify trait-specific priors that are based on posterior casual associations. | Probabilistic modeling |
| MAGMA | Software tool used to assign GWAS identified variants to genes, based on physical proximity, and perform joint and conditional association models that examine gene-, gene-set, and interaction effects. | Statistical multiple regression models |
| modENCODE | Collaborative data set for genomic functional elements across several species, used to define genomic regions and variants of interest. | Database, ModMine toolset |
| Monarch | Semantic integration of phenotypic disease associations to identify underlying genes. | Knowledge graph |
| PAINTOR | Used to determine SNPs to be tested for phenotypes of interest. Predicts the impact of multiple casual variants on genomic annotations by incorporating summary associations statistics, functional annotations, and LD statistics. | Probabilistic systems framework |
| psychENCODE | Collaborative data set for genomic functional elements, used to define genomic regions and variants of interest in the brain. | Database, ModMine toolset |
| S-PrediXcan | Used to predict gene associations to disease using gene expression levels to mediate summary GWAS and measured transcriptome studies without the need to use individual-level data. | |
| SMR | Identifies genes with expression levels and pleiotropic associations with diseases of interest via the integration of GWAS variants and expression data derived from eQTL studies. | Mendelian randomized analysis |
| TWAS | Identifies expression–trait associations by creating putative transcriptome-wide associations derived by integrating gene expression measurement with GWAS estimated associations. |
Fig. 2Multispecies genomic and epigenomic analysis.
Species-specific gene, gene-regulatory, and variant-level data are harmonized from public resources. Using variant and gene annotations as input from post-GWAS annotation tools (e.g., FUMA, MAGMA, etc.), gene-regulatory components can be related across species via epigenomic modeling. Gene targets identified from epigenomic modeling can be used for further post-GWAS analysis with tools such as Enrichr, GeneWeaver, KnowEng, etc. Such analyses have numerous biomedical applications, such as the discovery of disease-relevant model organisms and traits.