Literature DB >> 25964630

PsyGeNET: a knowledge platform on psychiatric disorders and their genes.

Alba Gutiérrez-Sacristán1, Solène Grosdidier1, Olga Valverde1, Marta Torrens1, Àlex Bravo1, Janet Piñero1, Ferran Sanz1, Laura I Furlong1.   

Abstract

UNLABELLED: PsyGeNET (Psychiatric disorders and Genes association NETwork) is a knowledge platform for the exploratory analysis of psychiatric diseases and their associated genes. PsyGeNET is composed of a database and a web interface supporting data search, visualization, filtering and sharing. PsyGeNET integrates information from DisGeNET and data extracted from the literature by text mining, which has been curated by domain experts. It currently contains 2642 associations between 1271 genes and 37 psychiatric disease concepts. In its first release, PsyGeNET is focused on three psychiatric disorders: major depression, alcohol and cocaine use disorders. PsyGeNET represents a comprehensive, open access resource for the analysis of the molecular mechanisms underpinning psychiatric disorders and their comorbidities.
AVAILABILITY AND IMPLEMENTATION: The PysGeNET platform is freely available at http://www.psygenet.org/. The PsyGeNET database is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). CONTACT: lfurlong@imim.es SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25964630      PMCID: PMC4565028          DOI: 10.1093/bioinformatics/btv301

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Psychiatric disorders have a great impact on morbidity and mortality, accounting for 1:3 of disability worldwide (Murray and Lopez, 2013; Whiteford ) and generating most of healthcare expenditure (Collins ). Psychiatric diagnoses are syndromes rather than diseases with operative diagnostic criteria based on symptoms and signs (e.g. the Diagnostic and Statistical Manual of Mental Disorders, and the International Classification of Diseases Diagnostic Criteria) and lacking reliable biological markers. Co-occurrence of two or more psychiatric diagnoses (‘psychiatric comorbidity’) has been reported to be very frequent. This co-occurrence raises two major clinical questions: whether there are underlying common environmental, genetic and neurobiological ethiopathological factors and what is their impact on clinical care. For instance, there is a high prevalence of comorbidity between major depression (MD) and substance use disorders (Pettinati ) which might be related to poor therapeutic outcome for both disorders (WHO.Global Status Report on Alcohol and Health, 2014). In the past years, there has been a great progress on the genetics of psychiatric disorders (Sullivan ). However, there is still limited understanding of the cellular and molecular mechanisms underlying these diseases and their comorbidities. Some of the factors limiting progress in this area are (i) the heterogeneity and fragmentation of the available data on psychiatric disorders into silos and (ii) the lack of resources that collect these wealth of data, integrate them and supply the information in an intuitive, open access manner to the community. PsyGeNET (Psychiatric disorders and Genes association NETwork) has been developed with these issues in mind. PsyGeNET integrates curated information on psychiatric disorders and their genes, offering exploratory tools for their query and analysis. In its current version, we focused on three psychiatric disorders: MD, alcohol use disorder (AUD) and cocaine use disorder (CUD). The selection of the diseases has been made on the basis of the following factors: (i) the significant burden of each of the diseases for the society, (ii) the large body of literature reporting results on the genetics of these diseases, in particular for MD and (iii) the comorbidity between MD and substance use disorders. Here, we describe the development of the database, the functionalities of the platform and the questions that can be addressed with PsyGeNET.

2 PsyGeNET database

PsyGeNET database (version 1.0, release January 25, 2015) contains 2642 gene-disease associations (GDAs) between 1271 genes and 37 psychiatric disease concepts collected from DisGeNET (Piñero ) and the literature by text mining followed by expert curation. The psychiatric disorders were defined by experts in the domain of psychiatry and neurobiology using 59 (MD), 13 (AUD) and 11 (CUD) UMLS concepts (the list of concepts is available at http://www.psygenet.org/).

2.1 Curated data

We included expert-curated, direct associations between human genes and the selected disease concepts from the Comparative Toxicogenomics Database™ [CTD (Davis )]. On the other hand, a dataset of GDAs identified by text mining and curated by domain experts was developed (PsyCUR). To this end, Medline abstracts published between 1980 and 2013 in journals with a SCI impact factor greater than one were processed by BeFree (Bravo ) to select sentences with GDAs. The diseases were identified using the UMLS concepts that define each disorder, whereas an in-house developed gene dictionary was used to identify the genes, as described in Bravo . A web-based annotation tool was developed to assist the curation process. The curators were presented with a putative GDA and additional information such as the supporting publications with links to the PubMed abstracts and to DisGeNET was provided. The curator decided to annotate a particular GDA as valid based on her background knowledge and the publications that support the GDA identified by text mining. Only valid GDAs were stored as PsyCUR.

2.2 Animal model data

We obtained genes from animal models of the diseases of interest from the Mouse Genome Database [MGD (Blake )] and the Rat Genome Database [RGD (Laulederkind )]. Mouse and rat genes were mapped to their human orthologs using the mapping files provided by MGD (ftp://ftp.informatics.jax.org/pub/reports/HOM_MouseHumanSequence.rpt) and RGD (ftp://rgd.mcw.edu/pub/data_release/RGD_ORTHOLOGS.txt), respectively. These mapping files are derived from NCBI HomoloGene. We only included GDAs from animal models when a human ortholog of the mouse or rat gene could be found and were supported by publications. In the case of RGD, we did not include the associations labeled as ‘resistance’, ’induced’ or ‘no association’ nor the ones annotated with the following evidence codes ‘Inferred from electronic annotation’, ‘Inferred from sequence or structural similarity’ and ‘Non-traceable author statement’.

2.3 Other data sources

We extracted GDAs from the Genetic Association Database (GAD), an archive of human genetic association studies of complex diseases (Becker ). It includes a summary extracted from published articles in peer-reviewed journals on candidate gene and Genome Wide Association Studies (GWAS). We only included GDAs from GAD that were not labeled as ‘negative’ or ‘normal variation’.

2.4 Data classification

We organized the data according to the source databases in the following classes: CURATED (containing GDAs from PsyCUR and CTD), MODELS (containing GDAs from RGD and MGD) and GAD (containing data from GAD).

2.5 Data attributes

Diseases are annotated to UMLS concepts and classified according to the MeSH disease hierarchy, whereas genes are annotated to NCBI Gene and UniProt and classified according to the Panther Protein Ontology. Each GDA is annotated with the supporting evidence (the publications reporting the GDA, a representative sentence describing the association and the original source of provenance) and the PsyGeNET score. The PsyGeNET score ranks each GDA taking into account the number of sources that report each association, the level of curation of each source and the number of publications supporting the association. The score ranges between 0 and 1 and is computed as follows: where where and

2.6 Comparison to other sources

We performed a comparison between PsyGeNET, CTD and OMIM in terms of the coverage of genes. We used as query terms the list of UMLS concepts that define each psychiatric disorder: 59 concepts for MD, 13 concepts for AUD and 11 concepts for CUD UMLS (the list of concepts is available at http://www.psygenet.org/). We used DisGeNET to query OMIM (downloaded on February 2015) and CTD (version 13949M, downloaded on February 2015), as it allows using UMLS concepts to search for diseases. We used PsyGeNET release 1.1 (January 2015). The results are listed in Table 1, where the PsyCUR column represents the genes only available in PsyGeNET and not present in the other sources.
Table 1.

Genes associated to psychiatric disorders in PsyGeNET and other public resources

CTDOMIM®PsyGeNETPsyCUR
MD906845448
AUD366534170
CUD109012813

PsyGeNET contains information on the genetics of psychiatric disorders curated from the literature that is not available in other databases. CTD™, Comparative Toxicogenomics Database; OMIM®, Online Mendelian Inheritance of Man. MD: Major Depression, AUD: Alcohol Use Disorder, CUD: Cocaine Use Disorder

Genes associated to psychiatric disorders in PsyGeNET and other public resources PsyGeNET contains information on the genetics of psychiatric disorders curated from the literature that is not available in other databases. CTD™, Comparative Toxicogenomics Database; OMIM®, Online Mendelian Inheritance of Man. MD: Major Depression, AUD: Alcohol Use Disorder, CUD: Cocaine Use Disorder

3 PsyGeNET web interface

The PsyGeNET platform is composed of a database and a web interface with a set of analysis tools, powered by Onexus (http://www.onexus.org/). The web interface supports searching, visualizing, filtering and sharing of PsyGeNET data. Data files containing the results of the user’s search can be downloaded in a variety of formats. Moreover, automatically generated scripts are provided in several programming languages to reproduce the analyses performed by the user and to incorporate them into bioinformatic workflows. Lastly, functionalities are offered to share the results of queries performed with PsyGeNET via e-mail or by embedding the HTML code of exploration results in web pages. The web interface offers two entry points: Search view: It allows the user to search a specific gene (or disease) in the database using free-text or standard identifiers. As a result, the user retrieves all diseases (or genes) that are associated with the gene (or disease) and the supporting evidence. Queries on multiple genes or diseases are also supported. Browse view: It allows the user to explore all the information starting by a specific data source (e.g. PsyCUR). The results are shown in the same way as when using the Search view. The PsyGeNET results can be filtered by the score, data source and attributes of the data. The web interface also provides links to external resources such as NCBI Gene and UniProt for genes, Linked Life Data for diseases and PubMed for publications. The user can inspect the evidence of each GDA by exploring the sentences extracted from the supporting publications in which the gene and disease are highlighted. For more details on the functionalities offered by the web interface, go to the Web Interface Help section (http://www.psygenet.org/).

4 PsyGeNET applications

Examples of questions that can be answered with PsyGeNET are: Which genes are associated to alcoholism in expert-curated databases? What subtypes of depression are available in PsyGeNET? Which genes support the association between MD and AUD? A step-by-step guide on how to answer these questions is available in the online tutorial (http://www.psygenet.org/).

5 Conclusions

We have developed PsyGeNET as a new resource to foster translational research on psychiatric disorders and their genes, through an open access database and publicly available exploration and analysis tools. A comparison to other databases indicates that PsyGeNET is unique regarding data quality and coverage (Table 1). PsyGeNET can be used to support different case studies on psychiatric diseases and their comorbidities. Because of its comprehensiveness, level of curation and suite of tools, PsyGeNET represents a valuable resource to foster the research on the molecular and biological mechanisms underpinning psychiatric disorders and their comorbidities.

Funding

This work was supported by Instituto de Salud Carlos III-Fondo Europeo de Desarrollo Regional (CP10/00524, PI13/00082 and RD12/028/024 and RD12/0028/0009 Retics-RTA), MINECO (SAF2013-41761-R-Fondo Europeo de Desarrollo Regional), the Innovative Medicines Initiative Joint Undertaking (no. 115372, EMIF and no. 115191, Open PHACTS), resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007-2013) and EFPIA companies’ in kind contribution. The Research Programme on Biomedical Informatics (GRIB) is a node of the Spanish National Institute of Bioinformatics (INB). Conflict of Interest: none declared.
  11 in total

1.  The genetic association database.

Authors:  Kevin G Becker; Kathleen C Barnes; Tiffani J Bright; S Alex Wang
Journal:  Nat Genet       Date:  2004-05       Impact factor: 38.330

Review 2.  Genetic architectures of psychiatric disorders: the emerging picture and its implications.

Authors:  Patrick F Sullivan; Mark J Daly; Michael O'Donovan
Journal:  Nat Rev Genet       Date:  2012-07-10       Impact factor: 53.242

Review 3.  Global burden of disease attributable to mental and substance use disorders: findings from the Global Burden of Disease Study 2010.

Authors:  Harvey A Whiteford; Louisa Degenhardt; Jürgen Rehm; Amanda J Baxter; Alize J Ferrari; Holly E Erskine; Fiona J Charlson; Rosana E Norman; Abraham D Flaxman; Nicole Johns; Roy Burstein; Christopher J L Murray; Theo Vos
Journal:  Lancet       Date:  2013-08-29       Impact factor: 79.321

4.  Grand challenges in global mental health.

Authors:  Pamela Y Collins; Vikram Patel; Sarah S Joestl; Dana March; Thomas R Insel; Abdallah S Daar; Warwick Anderson; Muhammad A Dhansay; Anthony Phillips; Susan Shurin; Mark Walport; Wendy Ewart; Sir John Savill; Isabel A Bordin; E Jane Costello; Maureen Durkin; Christopher Fairburn; Roger I Glass; Wayne Hall; Yueqin Huang; Steven E Hyman; Kay Jamison; Sylvia Kaaya; Shitij Kapur; Arthur Kleinman; Adesola Ogunniyi; Angel Otero-Ojeda; Mu-Ming Poo; Vijayalakshmi Ravindranath; Barbara J Sahakian; Shekhar Saxena; Peter A Singer; Dan J Stein
Journal:  Nature       Date:  2011-07-06       Impact factor: 49.962

Review 5.  Current status of co-occurring mood and substance use disorders: a new therapeutic target.

Authors:  Helen M Pettinati; Charles P O'Brien; William D Dundon
Journal:  Am J Psychiatry       Date:  2013-01       Impact factor: 18.112

6.  DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes.

Authors:  Janet Piñero; Núria Queralt-Rosinach; Àlex Bravo; Jordi Deu-Pons; Anna Bauer-Mehren; Martin Baron; Ferran Sanz; Laura I Furlong
Journal:  Database (Oxford)       Date:  2015-04-15       Impact factor: 3.451

7.  Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research.

Authors:  Àlex Bravo; Janet Piñero; Núria Queralt-Rosinach; Michael Rautschka; Laura I Furlong
Journal:  BMC Bioinformatics       Date:  2015-02-21       Impact factor: 3.169

8.  The Rat Genome Database 2013--data, tools and users.

Authors:  Stanley J F Laulederkind; G Thomas Hayman; Shur-Jen Wang; Jennifer R Smith; Timothy F Lowry; Rajni Nigam; Victoria Petri; Jeff de Pons; Melinda R Dwinell; Mary Shimoyama; Diane H Munzenmaier; Elizabeth A Worthey; Howard J Jacob
Journal:  Brief Bioinform       Date:  2013-02-22       Impact factor: 11.622

9.  The Comparative Toxicogenomics Database: update 2013.

Authors:  Allan Peter Davis; Cynthia Grondin Murphy; Robin Johnson; Jean M Lay; Kelley Lennon-Hopkins; Cynthia Saraceni-Richards; Daniela Sciaky; Benjamin L King; Michael C Rosenstein; Thomas C Wiegers; Carolyn J Mattingly
Journal:  Nucleic Acids Res       Date:  2012-10-23       Impact factor: 16.971

10.  The Mouse Genome Database: integration of and access to knowledge about the laboratory mouse.

Authors:  Judith A Blake; Carol J Bult; Janan T Eppig; James A Kadin; Joel E Richardson
Journal:  Nucleic Acids Res       Date:  2013-11-26       Impact factor: 16.971

View more
  28 in total

1.  Heterogeneous network embedding for identifying symptom candidate genes.

Authors:  Kuo Yang; Ning Wang; Guangming Liu; Ruyu Wang; Jian Yu; Runshun Zhang; Jianxin Chen; Xuezhong Zhou
Journal:  J Am Med Inform Assoc       Date:  2018-11-01       Impact factor: 4.497

2.  Analysis of disease organ as a novel phenotype towards disease genetics understanding.

Authors:  Lingyun Luo; Chunlei Zheng; Jiaolong Wang; Minsheng Tan; Yanshu Li; Rong Xu
Journal:  J Biomed Inform       Date:  2019-06-15       Impact factor: 6.317

3.  Single-nucleus transcriptomics of the prefrontal cortex in major depressive disorder implicates oligodendrocyte precursor cells and excitatory neurons.

Authors:  Corina Nagy; Malosree Maitra; Arnaud Tanti; Matthew Suderman; Jean-Francois Théroux; Maria Antonietta Davoli; Kelly Perlman; Volodymyr Yerko; Yu Chang Wang; Shreejoy J Tripathy; Paul Pavlidis; Naguib Mechawar; Jiannis Ragoussis; Gustavo Turecki
Journal:  Nat Neurosci       Date:  2020-04-27       Impact factor: 24.884

4.  Discover novel disease-associated genes based on regulatory networks of long-range chromatin interactions.

Authors:  Hao Wang; Jiaxin Yang; Yu Zhang; Jianrong Wang
Journal:  Methods       Date:  2020-10-21       Impact factor: 3.608

5.  An integrative network analysis framework for identifying molecular functions in complex disorders examining major depressive disorder as a test case.

Authors:  Anup Mammen Oommen; Stephen Cunningham; Páraic S O'Súilleabháin; Brian M Hughes; Lokesh Joshi
Journal:  Sci Rep       Date:  2021-05-06       Impact factor: 4.379

6.  Text mining and expert curation to develop a database on psychiatric diseases and their genes.

Authors:  Alba Gutiérrez-Sacristán; Àlex Bravo; Marta Portero-Tresserra; Olga Valverde; Antonio Armario; M C Blanco-Gandía; Adriana Farré; Lierni Fernández-Ibarrondo; Francina Fonseca; Jesús Giraldo; Angela Leis; Anna Mané; M A Mayer; Sandra Montagud-Romero; Roser Nadal; Jordi Ortiz; Francisco Javier Pavon; Ezequiel Jesús Perez; Marta Rodríguez-Arias; Antonia Serrano; Marta Torrens; Vincent Warnault; Ferran Sanz; Laura I Furlong
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

7.  Dual Depression: A Sex Perspective.

Authors:  Adriana Farré; Judit Tirado-Muñoz; Marta Torrens
Journal:  Addict Disord Their Treat       Date:  2017-08-04

8.  Striatal Transcriptome and Interactome Analysis of Shank3-overexpressing Mice Reveals the Connectivity between Shank3 and mTORC1 Signaling.

Authors:  Yeunkum Lee; Sun Gyun Kim; Bokyoung Lee; Yinhua Zhang; Yoonhee Kim; Shinhyun Kim; Eunjoon Kim; Hyojin Kang; Kihoon Han
Journal:  Front Mol Neurosci       Date:  2017-06-28       Impact factor: 5.639

9.  brain-coX: investigating and visualising gene co-expression in seven human brain transcriptomic datasets.

Authors:  Saskia Freytag; Rosemary Burgess; Karen L Oliver; Melanie Bahlo
Journal:  Genome Med       Date:  2017-06-08       Impact factor: 11.117

10.  TPGLDA: Novel prediction of associations between lncRNAs and diseases via lncRNA-disease-gene tripartite graph.

Authors:  Liang Ding; Minghui Wang; Dongdong Sun; Ao Li
Journal:  Sci Rep       Date:  2018-01-18       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.