Literature DB >> 26578565

PCOSKB: A KnowledgeBase on genes, diseases, ontology terms and biochemical pathways associated with PolyCystic Ovary Syndrome.

Shaini Joseph1, Ram Shankar Barai1, Rasika Bhujbalrao2, Susan Idicula-Thomas3.   

Abstract

Polycystic ovary syndrome (PCOS) is one of the major causes of female subfertility worldwide and ≈ 7-10% of women in reproductive age are affected by it. The affected individuals exhibit varying types and levels of comorbid conditions, along with the classical PCOS symptoms. Extensive studies on PCOS across diverse ethnic populations have resulted in a plethora of information on dysregulated genes, gene polymorphisms and diseases linked to PCOS. However, efforts have not been taken to collate and link these data. Our group, for the first time, has compiled PCOS-related information available through scientific literature; cross-linked it with molecular, biochemical and clinical databases and presented it as a user-friendly, web-based online knowledgebase for the benefit of the scientific and clinical community. Manually curated information on associated genes, single nucleotide polymorphisms, diseases, gene ontology terms and pathways along with supporting reference literature has been collated and included in PCOSKB (http://pcoskb.bicnirrh.res.in).
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2015        PMID: 26578565      PMCID: PMC4702829          DOI: 10.1093/nar/gkv1146

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Polycystic ovary syndrome (PCOS) is a multi-factorial reproductive disorder affecting 7–10% of women globally (1,2). It is one of the major causes of female subfertility (3,4). The complexity of the syndrome is contributed by both the genes and the diseases associated with it. Several factors like gene interactions, environmental and ethnic influences and lifestyle play a role in the clinical manifestation of PCOS (5-7). Genetic studies on PCOS has revealed that genes having dissimilar functions and affecting varied biochemical pathways like ovarian and adrenal steroidogenesis (e.g.CYP11A, CYP21), gonadotropin action and regulation (e.g. FST, LHCGR and FSHR), insulin action and secretion (e.g. INS, IRS-1, IRS-2), inflammation (e.g.IL-6), complement and coagulation cascades (e.g. VWF), signalling (e.g. ADIPOQ, INS, LHCGR, AMH), cancer (e.g. INS, AR, MMP1) etc. are associated with PCOS (7,8). Single Nucleotide Polymorphisms (SNPs)/mutations in one or more of these genes have been linked to diverse comorbid conditions associated with PCOS such as obesity, diabetes, dyslipidemia, cardiovascular diseases, cancer and subfertility (6). Thus, depending on the causal factors, the phenotypic manifestations vary widely making diagnosis a daunting task. PCOS diagnosis is currently based on the NIH or the Rotterdam criteria formulated in 2003. For a disease condition to be diagnosed as PCOS, presence of any two of the following symptoms is essential: clinical or biochemical hyperandrogenism, oligo-anovulation and/or polycystic ovaries, excluding other endocrinopathies as per the Rotterdam criteria (1,5). Being a highly researched disorder, there is abundant information available in literature on the various aspects of PCOS such as its genetics, diverse phenotypic manifestations, associated comorbidities and even inconsistencies in the study results across different population groups. Electronic databases and computational tools are required to integrate this information from disparate sources and to analyse the complex data arising out of genetic heterogeneity for PCOS. Use of such computational approaches that involve collating and mining information on genes, their function/ontology terms, expression profiles, tissue specificity and pathway information is expected to lead to a better understanding of the genetic aetiology of PCOS (9,10). Several gene-disease networks and gene prioritization methods have been applied to identify novel candidate genes or the most crucial genes involved in pathophysiology of multi-factorial diseases like Alzheimer's, Cancer etc. (11–13). This approach has not yet been used for PCOS; an important contributing factor being absence of comprehensive and curated repositories catering to PCOS-related genomic, proteomic, metabolomic and clinical information. To facilitate and augment high-throughput research on PCOS including gene network and prioritization studies, we have created a manually curated knowledgebase with complete information on genes associated with PCOS collated from literature references and reviewed databases. It currently holds information on 241 genes and their corresponding proteins, 3D structures, SNPs/mutations, ontology terms, pathways and diseases.

DATA COLLECTION AND ORGANIZATION

PubMed, was searched using relevant keywords namely ‘PCOS’, ‘Polycystic ovarian syndrome’, ‘PCO’, ‘PCO1’, ‘anovulation’, ‘Stein Leventhal’, ‘Ovarian Syndrome’, ‘polycystic ovaries’, ‘polycystic ovary disease’, ‘POS’, ‘PCOD-Polycystic Ovarian Disease’, ‘hyperandrogenic chronic anovula’, ‘hyperandrogenic chronic anovulation’, ‘ovarian hyperthecosis’, ‘Sclerocystic Ovarian Disease’, ‘sclerocystic ovary syndrome’, ‘Bilateral PCOS’ and ‘functional ovarian hyperandrogen’. The genes cited in the retrieved literature were acquired from the NCBI Gene database. The reference literature was reviewed to ascertain the association of each of these genes to PCOS. Genes and SNPs, whose association with PCOS could not be confirmed based on the reference, were discarded. Relevant information on the validated genes such as nature of the study population, ethnicity, mutations/ SNPs, gene ontology terms, protein structural information, biochemical pathway and disease related information were retrieved from online databases such as NCBI PubMed (14), Gene (15), dbSNP (14), Ensembl (16), UniProt (17), Pfam (18), InterPro (19), GO (20), KEGG (21), GeneCards (22) and OMIM (23). The diseases associated with each of the validated genes were retrieved from GeneCards database. These diseases were grouped into non-redundant categories such as cancers/tumors, cardiac disorders, reproductive disorders, immune disorders etc. Each of the gene-disease associations was manually verified by reviewing the reference literature. Several gene-disease associations, retrieved using GeneCards, could not be established based on reference literature and were therefore discarded. The data collected on genes associated with PCOS have been organised into seven tables in PCOSKB.

DATA ARCHITECTURE

PCOSKB is built on Apache HTTP Server 2.0.59. The database was created using MySQL Server 5.0 and the web interfaces were designed using PHP 5.2.9, HTML and JavaScript. These are platform independent, open source software and they support multithreading and multiuser environment. The analysis graphs and charts were generated using the JpGraph library (http://jpgraph.net/).

WEB INTERFACES

PCOSKB has a user-friendly interface, as described below: Home: a brief introduction on PCOS and information accessible in the database is provided. Search: two search options: quick and advanced are available to users. Browse: this tab can be used to browse the database for: Genes: this page provides description of the genes, their genomic location and links to the complete gene record. SNPs: the detailed information on PCOS associated SNPs such as their upstream/downstream sequence, functional significance and links to relevant databases such as dbSNP and PubMed are provided. Associated diseases: the unique gene-disease associations in PCOSKB can be browsed Pathways: the pathways involving PCOS-related genes hyperlinked to the KEGG pathway database is provided. Ontology terms: the unique ontology terms associated with genes along with their associated pathways and diseases can be analysed. Tools: this section enables users to identify the significant gene ontology terms, pathways and disease phenotypes corresponding to a user-defined gene list. The gene list can be obtained either by directly selecting the genes present in PCOSKB or through their associated diseases. Help: the features present in the database are explained with examples to assist users. Analysis: the relative frequencies of the disease phenotypes, gene ontology terms and pathways predominantly associated with genes involved in PCOS pathophysiology are displayed. Statistics: it gives information on the total number of genes, associated diseases, ontology terms, pathways and SNPs present in the database. Users can directly view the associated diseases, gene ontology terms, SNPs for genes using the ‘Diseases associated with genes’ link. The number of genes that are associated with each of the diseases can be accessed through ‘Genes associated with disease’ link.

DATA ANALYSIS

The data in PCOSKB were analysed to identify the significant comorbid conditions, pathways and ontology terms associated with PCOS. The results of the analysis can be viewed in Figure 1.
Figure 1.

(A) Pie chart of associated diseases, (B) Pie chart of ontology terms, (C) Pie chart of ontology terms (BP), (D) Snapshot of an associated ontology terms page in PCOSKB.

(A) Pie chart of associated diseases, (B) Pie chart of ontology terms, (C) Pie chart of ontology terms (BP), (D) Snapshot of an associated ontology terms page in PCOSKB. Reproductive and cardiac disorders were the most prevalent diseases associated with PCOS, as expected. Hyperlinks are provided in the pie-chart for users to view the gene-disease associations (Figure 1A). Information available in KEGG pathway database was used to cluster the PCOS-related genes and retrieve the biochemical pathways. The distribution of the pathways, based on the gene members is depicted as bar graph. Each of the bars, represent a pathway and are hyperlinked to the respective KEGG pathway. The position (role) of the PCOS-associated gene/s in the pathway can be identified by their red highlights. The number of gene members for each of the pathway is indicated adjacent to the bar. Of the 175 unique pathways, ≈30% had five or more gene members. The ontology terms for the PCOS-related genes were retrieved from the GO database (AmiGO 2). The gene ontology annotations in the GO database are organized in a hierarchical manner with ‘parent’ terms being broad and ‘child’ terms being more specific to a biological process (BP), molecular function (MF) and cellular component (CC). Parents of the PCOS-related GO terms were retrieved using the GOOSE tool (20). These data were further used to group the genes based on the three broad categories namely, MF, BP and CC. The relative share of these three components in the total ontology terms is represented as a pie chart in the Analysis tab. Each of these components were further analysed exclusively for relative proportion of the ontology terms at the subsequent level of distance from the parent. The data associated with the ontology term such as genes, pathways, diseases can be accessed in a tabular format using the hyperlinks provided in the pie-charts (Figure 1B, C, D).

DATABASE SUMMARY

Genes and SNPs: PCOSKB holds information on 241 genes and 114 SNPs associated with PCOS. These genes and SNPs have been validated for their association with PCOS by reviewing the reference literature. The information is organised into six sections namely gene related, protein related, gene ontology, validated SNPs, associated diseases and references. Information mined from PCOS-related references (e.g. population size, ethnicity, SNPs/ mutation and associated conditions) can be viewed in the reference section. Associated diseases: the database has 1905 unique gene-disease associations and 500 unique PCOS-associated diseases. These diseases have been validated for their association with the genes/PCOS by reviewing the reference literature. The diseases are broadly divided into 18 categories. Reproductive (21%) and cardiac disorders (13%) feature in the top disease associations of PCOS. Biochemical pathways and gene ontology terms: analysis of the validated genes revealed its association with 175 unique pathways and 1473 unique ontology terms. These data highlight the diverse biological role of the genes associated with PCOS; an observation which corroborates the known complexity and genetic heterogeneity of PCOS.

COMPARISON WITH EXISTING DATABASES

Few gene-based databases like GeneCards (22) and DisGeNET (24) provide information on genes and associated disorders. The information present in these databases is not entirely manually curated and derived from several primary and secondary databases. A curated data set of genes and associated diseases is essential to understand the pathophysiology of multi-factorial diseases. There are few disease-specific databases like EpilepsyGene (25), AutismKB (26) which are manually curated. However, till date no database exists with information on genes, their associated ontology terms, pathways and disorders for reproductive disorders like PCOS. In PCOSKB, we have made an attempt to integrate complete information on genes associated with PCOS. Each gene has been manually verified with reference literature for its role in PCOS and details of the study population have been incorporated in the database. Each gene-disease association present in the database has been manually curated with published literature.

CONCLUSION

Although there are few online databases catering to gene disease associations, there are no manually curated databases dedicated to PCOS that focuses on observed human gene-disease associations along with information on gene ontology terms and biochemical pathways. Clinicians and researchers can leverage the well-integrated information on molecular and clinical aspects of PCOS available in PCOSKB. PCOSKB would be updated annually.
  25 in total

Review 1.  Molecular & genetic factors contributing to insulin resistance in polycystic ovary syndrome.

Authors:  Sarbani Mukherjee; Anurupa Maitra
Journal:  Indian J Med Res       Date:  2010-06       Impact factor: 2.375

Review 2.  Genetics of polycystic ovary syndrome.

Authors:  Corrine K Welt; Jessica M Duran
Journal:  Semin Reprod Med       Date:  2014-04-08       Impact factor: 1.303

3.  GeneCards Version 3: the human gene integrator.

Authors:  Marilyn Safran; Irina Dalah; Justin Alexander; Naomi Rosen; Tsippi Iny Stein; Michael Shmoish; Noam Nativ; Iris Bahir; Tirza Doniger; Hagit Krug; Alexandra Sirota-Madi; Tsviya Olender; Yaron Golan; Gil Stelzer; Arye Harel; Doron Lancet
Journal:  Database (Oxford)       Date:  2010-08-05       Impact factor: 3.451

Review 4.  Genetics of the polycystic ovary syndrome.

Authors:  Gülüm Kosova; Margrit Urbanek
Journal:  Mol Cell Endocrinol       Date:  2012-10-16       Impact factor: 4.102

Review 5.  Polycystic ovary syndrome: current status and future perspective.

Authors:  Erin K Barthelmess; Rajesh K Naz
Journal:  Front Biosci (Elite Ed)       Date:  2014-01-01

6.  AutismKB: an evidence-based knowledgebase of autism genetics.

Authors:  Li-Ming Xu; Jia-Rui Li; Yue Huang; Min Zhao; Xing Tang; Liping Wei
Journal:  Nucleic Acids Res       Date:  2011-12-01       Impact factor: 16.971

7.  EpilepsyGene: a genetic resource for genes and mutations related to epilepsy.

Authors:  Xia Ran; Jinchen Li; Qianzhi Shao; Huiqian Chen; Zhongdong Lin; Zhong Sheng Sun; Jinyu Wu
Journal:  Nucleic Acids Res       Date:  2014-10-16       Impact factor: 19.160

8.  DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes.

Authors:  Janet Piñero; Núria Queralt-Rosinach; Àlex Bravo; Jordi Deu-Pons; Anna Bauer-Mehren; Martin Baron; Ferran Sanz; Laura I Furlong
Journal:  Database (Oxford)       Date:  2015-04-15       Impact factor: 3.451

9.  Frequency and outcome of treatment in polycystic ovaries related infertility.

Authors:  Farzana Arain; Nesreen Arif; Hafeez Halepota
Journal:  Pak J Med Sci       Date:  2015       Impact factor: 1.088

10.  The InterPro protein families database: the classification resource after 15 years.

Authors:  Alex Mitchell; Hsin-Yu Chang; Louise Daugherty; Matthew Fraser; Sarah Hunter; Rodrigo Lopez; Craig McAnulla; Conor McMenamin; Gift Nuka; Sebastien Pesseat; Amaia Sangrador-Vegas; Maxim Scheremetjew; Claudia Rato; Siew-Yit Yong; Alex Bateman; Marco Punta; Teresa K Attwood; Christian J A Sigrist; Nicole Redaschi; Catherine Rivoire; Ioannis Xenarios; Daniel Kahn; Dominique Guyot; Peer Bork; Ivica Letunic; Julian Gough; Matt Oates; Daniel Haft; Hongzhan Huang; Darren A Natale; Cathy H Wu; Christine Orengo; Ian Sillitoe; Huaiyu Mi; Paul D Thomas; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 16.971

View more
  16 in total

Review 1.  Multiomics Analysis-Based Biomarkers in Diagnosis of Polycystic Ovary Syndrome.

Authors:  Shikha Rani; Piyush Chandna
Journal:  Reprod Sci       Date:  2022-01-27       Impact factor: 3.060

2.  Further delineation of familial polycystic ovary syndrome (PCOS) via whole-exome sequencing: PCOS-related rare FBN3 and FN1 gene variants are identified.

Authors:  Cengiz Karakaya; Aylin Pelin Çil; Kaya Bilguvar; Tunahan Çakir; Mete Hakan Karalok; Recep Onur Karabacak; Ahmet Okay Caglayan
Journal:  J Obstet Gynaecol Res       Date:  2022-02-09       Impact factor: 1.697

3.  Genome-wide methylation profiling in granulosa lutein cells of women with polycystic ovary syndrome (PCOS).

Authors:  E Makrinou; A W Drong; G Christopoulos; A Lerner; I Chapa-Chorda; T Karaderi; S Lavery; K Hardy; C M Lindgren; S Franks
Journal:  Mol Cell Endocrinol       Date:  2019-10-07       Impact factor: 4.102

4.  DES-Tcell is a knowledgebase for exploring immunology-related literature.

Authors:  Ahdab AlSaieedi; Adil Salhi; Faroug Tifratene; Arwa Bin Raies; Arnaud Hungler; Mahmut Uludag; Christophe Van Neste; Vladimir B Bajic; Takashi Gojobori; Magbubah Essack
Journal:  Sci Rep       Date:  2021-07-12       Impact factor: 4.379

5.  PCOSBase: a manually curated database of polycystic ovarian syndrome.

Authors:  Nor Afiqah-Aleng; Sarahani Harun; Mohd Rusman Arief A-Rahman; Nor Azlan Nor Muhammad; Zeti-Azura Mohamed-Hussein
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

6.  Computational characterization and identification of human polycystic ovary syndrome genes.

Authors:  Xing-Zhong Zhang; Yan-Li Pang; Xian Wang; Yan-Hui Li
Journal:  Sci Rep       Date:  2018-08-28       Impact factor: 4.379

Review 7.  Polycystic ovary syndrome (PCOS) and genetic predisposition: A review article.

Authors:  Nida Ajmal; Sanam Zeib Khan; Rozeena Shaikh
Journal:  Eur J Obstet Gynecol Reprod Biol X       Date:  2019-06-08

8.  Prenatal Androgenization Alters the Development of GnRH Neuron and Preoptic Area RNA Transcripts in Female Mice.

Authors:  Laura L Burger; Elizabeth R Wagenmaker; Chayarndorn Phumsatitpong; David P Olson; Suzanne M Moenter
Journal:  Endocrinology       Date:  2020-11-01       Impact factor: 4.736

9.  PCOSDB: PolyCystic Ovary Syndrome Database for manually curated disease associated genes.

Authors:  Maniraja Jesintha Mary; Umashankar Vetrivel; Deecaraman Munuswamy; Vijayalakshmi Melanathuru
Journal:  Bioinformation       Date:  2016-01-31

10.  PCOSKBR2: a database of genes, diseases, pathways, and networks associated with polycystic ovary syndrome.

Authors:  Mridula Sharma; Ram Shankar Barai; Indra Kundu; Sameeksha Bhaye; Khushal Pokar; Susan Idicula-Thomas
Journal:  Sci Rep       Date:  2020-09-07       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.