| Literature DB >> 31725861 |
Nor Afiqah-Aleng1, Sarahani Harun1, Mohd Rusman Arief A-Rahman1, Nor Azlan Nor Muhammad1, Zeti-Azura Mohamed-Hussein1,2.
Abstract
Polycystic ovarian syndrome (PCOS) is one of the main causes of infertility and affects 5-20% women of reproductive age. Despite the increased prevalence of PCOS, the mechanisms involved in its pathogenesis and pathophysiology remains unclear. The expansion of omics on studying the mechanisms of PCOS has lead into vast amounts of proteins related to PCOS resulting to a challenge in collating and depositing this deluge of data into one place. A knowledge-based repository named as PCOSBase was developed to systematically store all proteins related to PCOS. These proteins were compiled from various online databases and published expression studies. Rigorous criteria were developed to identify those that were highly related to PCOS. They were manually curated and analysed to provide additional information on gene ontologies, pathways, domains, tissue localizations and diseases that associate with PCOS. Other proteins that might interact with PCOS-related proteins identified from this study were also included. Currently, 8185 PCOS-related proteins were identified and assigned to 13 237 gene ontology vocabulary, 1004 pathways, 7936 domains, 29 disease classes, 1928 diseases, 91 tissues and 320 472 interactions. All publications related to PCOS are also indexed in PCOSBase. Data entries are searchable in the main page, search, browse and datasets tabs. Protein advanced search is provided to search for specific proteins. To date, PCOSBase has the largest collection of PCOS-related proteins. PCOSBase aims to become a self-contained database that can be used to further understand the PCOS pathogenesis and towards the identification of potential PCOS biomarkers. Database URL: http://pcosbase.org.Entities:
Year: 2017 PMID: 31725861 PMCID: PMC7243924 DOI: 10.1093/database/bax098
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.PCOSBase schema. This schema shows all the 29 tables with the connections from table to table.
Figure 2.PCOSBase data types structure organization. These data types are tables that can be found in Browse and Datasets menu.
Number of entries in the datasets of PCOSBase
| Dataset | Entries |
|---|---|
| PCOS-related proteins | 8185 |
| Gene ontologies | 13 237 |
| Biological processes | 8971 |
| Cellular components | 1305 |
| Molecular functions | 2961 |
| Domains | 7936 |
| Pathways | 1004 |
| Interactions | 320 472 |
| PCOS-related diseases | 1928 |
| Disease classes | 29 |
| Tissues | 91 |
| Databases | 9 |
| Resources | 30 |
| Transcriptomics | 19 |
| Proteomics | 11 |
| Publications | 14 368 |
Figure 3.PCOS-disease interaction network. This network is predicted based on PPI and 20 diseases have been predicted to be highly associated with PCOS. The network demonstrates the complexity of PCOS-diseases association and the size of the nodes indicates the degree of association between PCOS and diseases. Green node represents PCOS and size of each node denotes number of shared proteins between PCOS and its respective associated disease.