| Literature DB >> 25982315 |
Liwei Zhang1, Jinyan Guo2, Qi You1, Xin Yi1, Yi Ling1, Wenying Xu1, Jinping Hua3, Zhen Su4.
Abstract
Cotton (Gossypium spp.) is one of the most important natural fiber and oil crops worldwide. Improvement of fiber yield and quality under changing environments attract much attention from cotton researchers; however, a functional analysis platform integrating omics data is still missing. The success of cotton genome sequencing and large amount of available transcriptome data allows the opportunity to establish a comprehensive analysis platform for integrating these data and related information. A comprehensive database, Platform of Functional Genomics Analysis in Gossypium raimondii (GraP), was constructed to provide multi-dimensional analysis, integration and visualization tools. GraP includes updated functional annotation, gene family classifications, protein-protein interaction networks, co-expression networks and microRNA-target pairs. Moreover, gene set enrichment analysis and cis-element significance analysis tools are also provided for gene batch analysis of high-throughput data sets. Based on these effective services, GraP may offer further information for subsequent studies of functional genes and in-depth analysis of high-throughput data. GraP is publically accessible at http://structuralbiology.cau.edu.cn/GraP/, with all data available for downloading.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25982315 PMCID: PMC4433718 DOI: 10.1093/database/bav047
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Workflow of gene family classifications and PPI network. HMM models from UUCD, iTAK and Pfam databases were used to search putative members of TFs/TRs, PKs/PPs, UPS and CYP450s. Homolog searches between G. raimondii and Arabidopsis, and InterProScan as well as public information were also applied to further curate the results. Experimentally assayed PPIs in Arabidopsis were retrieved from publicly available data bases, and a giant PPI network was generated by combining orthologs between G. raimondii and Arabidopsis identified by different methods.
A list of data integrated in GraP
| Category | Description | Details |
|---|---|---|
| Genome assembly | JGI version | 1033 chromosome/scaffolds, 37 505 genes and 77 267 transcripts/proteins |
| Gene family | TFs/TRs | 82 TFs/TRs families, 3275 genes, 6406 proteins |
| PKs/PPs | 96 PKs/PPs families, 1956 genes, 4292 proteins | |
| CYP450s | 42 CYP450 families, 373 genes, 490 proteins | |
| UPS | 21 UPS member, 1749 genes, 3943 proteins | |
| Functional annotations | Gene ontology | 1338 accessions (496 for biological progress, 156 for cell component and 687 for molecular function), 20 979 genes, 45 743 proteins |
| KEGG pathways | 2815 pathways, 5580 genes, 9532 proteins | |
| MAPMAN | 2662 accessions, 24452 genes, 52 689 proteins | |
| Conserved domain | InterProScan searches | 2815 IPR ID, 30343 genes, 64 913 proteins |
| Intracellular network | Protein–protein interactions | 12 483 nodes and 103743 edges |
| Gene co-expressions | 20 480 nodes, 1 419 237 positively correlated edges and 1 127 237 negatively correlated edges | |
| microRNAs | Mature microRNAs and their targets | 416 Precursors with 416 mature sequences, 7506 target mRNAs |
| Homology match | Inparanoid: 15 142 ortholog gene pairsLevel 1: 18 696 | |
| 16 973 genes in | ||
| 16 625 genes in | ||
| 13 834 genes in | ||
| Synteny | Blocks | 14 344 co-linear blocks covering 37 223 genes |
Figure 2.An overview of GraP. (A) GraP was developed under LAMP (Linux, Apache, MySQL and Python/Perl) environments and based on integrating analysed results and tools. Several types of queries can be submitted for retrieving relevant information. (B) All data sets are organized by offering several web pages, search tools and web services. Meanwhile, extra links with other related public databases are also supplied.
Figure 3.Functional details of Gorai.011G121900, miR393 and miR482 in GraP. This page shows the functional details of Gorai.011G121900 including gene family, targeting microRNAs, conserved protein domains, ortholog matches with other species and expression levels in different tissues.
Figure 4.Functional analysis of protein-encoding genes related to Gorai.011G121900. (A) A PPI network containing Gorai.011G121900 and its connected signaling key elements. (B) A neighbor-joining (NJ) tree of SnRK proteins was performed based on the kinase domains using CLUSTALW and MEGA4 with bootstrap 1000 replicates, only the clades with bootstrap value higher than 50 were shown. (C) Cis-element enrichment analysis of all interactors showed high significance of signal transduction and key transcription factors. (D) GSEA analysis showed that major functions focused on response to abiotic stresses and regulation of related biological processes. (E) Enrichment analysis of KEGG pathways, gene families and PPI clusters.