| Literature DB >> 31695717 |
Lingling Da1, Yue Liu1, Jiaotong Yang1, Tian Tian1, Jiajie She1, Xuelian Ma1, Wenying Xu1, Zhen Su1.
Abstract
As an economically important crop, apple is one of the most cultivated fruit trees in temperate regions worldwide. Recently, a large number of high-quality transcriptomic and epigenomic datasets for apple were made available to the public, which could be helpful in inferring gene regulatory relationships and thus predicting gene function at the genome level. Through integration of the available apple genomic, transcriptomic, and epigenomic datasets, we constructed co-expression networks, identified functional modules, and predicted chromatin states. A total of 112 RNA-seq datasets were integrated to construct a global network and a conditional network (tissue-preferential network). Furthermore, a total of 1,076 functional modules with closely related gene sets were identified to assess the modularity of biological networks and further subjected to functional enrichment analysis. The results showed that the function of many modules was related to development, secondary metabolism, hormone response, and transcriptional regulation. Transcriptional regulation is closely related to epigenetic marks on chromatin. A total of 20 epigenomic datasets, which included ChIP-seq, DNase-seq, and DNA methylation analysis datasets, were integrated and used to classify chromatin states. Based on the ChromHMM algorithm, the genome was divided into 620,122 fragments, which were classified into 24 states according to the combination of epigenetic marks and enriched-feature regions. Finally, through the collaborative analysis of different omics datasets, the online database AppleMDO (http://bioinformatics.cau.edu.cn/AppleMDO/) was established for cross-referencing and the exploration of possible novel functions of apple genes. In addition, gene annotation information and functional support toolkits were also provided. Our database might be convenient for researchers to develop insights into the function of genes related to important agronomic traits and might serve as a reference for other fruit trees.Entities:
Keywords: Malus domestica; anthocyanin biosynthesis; chromatin state; co-expression network; fruit ripening; functional module
Year: 2019 PMID: 31695717 PMCID: PMC6817610 DOI: 10.3389/fpls.2019.01333
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
RNA-seq data resources.
| Tissue | Sample information | Experiment | Reference |
|---|---|---|---|
| seedling | seedling | SRR768136 | INRA |
| bud | bud break; dormant buds (0/1/2/3/4 months) | SRP099578 | Foundation Edmund Mach |
| flower | mature | SRS1558530 | PMID: 27503335 ( |
| stigmas | open flowers | SRR6308190 | IBMC/i3S |
| styles | open flowers | SRR6308181 | |
| filaments | open flowers | SRR6308188 | |
| anthers | 1-3 days prior to flower opening | SRR6308187 | |
| petals | open flowers | SRR6308191 | |
| pollen | open flowers | SRR6308192 | |
| sepals | open flowers | SRR6308194 | |
| receptacles | open flowers | SRR6308193 | |
| ovaries | open flowers | SRR6308189 | |
| fruit | 1-20 WAFB | SRR3384922 | PMID: 25576355 ( |
| 25/35/60/87 DPA | SRP018878 | INRA | |
| immature/mature | SRP102870 | PMID: 30250279 ( | |
| mature, mock/CreA/PhleoR infected with | SRP150975 | PMID: 30047230 ( | |
| fruit peel | mature | SRP102870 | PMID: 30250279 ( |
| seed | 20 DAPF | SRP048976 | PMID: 25781174 ( |
| shoot apex | 4-6-week-old seedling | SRX765691 | Michigan State University |
| new shoot | SRX765683 | ||
| stem | mature | SRS1558540 | PMID: 27503335 ( |
| cotyledon | mock/pale green lethal seedling | SRP069858 | |
| leaf | plantlets, mock/ASGV-infected | SRP034943 | PMID: 24736405 ( |
| fully developed, 0-14 DPA infected with | SRP018878 | INRA | |
| immature | SRR6308182 | IBMC/i3S | |
| youngest/oldest leaf; mock/infected with | ERP003589 | PMID: 24223809 ( | |
| mature | SRS1206445 | PMID: 27503335 ( | |
| mature | SRP102870 | PMID: 30250279 ( |
Epigenomic data resources.
| Type | Tissue | Sample information | SRA experiment | Reference |
|---|---|---|---|---|
| DNase-seq | leaf | mature | SRX2697891, SRX2697892 | PMID: 30250279 ( |
| fruit flesh | immature | SRX3420379, SRX3420380, | ||
| fruit flesh | mature | SRX2697889, SRX2697890 | ||
| H3K27me3 | leaf | mature | SRX768318 | Michigan State University |
| leaf | mature | SRX2697980, SRX2697981 | PMID: 30250279 ( | |
| fruit flesh | immature | SRX3420335 | ||
| fruit flesh | mature | SRX2697978, SRX2697979 | ||
| shoot apex | new shoot (6-10 leaves) | SRX768312 | Michigan State University | |
| H3K4me3 | leaf | new shoot (6-10 leaves) | SRX768320 | |
| shoot apex | new shoot (6-10 leaves) | SRX768315 | ||
| H3K36me2 | leaf | new shoot (6-10 leaves) | SRX768319 | |
| shoot apex | new shoot (6-10 leaves) | SRX768314 | ||
| Bisulfite-seq | fruit | 3 days after pollination | SRX2511185 | PMID: 28581499 ( |
| fruit | 9 days after pollination | SRX2511186 |
Figure 1General description of AppleMDO functions. (A) Basic information for MdACO1 obtained by using the search function in AppleMDO, including annotation, co-expression network, gene structure, chromatin states (with warm colors representing activation of transcription and cool colors representing inhibition of transcription), gene location, CDSs and proteins, gene family, orthologous genes in 13 species, gene ontology, Kyoto Encyclopedia of Genes and Genomes, Pfam domain, and expression patterns in various tissues and under different stresses. (B) Co-expression network analysis of MdACO1. A global or tissue-preferential co-expression network of MdACO1 was found by searching for MdACO1 (MD10G1328100) in an AppleMDO network analysis, including positive co-expression relationships, negative co-expression relationships, and possible protein–protein interactions. Further network comparison analysis has been implemented between the global network and tissue-preferential network. Moreover, for all genes in the network, we provide a GO enrichment analysis tool to further annotate gene functions and expression profile analysis tools to visualize gene expression levels. (C) Chromatin state analysis of MdACO1. Associated states and epigenetic marks of genes were found by searching for a single gene or a list of genes in gene analysis. Relative fold enrichment for genomic regions and emission parameters of different epigenetic marks in the state were found by searching for the state in state analysis. Furthermore, detailed information on states and the signals of epigenetic markers can be visualized in the genome using the UCSC genome browser.
AppleMDO content.
| Database content | Detailed information | Method | |
|---|---|---|---|
| Network | Global network | 43,862 genes (759,862 edges) | PCC & MR |
| Tissue-preferential network | 42,991 genes (683,265 edges) | PCC & MR | |
| Protein-protein interaction | 7,298 genes (37,406 edges) | InParanoid | |
| Module | Functional module | 9,133 genes (1,075 modules) | CFinder |
| Chromatin state | Chromatin state | 24 states (620,122 segments) | HMM |
| Gene family | Cytochrome P450 | 346 genes (88 families) | Blast & InterProScan |
| Protein kinase | 1,991 genes (87 families) | iTAK | |
| Ubiquitin | 1,306 genes (20 families) | HMM | |
| Transcription factor/regulator | 2,965 genes (83 families) | iTAK | |
| Carbohydrate-active enzyme | 1,048 genes (94 families) | InParanoid | |
| Epigenetic regulator | 822 genes (113 families) | InParanoid | |
| Annotation | GO annotation | 26,714 genes (65,061 entries) | Blast2GO |
| KEGG annotation | 10,343 genes (2,910 entries) | Orthologue | |
| Pfam domain | 33,445 genes (55,187 domains) | PfamScan | |
| Orthologues in | 18,838 genes (26,028 pairs) | InParanoid | |
| Orthologues in | 19,110 genes (30,789 pairs) | InParanoid | |
| Orthologues in | 23,333 genes (24,252 pairs) | InParanoid | |
| Orthologues in | 21,758 genes (25,038 pairs) | InParanoid | |
| Orthologues in | 20,256 genes (25,606 pairs) | InParanoid | |
| Orthologues in | 19,445 genes (20,055 pairs) | InParanoid | |
| Orthologues in | 19,093 genes (19,558 pairs) | InParanoid | |
| Orthologues in | 18,995 genes (20,019 pairs) | InParanoid | |
| Orthologues in | 14,503 genes (19,912 pairs) | InParanoid | |
| Orthologues in | 22,030 genes (28,549 pairs) | InParanoid | |
| Orthologues in | 20,979 genes (31,248 pairs) | InParanoid | |
| Orthologues in | 19,683 genes (29,722 pairs) | InParanoid | |
| Orthologues in | 16,908 genes (21,785 pairs) | InParanoid | |
Figure 2Global co-expression network analysis of MdACO1, associated with fruit ripening. (A) The global co-expression network of MdACO1 (MD10G1328100). The middle yellow circle is the ACO1 gene, and the surrounding green polygons are genes co-expressed with MdACO1. The red line indicates positive co-expression with MdACO1, and the blue line indicates negative co-expression with MdACO1. (B) Heatmap of all genes in the global co-expression network for MdACO1. The redder the color is, the higher the expression level is, and the bluer the color is, the lower the expression level is. (C) Gene set enrichment analysis of all genes in the global co-expression network for MdACO1 by PlantGSEA (http://structuralbiology.cau.edu.cn/PlantGSEA). (D) Comparison of the top 300 genes in the global positive co-expression network between MdACO1 in apple and SIACO1 in tomato.
Figure 3Global co-expression network analysis of MdCHS, related to anthocyanin biosynthesis. (A) The global co-expression network of MdCHS1 (MD04G1003300), MdCHS2 (MD04G1003000), and MdCHS3 (MD04G1003400). The yellow circles are the three MdCHS genes, and the green polygons are genes co-expressed with MdCHS. The red line indicates positive co-expression with MdCHS, and the blue line indicates negative co-expression with MdCHS. (B) Anthocyanin biosynthesis pathway. The genes marked in red are in the global co-expression network of MdCHS. (C) GO enrichment analysis of all genes in the global co-expression network for MdCHS1, MdCHS2, and MdCHS3 by agriGOv2 (http://systemsbiology.cau.edu.cn/agriGOv2/).
Figure 4Chromatin state analysis of three MdCHS genes. Associated states and epigenetic marks of MdCHS1, MdCHS2, and MdCHS3 in the UCSC genome browser. The red arrow represents the direction of gene transcription.