| Literature DB >> 33193666 |
Minghao Sheng1,2, Jiajie She2, Wenying Xu2, Yan Hong1, Zhen Su2, Xiaodong Zhang1.
Abstract
The herbaceous peony (Paeonia lactiflora Pall.) is a well-known ornamental flowering and pharmaceutical plant found in China. Its high medicinal value has long been recognized by traditional Chinese medicine (as Radix paeoniae Alba and Radix paeoniae Rubra), and it has become economically valued for its oilseed in recent years; like other Paeonia species, it has been identified as a novel resource for the α-linolenic acid used in seed oil production. However, its genome has not yet been sequenced, and little transcriptome data on Paeonia lactiflora are available. To obtain a comprehensive transcriptome for Paeonia lactiflora, RNAs from 10 tissues of the Paeonia lactiflora Pall. cv Shaoyou17C were used for de novo assembly, and 416,062 unigenes were obtained. Using a homology search, it was found that 236,222 (approximately 57%) unigenes had at least one BLAST hit in one or more public data resources. The construction of co-expression networks is a feasible means for improving unigene annotation. Using in-house transcriptome data, we obtained a co-expression network covering 95.13% of the unigenes. Then we integrated co-expression network analyses and lipid-related pathway genes to study lipid metabolism in Paeonia lactiflora cultivars. Finally, we constructed the online database HpeNet (http://bioinformatics.cau.edu.cn/HpeNet) to integrate transcriptome data, gene information, the co-expression network, and so forth. The database can also be searched for gene details, gene functions, orthologous matches, and other data. Our online database may help the research community identify functional genes and perform research on Paeonia lactiflora more conveniently. We hope that de novo transcriptome assembly, combined with co-expression networks, can provide a feasible means to predict the gene function of species that do not have a reference genome.Entities:
Keywords: Paeonia lactiflora Pall.; co-expression network; database; de novo assembly; functional annotation
Year: 2020 PMID: 33193666 PMCID: PMC7641121 DOI: 10.3389/fgene.2020.570138
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Data quality and annotation of unigenes assembled with mRNA-seq. (A) Length distribution of unigenes. (B) Unigenes annotated using the Eukaryotic Orthologous Groups (KOG) tool. All unigenes were divided into 26 groups. (C) Functional annotation of the unigenes based on the KEGG classification. The numbers of unigenes mapped to each pathway group are shown in the bar chart. Colors indicate KEGG categories: blue for metabolism, purple for genetic information processing, green for environmental information processing, red for cellular processes, and cyan for organismal systems. (D) Gene ontology (GO) classification of the unigenes. Each bar represents the number of unigenes mapped to each GO category. Bar color indicates GO category: green for biological process (BP), red for cellular component (CC), and blue for molecular function (MF). (E) Number of the tissues in which each unigene is expressed.
Annotation of unigenes based on multiple database resources.
| Source | Number of unigenes | Percentage (%) | E-value |
| NR | 197488 | 47.46% | 1E–5 |
| NT | 129014 | 31% | 1E–5 |
| Pfam domains | 139121 | 33.43% | 0.01 |
| Swiss-Prot | 138800 | 33.36% | 1E–5 |
| KOG annotation | 45348 | 10.89% | 1E–3 |
| GO annotation entries | 142163 | 34.16% | 1E–10 |
| KEGG pathways | 72433 | 17.40% | 1E–6 |
| Orthologous pairs in | 166486 | 40.01% | 1E–5 |
| Annotated in all databases | 25291 | 6.07% | – |
| Annotated in at least one database | 236222 | 56.77% | – |
| Total unigenes | 416062 | – | – |
Unigenes related to lipid metabolism, based on KEGG classification.
| KEGG Pathway | Gene Number |
| Biosynthesis of unsaturated fatty acids | 481 |
| Linoleic acid metabolism | 161 |
| Alpha-linolenic acid metabolism | 652 |
| Fatty acid biosynthesis | 404 |
| Fatty acid degradation | 612 |
| Fatty acid elongation | 174 |
| Glycerolipid metabolism | 437 |
| Glycerophospholipid metabolism | 665 |
| Sphingolipid metabolism | 284 |
| Steroid biosynthesis | 333 |
| Synthesis and degradation of ketone bodies | 58 |
| Arachidonic acid metabolism | 115 |
| Ether lipid metabolism | 214 |
| Cutin, suberine, and wax biosynthesis | 290 |
FIGURE 2Database architecture. The HpeNet database is divided into three main sections: network, annotations, and analytical tools. Lines with different colors indicate different pieces.
FIGURE 3Comprehensive explorations for the function of LHCA1 (Cluster-55448.207014 and Cluster-55448.124893). (A) and (B) Individual gene detail pages for Cluster-55448.207014 and Cluster-55448.124893, respectively. (C) Co-expression network for the LHCA1 genes (Cluster-55448.207014 and Cluster-55448.124893) in Paeonia lactiflora. The other light-harvesting complex genes of photosystem are highlighted with a dark green circle. (D) GSEA enrichment analyses of all genes from the co-expression network for LHCA1. (E) Heatmap of all genes from the co-expression network expressed in different samples.
FIGURE 4Expression view of FAD2 network in different tissues. Query gene and co-expressed genes highlighted with yellow and green circles, respectively. Gray circles represent genes without an expression profile in that tissue.
FIGURE 5Co-expression network analyses of Paeonia lactiflora fatty acid desaturase genes. (A) Co-expression network of FAD2 (Cluster-55448.309693) and FAD6 (Cluster-55448.32905 and Cluster-55448.309691). (B) GSEA enrichment analyses for all genes from (A). The results shown list the category, description, and FDR value.