| Literature DB >> 26580566 |
Fengxi Yang1, Genfa Zhu1.
Abstract
Cymbidium ensifolium belongs to the genus Cymbidium of the orchid family. Owing to its spectacular flower morphology, C. ensifolium has considerable ecological and cultural value. However, limited genetic data is available for this non-model plant, and the molecular mechanism underlying floral organ identity is still poorly understood. In this study, we characterize the floral transcriptome of C. ensifolium and present, for the first time, extensive sequence and transcript abundance data of individual floral organs. After sequencing, over 10 Gb clean sequence data were generated and assembled into 111,892 unigenes with an average length of 932.03 base pairs, including 1,227 clusters and 110,665 singletons. Assembled sequences were annotated with gene descriptions, gene ontology, clusters of orthologous group terms, the Kyoto Encyclopedia of Genes and Genomes, and the plant transcription factor database. From these annotations, 131 flowering-associated unigenes, 61 CONSTANS-LIKE (COL) unigenes and 90 floral homeotic genes were identified. In addition, four digital gene expression libraries were constructed for the sepal, petal, labellum and gynostemium, and 1,058 genes corresponding to individual floral organ development were identified. Among them, eight MADS-box genes were further investigated by full-length cDNA sequence analysis and expression validation, which revealed two APETALA1/AGL9-like MADS-box genes preferentially expressed in the sepal and petal, two AGAMOUS-like genes particularly restricted to the gynostemium, and four DEF-like genes distinctively expressed in different floral organs. The spatial expression of these genes varied distinctly in different floral mutant corresponding to different floral morphogenesis, which validated the specialized roles of them in floral patterning and further supported the effectiveness of our in silico analysis. This dataset generated in our study provides new insights into the molecular mechanisms underlying floral patterning of Cymbidium and supports a valuable resource for molecular breeding of the orchid plant.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26580566 PMCID: PMC4651537 DOI: 10.1371/journal.pone.0142434
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of sequencing and de novo assembling of the floral transcriptome in C. ensifolium.
| Summary of | |
|---|---|
| Raw reads | 121,626,996 |
| Clean reads | 109,366,378 |
| Total clean nucleotides | 10,924,593,174 bp |
| Total contigs | 136,553 |
| Total residues | 148,968,762 bp |
| Total unigenes | 111,892 |
| Smallest unigene | 201 bp |
| Largest unigene | 16814 bp |
| Average length | 932.03 bp |
Fig 1The length distribution of assembled unigenes.
The x-axis represents the sequence length in base pairs. The y-axis represents the unigenes number.
Summary of the blast hits against the known protein database.
| Num | Percent | |
|---|---|---|
| Total | 62,139 | 55.53% |
| Nr | 47,285 | 42.26% |
| ORF | 30,091 | 26.89% |
| COG | 43,018 | 38.45% |
| GO | 32,632 | 29.16% |
| KEGG | 21,461 | 19.18% |
Fig 2Characteristics of homology search of the unigenes in Nr database.
Fig 3COG function classification of assembled unigenes.
Fig 4GO classification of unigenes of assembled unigenes.
Summary of the KEGG pathway and their corresponding gene number.
| KEGG Pathways | Sub-pathways of KEGG Pathway | Number of unigenes | Number of reads |
|---|---|---|---|
|
| 1,094 | 1,693,012 | |
| Transport and catabolism | 595 | 1,187,266 | |
| Cell motility | 113 | 123,121 | |
| Cell growth and death | 414 | 417,722 | |
| Cell communication | 161 | 268,922 | |
|
| 2,688 | 4,615,076 | |
| Transcription | 436 | 618,315 | |
| Translation | 1,103 | 2,358,454 | |
| Folding, sorting and degradation | 845 | 1,804,603 | |
| Replication and repair | 484 | 203,113 | |
|
| 823 | 986,367 | |
| Membrane transport | 149 | 96,975 | |
| Signal transduction | 663 | 874,444 | |
| Signaling molecules and interaction | 13 | 14,951 | |
|
| 1,238 | 1,772,131 | |
| Immune system | 334 | 598,690 | |
| Endocrine system | 269 | 512,321 | |
| Circulatory system | 99 | 103,034 | |
| Digestive system | 245 | 154,968 | |
| Excretory system | 140 | 177,267 | |
| Nervous system | 283 | 429,307 | |
| Sensory system | 46 | 33,599 | |
| Development | 46 | 50,614 | |
| Environmental adaptation | 309 | 461,208 | |
|
| 3,776 | 7,328,381 | |
| Carbohydrate metabolism | 907 | 2,176,302 | |
| Energy metabolism | 840 | 2,277,509 | |
| Lipid metabolism | 574 | 1,700,740 | |
| Nucleotide metabolism | 459 | 354,067 | |
| Amino acid metabolism | 644 | 1,498,932 | |
| Metabolism of other amino acids | 250 | 682,150 | |
| Glycan biosynthesis and metabolism | 310 | 152,222 | |
| Metabolism of cofactors and vitamins | 512 | 473,178 | |
| Metabolism of terpenoids and polyketides | 272 | 419,236 | |
| Biosynthesis of other secondary metabolites | 22 | 566,666 | |
| Xenobiotics biodegradation and metabolism | 104 | 398,766 | |
| Total | 21,461 | 37,583,629 | |
Fig 5Predicted transcription factors of Cymbidium ensifolium.
Representatives of putative flowering-time genes in C.ensifolium.
| Gene ID | Homologous gene | Gene Nr-ID | Identidity Nr_top (%) | |
|---|---|---|---|---|
|
| Comp58987 | PHYA | XP_002278610.1 | 68.4 |
| Comp54211 | PHYB | XP_003558068.1 | 71.7 | |
| Comp5851 | XP_004295077.1 | 70.8 | ||
| Comp35281 | PIF3 | XP_002276198.2 | 38.6 | |
| Comp53073 | XP_002276198.2 | 37.4 | ||
| Comp52955 | EIF3 | ABL11477.1 | 38 | |
| Comp50981 | LHY | NP_001131529.2 | 34 | |
| Comp52658 | BAC99516.1 | 76.5 | ||
| Comp57367 | BAC99516.1 | 37.8 | ||
| Comp57910 | CCA1 | XP_004306608.1 | 80.6 | |
| Comp35965 | COP1 | BAA94422.1 | 60.2 | |
| Comp54784 | TOC1 | ADO51646.1 | 62.5 | |
| Comp48883 | GI | AEB35426.1 | 72.4 | |
| Comp56138 | ADP92454.1 | 80.3 | ||
| Comp27950 | CHS | AGJ50587.1 | 83 | |
| Comp48799 | NP_001064831 | 59.3 | ||
| Comp37213 | CO | NP_001047975.1 | 87.2 | |
| Comp51149 | NP_001057441.1 | 86.7 | ||
|
| Comp26556 | FT | AFS17371.1 | 54.5 |
| Comp39786 | ADP89905.1 | 78.3 | ||
| Comp41343 | ADW76861.1 | 98.5 | ||
| Comp44994 | AFU08240.1 | 73.8 | ||
| Comp48314 | ADP89470.1 | 63.8 | ||
| Comp301941 | BAD01612.1 | 86.3 | ||
| Comp36081 | AP2 | AGI62045.1 | 72.8 | |
| CL2711Contig1 | ABU68665.1 | 48.9 | ||
| CL2228Contig1 | AGI62044.1 | 72 | ||
| Comp43524 | AGI62041.1 | 82.1 | ||
| Comp51324 | AGI62047.1 | 80.2 | ||
| Comp26637 | AGK07583.1 | 68.6 | ||
| Comp47321 | AGK07583.1 | 84.8 | ||
| Comp55126 | NP_001236377.1 | 84 | ||
|
| Comp45071 | SOC1 | AFQ31623.1 | 66.4 |
| Comp47662 | AFQ31623.1 | 74.8 | ||
| Comp58027 | AP1 | AFQ31623.1 | 99.6 | |
| Comp51302 | LFY | AGE45851.1 | 97.8 | |
|
| Comp45077 | VRN1 | AEV22381.1 | 58.9 |
| Comp59230 | FLC | ACZ26524.1 | 26.1 | |
| CL346Contig1 | FLD | AAX51267 | 58.7 | |
| Comp58957 | NP_001148070.1 | 50 |
Fig 6Phylogenetic analysis of the CONSTANS-like proteins from different plant species.
Amino acid sequences were aligned by the ClustalW 2.0, and phylogenetic relationships were reconstructed using a maximum-likelihood (ML) method in PHYML software with JTT amino acid substitution model. Bootstrap values for 1,000 replicates were used to assess the robustness of the trees. Previously published plant MADS-box protein sequences were retrieved from GenBank database. (AtCO: NP_197088, AtCOL1: NP_197089, AtCOL2: NP_186887, AtCOL3: Q9SK53, AtCOL4: Q940T9.2, AtCOL5: Q9FHH8, AtCOL6: Q8LG76, AtCOL7: Q9C9A9, AtCOL8: Q9M9B3, AtCOL9: NP_001118599, AtCOL10: Q9LUA9, AtCOL11: O23379, AtCOL12: Q9LJ44, AtCOL13: O82256, AtCOL14: O22800, AtCOL15: Q9C7E8, AtCOL16: Q8RWD0, PhalCOL: FJ469986, CsCO: GU168786. EpCOL8: KC836891.1, EpCOL10: KC836893.1)
Fig 7Accumulation of Cymbidium ensifolium COL genes in LD conditions.
Leaves from one-year-old C. ensifolium plants were collected in 4-h intervals for 24 h after the start of light exposure in LD conditions (16-h light /8-h dark).
Summary of DGE sequencing quality and alignment.
| Sepal | Petal | Labellum | Gynostemium | |
|---|---|---|---|---|
|
| 31,063,650(100%) | 31,237,880(100%) | 29,464,502(100%) | 26,291,414(100%) |
|
| 28,314,897(91.15%) | 28,885,091(92.47%) | 26,626,616(90.35%) | 20,985,335(79.82%) |
|
| 3,882,139(12.50%) | 3,057,936(9.79%) | 3,595,240(12.20%) | 5,328,168(20.27%) |
|
| 24,432,758(78.65%) | 25,827,155(82.68%) | 23,031,376(78.15%) | 15,657,167(59.55%) |
|
| 14,166,443(45.60%) | 14,454,262(46.27%) | 13,321,458(45.20%) | 10,499,498(39.94%) |
|
| 14,148,454(45.55%) | 14,430,829(46.20%) | 13,305,158(45.15%) | 10,485,837(39.88%) |
|
| 14,170,934(45.62%) | 14,437,837(46.22%) | 13,337,405(45.26%) | 10,524,193(40.03%) |
|
| 14,143,963(45.53%) | 14,447,254(46.25%) | 13,289,211(45.09%) | 10,461,142(39.79%) |
|
| 12,403,029(39.93%) | 15,231,879(48.76%) | 11,439,220(38.82%) | 2,605,567(9.91%) |
Fig 8Transcripts differentially expressed between different floral organs.
Up- and down-regulated transcripts were quantified. The results of six comparisons between each two samples are shown. 1, sepal; 2, petal; 3, Labellum; 4, gynostemium.
Tissue-specific genes revealed from DGE expression of various flower organs.
|
| |
|
|
|
|
| |
| Comp49718 | NADP-dependent oxidoreductase P2 [ |
|
| O-acyltransferase WSD1-like [ |
|
| endo-beta-1,4-glucanase [ |
|
| O-methyltransferase [ |
|
| Putative phospholipase a2 precursor [ |
|
| |
|
| myosin light chain 2 [ |
|
| Alpha-expansin 8 precursor [ |
|
| |
|
|
|
|
| |
|
| transketolase, chloroplastic-like [ |
|
| 1-D-desoxyxylulose 5-phosphate synthase (DXS) [ |
|
| transketolase [ |
|
| transketolase isoform 1 [ |
|
| sphinganine C(4)-monooxygenase 2-like isoform 1 [ |
|
| |
|
|
|
|
| |
|
| plasma membrane associated protein [ |
|
| leucine-rich repeat transmembrane protein kinase [ |
|
| putative apolipoprotein A-I precursor [ |
|
| 40S ribosomal protein S10 [ |
|
| Cell wall-associated hydrolase [ |
|
| |
|
| 3-hydroxyacyl-CoA dehydrogenase [ |
|
| transketolase [ |
|
| alcohol dehydrogenase [ |
|
| |
|
| NBS-LRR disease resistance protein precursor [ |
|
| ADP-ribosylation factor, arf [ |
|
| |
|
|
|
|
| |
|
| cell wall-associated hydrolase [ |
|
| cell wall-associated hydrolase [ |
|
| cell wall-associated hydrolase [ |
|
| cellulose synthase-like protein D4-like [ |
|
| expansin-A4-like [ |
|
| YABBY domain transcription factor family protein [ |
|
| Protein HIS-71 [ |
|
| UPF0497 membrane protein [ |
|
| major protein body membrane protein MP27 like [ |
|
| ABA-induced plasma membrane protein protein [ |
|
| putative MATE efflux membrane protein [ |
|
| 60S ribosomal protein L30-like isoform 1 [ |
|
| |
|
| polygalacturonase precursor-like [ |
|
| serine acetyltransferase [ |
|
| sesquiterpene synthase [ |
|
| Histone H3.3 [ |
|
| trytophan synthase alpha subunit, putative [ |
|
| cytochrome P450 A [ |
|
| metallothionein-like protein [ |
|
| heavy metal ATPase [ |
|
| ATP carrier protein 1-like [ |
|
| |
|
| stearoyl-ACP desaturase-like protein [ |
|
| apolipoprotein A-I [ |
|
| myo-inositol oxygenase [ |
|
| L-ascorbate oxidase-like protein [ |
|
| seq polyphenol oxidase [ |
|
| ACC oxidase [ |
|
| alcohol dehydrogenase, putative [ |
|
| AMP dependent CoA ligase, putative [ |
|
| cytokinin hydroxylase-like [ |
|
| |
|
| MADS-box transcription factor [ |
|
| MADS-box transcription factor [ |
|
| MADS-box transcription factor [ |
|
| MADS-box transcription factor [ |
|
| R2R3-MYB transcription factor [ |
|
| probable WRKY transcription factor [ |
|
| heat shock 70 kDa protein cognate [ |
|
| |
|
| phytochrome C [ |
|
| ras-related protein RABA1f-like [ |
|
| NBS-LRR disease resistance protein precursor [ |
|
| CBL-interacting protein kinase [ |
|
| ralf-like 22 protein [ |
Fig 9The quantitative RT-PCR analysis of gene expression in sepals, petals, labellums and gynostemiums.
The y-axis indicates fold change in expression among the samples. The Lg (Relative Quantitation) of the genes in the sepals was calibrated as zero. Error bars indicate the standard deviation of the mean (SD) (n = 3). Three replicates were analyzed, with similar results. a, b, and c, d, one way ANOVA with Bonferroni multiple comparison test significant at P<0.05 between two of the individual floral organs sepal, petal, Labellum, gynostemium.
Fig 10Transcription factors differentially expressed among different floral organs.
Up- and down-regulated transcripts were quantified. The results of six comparisons between each two samples are shown. 1, sepal; 2, petal; 3, Labellum; 4, gynostemium.
Fig 11Transcription factors correlated with the development of the gynostemium.
Up-(red) and down-regulated (green) in the gynostemium as opposed to other DGEs were quantified and annotated with the plant transcription factor database.
Fig 12Phylogenetic analysis of the MADS-box genes differentially expressed among different floral organs.
Amino acid sequences were aligned by the ClustalW 2.0, and phylogenetic relationships were reconstructed using a maximum-likelihood (ML) method in PHYML software with JTT amino acid substitution model. Previously published plant MADS-box protein sequences were retrieved from GenBank database (PeMADS2: AAR26628, CeAP3-like1: AFH66788, CeAP3-like2: AFH66787, CgAP3:ADI58460, CeMADS2: ADP00516, CeMADS1: ADP00515, CePI-like: AFH66786, OMADS11: ADJ6724, OMADS6: ADJ67238, OMADS1: ADJ67237, OMADS10: ADJ67240, OMADS2: AIJ29175, OMADS3: AAO45824, OMADS4: AIJ29176, DthyrAG1: AAY86364, DthyrAG2: AAY86365, PhalAG1: BAE80120, PhalAG2: BAE80121, PeMADS2: AY378149, PeMADS3: AY378150, PeMADS4: AY378151, PeMADS5: AY378148).
Fig 13Sequence comparison of the DEF-like genes of C. ensifolium and P. equestris.
Amino acid sequences were aligned by the ClustalW 2.0. The MADS-, I-, K-, and C-domains are indicated on top of the column.
Fig 14Flower morphology and expression levels of the MADS-box genes in Cymbidium ensifolium ‘tianesu’ (WT, A&B), multi-tepal mutant (Mu1, C&D) and non-gynostemium mutant (Mu2, E&F) Se, sepal primordium; Pe, petal primordium; La, Labellum; Gy, gynostemium; Po, pollinia.
The y-axis indicates fold change in expression among the samples. The ubiquitin gene served as the internal control. Error bars indicate the standard deviation of the mean (SD) (n = 3). Three replicates were analyzed, with similar results. a, b, c, and d, one way ANOVA with Bonferroni multiple comparison test significant at P<0.05 between two of the individual floral organs sepal, petal, Labellum, gynostemium.