| Literature DB >> 31817342 |
Jie Zheng1, Xiangjun Kong1, Bin Li1, Aziz Khan1, Zhiling Li1, Yiding Liu1, Haodong Kang1, Farman Ullah Dawar2, Ruiyang Zhou1.
Abstract
Cytoplasmic male sterility (CMS) is an important agronomic feature and provides an effective tool for heterosis utilization of crops. This study reports the comparative transcriptomic sketches between a novel allohexaploid cotton progeny CMS line LD6A and its maintainer line LD6B using de novo transcriptome sequencing technology at the pollen abortion stage. A total of 128,901 Unigenes were identified, in which 2007 were upregulated and 11,864 were downregulated. The significantly differentially expressed genes (DEGs) in LD6A show a distant and diverse genetic nature due to their distant hybrid hexaploidy progeny. Further analysis revealed that most of the DEGs participated in the tricarboxylic acid (TCA) cycle, oxidative phosphorylation, histone acetyltransferase activity, sepal development, stigma development, cotyledon development and microsporogenesis. A highly differentially expressed toxic protein, Abrin, was identified in the CMS line LD6A, which can catalyze the inactivation of ribosomes and consequently lead to cell death through the mitochondrial pathway in human cells. Twelve DEGs were selected randomly to validate transcriptome data using quantitative reverse-transcribed PCR (qRT-PCR). This study will contribute to new ideas and foundations related to the molecular mechanism of CMS and the innovation of cotton germplasm resources.Entities:
Keywords: allohexaploid; cotton; cytoplasmic male sterility; differentially expressed genes; transcriptome
Mesh:
Substances:
Year: 2019 PMID: 31817342 PMCID: PMC6940886 DOI: 10.3390/ijms20246127
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Overview of the assembled cotton transcriptome. Mb = megabytes.
| Type | LD6B | LD6A | |
|---|---|---|---|
| Total raw reads | 73.88 Mb | 73.88 Mb | |
| Total trinity transcripts | 709,971 | ||
| N50 (transcripts) | 1541 | ||
| Total assembled bases (transcripts) | 680,840,189 | ||
| Total trinity genes | 128,901 | ||
| N50 (genes) | 2036 | ||
| Average length (genes) | 1434 | ||
| Total assembled bases (genes) | 184,861,957 | ||
| GC (%) | 40.66 |
Figure 1Length distribution of the assembled cotton genes and CDS. The horizontal axis shows the size of the Unigenes/CDS and the vertical axis shows the number of Unigenes/CDS.
Figure 2Venn diagram of genes identified to different databases.
Figure 3Gene ontology classification of the assembled cotton genes.
Figure 4Distribution of species aligned by the assembled cotton.
Figure 5Number of transcripts and Unigenes annotated in different samples.
Figure 6Comparison of gene expression levels between LD6A and LD6B according to the DEseq2 method. DEGs = differentially expressed genes.
Figure 7(a) Indicates highly expressed genes exclusively identified in LD6A. (b) Top 10 highly expressed genes identified exclusively in LD6B samples. FPKM = fragments per kilobase of transcript per million fragments mapped.
GO enrichment analysis of DEGs in the LD6A vs. LD6B library.
| Type | ID | Term | Number | Rich Ratio | |
|---|---|---|---|---|---|
| Biological process | GO:0045490 | pectin catabolic process | 89 | 0.028507367 | 1.52 × 10−39 |
| GO:0042545 | cell wall modification | 61 | 0.019538757 | 4.79 × 10−26 | |
| GO:0005975 | carbohydrate metabolic process | 198 | 0.063420884 | 3.23 × 10−16 | |
| GO:0071555 | cell wall organization | 108 | 0.034593209 | 3.49 × 10−10 | |
| GO:0016042 | lipid catabolic process | 57 | 0.018257527 | 3.77 × 10−9 | |
| GO:0051017 | actin filament bundle assembly | 13 | 0.004163997 | 1.60 × 10−8 | |
| Cellular component | GO:0016021 | integral component of membrane | 2742 | 0.637377964 | 2.30 × 10−29 |
| GO:0005576 | extracellular region | 144 | 0.033472803 | 3.17 × 10−23 | |
| GO:0005618 | cell wall | 92 | 0.021385402 | 5.67 × 10−10 | |
| Molecular function | GO:0004857 | enzyme inhibitor activity | 69 | 0.011551984 | 1.53 × 10−28 |
| GO:0030599 | pectinesterase activity | 61 | 0.010212623 | 5.75 × 10−26 | |
| GO:0045330 | aspartyl esterase activity | 61 | 0.010212623 | 5.75 × 10−26 | |
| GO:0004650 | polygalacturonase activity | 54 | 0.009040683 | 1.65 × 10−20 | |
| GO:0030570 | pectate lyase activity | 32 | 0.005357442 | 6.03 × 10−17 | |
| GO:0005096 | GTPase activator activity | 59 | 0.009877783 | 2.81 × 10−14 | |
| GO:0003779 | actin binding | 63 | 0.010547464 | 5.97 × 10−11 | |
| GO:0045735 | nutrient reservoir activity | 21 | 0.003515821 | 7.83 × 10−11 | |
| GO:0004575 | sucrose alpha-glucosidase activity | 14 | 0.002343881 | 1.11 × 10−9 | |
| GO:0015299 | solute: proton antiporter activity | 34 | 0.005692282 | 2.59 × 10−9 | |
| GO:0020037 | heme binding | 132 | 0.022099448 | 7.05 × 10−8 |
Statistical enrichment analysis of Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways (p ≤ 0.05, Q ≤ 0.05).
| Pathway | Pathway ID | Genes with Pathway Annotation | Q Value | ||
|---|---|---|---|---|---|
| DEGs (2934) | All Genes (19,296) | ||||
| Biosynthesis of other secondary metabolites | |||||
| Flavone and flavonol biosynthesis | ko00944 | 8 | 28 | 5.749022 × 10−3 | 2.874511 × 10−2 |
| Stilbenoid, diarylheptanoid and gingerol biosynthesis | ko00945 | 27 | 150 | 2.693941 × 10−3 | 1.454728 × 10−2 |
| Phenylpropanoid biosynthesis | ko00940 | 209 | 1613 | 2.371287 × 10−4 | 1.684862 × 10−3 |
| Carbohydrate metabolism | |||||
| Glycolysis/gluconeogenesis | ko00010 | 119 | 902 | 2.607198 × 10−3 | 1.454728 × 10−2 |
| Inositol phosphate metabolism | ko00562 | 98 | 709 | 1.457824 × 10−3 | 8.945738 × 10−3 |
| Starch and sucrose metabolism | ko00500 | 262 | 2022 | 4.067709 × 10−5 | 3.922434 × 10−4 |
| Galactose metabolism | ko00052 | 134 | 896 | 5.676684 × 10−6 | 8.515026 × 10−5 |
| Amino sugar and nucleotide sugar metabolism | ko00520 | 230 | 1532 | 2.096337 × 10−9 | 7.075137 × 10−8 |
| Pentose and glucuronate interconversions | ko00040 | 224 | 993 | 2.589022 × 10−30 | 3.495180 × 10−28 |
| Digestive system | |||||
| Cholesterol metabolism | ko04979 | 26 | 122 | 2.363962 × 10−4 | 1.684862 × 10−3 |
| Energy metabolism | |||||
| Carbon fixation in photosynthetic organisms | ko00710 | 71 | 485 | 1.431960 × 10−3 | 8.945738 × 10−3 |
| Oxidative phosphorylation | ko00190 | 131 | 930 | 1.187092 × 10−4 | 1.001609 × 10−3 |
| Photosynthesis | ko00195 | 39 | 189 | 1.710027 × 10−5 | 2.098670 × 10−4 |
| Lipid metabolism | |||||
| Glycerophospholipid metabolism | ko00564 | 134 | 992 | 5.903167 × 10−4 | 3.984638 × 10−3 |
| Linoleic acid metabolism | ko00591 | 30 | 143 | 1.108358 × 10−4 | 9.975222 × 10−4 |
| Ether lipid metabolism | ko00565 | 61 | 351 | 3.005129 × 10−5 | 3.120711 × 10−4 |
| Arachidonic acid metabolism | ko00590 | 37 | 178 | 2.341729 × 10−5 | 2.634445 × 10−4 |
| Glycerolipid metabolism | ko00561 | 122 | 814 | 1.313738 × 10−5 | 1.773546 × 10−4 |
| Steroid biosynthesis | ko00100 | 51 | 216 | 1.062244 × 10−8 | 2.390049 × 10−7 |
| Cutin, suberine and wax biosynthesis | ko00073 | 59 | 260 | 3.884301 × 10−9 | 1.048761 × 10−7 |
| Metabolism of other amino acids | |||||
| Cyan amino acid metabolism | ko00460 | 136 | 986 | 2.254576 × 10−4 | 1.684862 × 10−3 |
| Diterpenoid biosynthesis | ko00904 | 36 | 236 | 1.022848 × 10−2 | 4.931589 × 10−2 |
| Monoterpenoid biosynthesis | ko00902 | 24 | 135 | 5.264254 × 10−3 | 2.733363 × 10−2 |
| Signal transduction | |||||
| Phosphatidylinositol signaling system | ko04070 | 104 | 770 | 2.261160 × 10−3 | 1.327203 × 10−2 |
| Transcription | |||||
| RNA polymerase | ko03020 | 166 | 795 | 3.529257 × 10−19 | 1.588166 × 10−17 |
| Transport and catabolism | |||||
| Endocytosis | ko04144 | 285 | 2152 | 3.517584 × 10−6 | 5.935923 × 10−5 |
| Phagosome | ko04145 | 111 | 697 | 2.018673 × 10−6 | 3.893155 × 10−5 |
qRT-PCR confirmation of the expression profiles of selected genes.
| Gene ID | Protein Identity | Fold Change | |
|---|---|---|---|
| RNA-Seq | qRT-PCR | ||
| LOC107915747 | cytochrome P450 83B1-like | 5.16 | 4.40 |
| LOC107892026 | zinc finger protein CONSTANS-LIKE 16-like | 6.06 | 5.83 |
| LOC107903815 | methyltransferase-like protein | 5.21 | 1.5 |
| LOC107893683 | mitochondrial uncoupling protein 3-like | 5.56 | 3.13 |
| LOC107911279 | transcription factor MYB114-like | 5.04 | 5.03 |
| LOC107926337 | probable calcium-binding protein CML49 | 6.69 | 0.19 |
| LOC107905948 | probable calcium-binding protein CML13 | −10.76 | −2.74 |
| LOC107941623 | pollen allergen Che a 1-like | −12.94 | −26.59 |
| LOC107942901 | pollen-specific protein-like At4g18596 | −11.48 | −8.12 |
| LOC107915309 | plasma membrane ATPase 4-like | −10.61 | −1.30 |
| LOC107908343 | V-type proton ATPase subunit G1-like (ATP6V1G1) | −10.45 | −4.71 |
| LOC107903454 | cytochrome P450 76A2-like | −10.59 | −4.71 |
Figure 8The heat map of key candidate genes associated with CMS.
Figure 9The heatmap of key transcription factor family genes related to CMS.
Figure 10Diagrammatic view of the allohexaploid and backcrossed (BC) generations development procedure. CMS = cytoplasmic male sterility.