| Literature DB >> 30736327 |
Xianyan Zou1,2, Aiying Liu3, Zhen Zhang4, Qun Ge5, Senmiao Fan6, Wankui Gong7, Junwen Li8, Juwu Gong9, Yuzhen Shi10, Baoming Tian11, Yanling Wang12, Ruixian Liu13, Kang Lei14, Qi Zhang15, Xiao Jiang16, Yulong Feng17,18, Shuya Zhang19, Tingting Jia20, Lipeng Zhang21, Youlu Yuan22,23, Haihong Shang24,25.
Abstract
Upland cotton (Gossypium hirsutum) is grown for its elite fiber. Understanding differential gene expression patterns during fiber development will help to identify genes associated with fiber quality. In this study, we used two recombinant inbred lines (RILs) differing in fiber quality derived from an intra-hirsutum population to explore expression profiling differences and identify genes associated with high-quality fiber or specific fiber-development stages using RNA sequencing. Overall, 72/27, 1137/1584, 437/393, 1019/184, and 2555/1479 differentially expressed genes were up-/down-regulated in an elite fiber line (L1) relative to a poor-quality fiber line (L2) at 10, 15, 20, 25, and 30 days post-anthesis, respectively. Three-hundred sixty-three differentially expressed genes (DEGs) between two lines were colocalized in fiber strength (FS) quantitative trait loci (QTL). Short Time-series Expression Miner (STEM) analysis discriminated seven expression profiles; gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotation were performed to identify difference in function between genes unique to L1 and L2. Co-expression network analysis detected five modules highly associated with specific fiber-development stages, especially for high-quality fiber tissues. The hub genes in each module were identified by weighted gene co-expression network analysis. Hub genes encoding actin 1, Rho GTPase-activating protein with PAK-box, TPX2 protein, bHLH transcription factor, and leucine-rich repeat receptor-like protein kinase were identified. Correlation networks revealed considerable interaction among the hub genes, transcription factors, and other genes.Entities:
Keywords: DEGs; Gossypium hirsutum; WGCNA; fiber development; transcriptomic analysis
Mesh:
Substances:
Year: 2019 PMID: 30736327 PMCID: PMC6410125 DOI: 10.3390/genes10020119
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Statistics for transcript levels at each development stage, the numbers of genes and classification were divided by 0.5 < FPKM < 5, 5 < FPKM < 100, and 100 < FPKM in each sample. FPKM: fragments per kilobase of exon per million reads.
Figure 2Vertical of L1 relative to L2 and horizontal comparisons of genes that showed differential expression levels between the two lines at the same developmental stage and between different stages of an individual line. Red numbers represent upregulated genes; black numbers indicate down-regulated genes. The differentially expressed genes were identified by the criteria FPKM > 0.5, |log2(FC)| > 1.
Fragments per kilobase of exon per million reads (FPKMs) and functional categories of genes significantly up-regulated in L1 on 10DPA.
| Gene Name | L1D10 | L2D10 | Description in |
|---|---|---|---|
|
| |||
| Gh_A11G0692 | 6.49 | 0.9 | homeobox-leucine zipper protein 4 |
| Gh_D07G0362 | 8.39 | 1.6 | Integrase-type DNA-binding superfamily protein |
| Gh_D10G0270 | 49.31 | 11.65 | homeobox-3 |
|
| |||
| Gh_A05G2143 | 11.59 | 2 | arginase |
| Gh_A11G2354 | 6.86 | 1.47 | glutamate dehydrogenase 1 |
|
| |||
| Gh_D05G0184 | 19.25 | 2.36 | Cytochrome P450 superfamily protein |
| Gh_A05G0122 | 41.06 | 11.02 | Cytochrome P450 superfamily protein |
|
| |||
| Gh_A03G0507 | 19.78 | 1.02 | Glycosyl hydrolases family 32 protein |
| Gh_A05G3473 | 66.67 | 5.21 | 10-formyltetrahydrofolate synthetase |
| Gh_A05G3997 | 54.47 | 5.12 | 4-coumarate:CoA ligase 2 |
| Gh_A03G1938 | 10.11 | 1.62 | N-acetylglucosamine-1-phosphate uridylyltransferase 2 |
| Gh_A05G0479 | 5.74 | 1.09 | aldehyde dehydrogenase 11A3 |
| Gh_D07G0692 | 13.39 | 2.57 | UDP-glucose 6-dehydrogenase family protein |
| Gh_D02G0761 | 9.2 | 1.95 | Thiamin diphosphate-binding fold (THDP-binding) protein |
| Gh_A10G2327 | 7.37 | 1.75 | ACT domain-containing small subunit of acetolactate synthase protein |
| Gh_A09G2449 | 20.78 | 5.87 | NAD(P)-binding Rossmann-fold superfamily protein |
|
| |||
| Gh_A05G0238 | 6.55 | 0.21 | homolog of CFIM-25 |
| Gh_A03G1654 | 133.17 | 39.45 | poly(A) binding protein 2 |
| Gh_D02G2070 | 108.02 | 33.65 | poly(A) binding protein 2 |
|
| |||
| Gh_D11G1989 | 15.5 | 5.38 | Auxin-responsive GH3 family protein |
| Gh_D09G1585 | 14.32 | 1.17 | PYR1-like 4 |
| Gh_A09G2421 | 11.06 | 1.06 | PYR1-like 4 |
FPKMs and functional categories of genes significantly down-regulated in L1 on 10DPA.
| Gene Name | L1D10 | L2D10 | Description in |
|---|---|---|---|
|
| |||
| Gh_A13G0258 | 1.59 | 12.58 | aspartate aminotransferase |
| Gh_A13G0603 | 1.36 | 7.68 | 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase |
|
| |||
| Gh_A07G0619 | 0.42 | 9.8 | 3-phosphoserine phosphatase |
|
| |||
| Gh_D01G1478 | 1.61 | 6.54 | quinolinate phoshoribosyltransferase |
| Gh_A06G0128 | 5.31 | 48.44 | GA requiring 3 |
|
| |||
| Gh_D13G1883 | 11.97 | 96.74 | calreticulin 1a |
|
| |||
| Gh_A05G0528 | 2.73 | 11.38 | WRKY DNA-binding protein 11 |
| Gh_A13G1728 | 9.91 | 46.36 | Zinc finger C-x8-C-x5-C-x3-H type family protein |
FPKMs and functional categories of genes significantly upregulated in L1 on 20DPA.
| Genes id | L1D20 | L2D20 | Description |
|---|---|---|---|
|
| |||
| Gh_Sca007938G01 | 10.65 | 1.5 | beta-ketoacyl reductase 1 |
| Gh_D01G0186 | 139.55 | 29.59 | acyl-CoA oxidase 3 |
| Gh_A13G2171 | 112.41 | 36.72 | acyl-CoA oxidase 4 |
|
| |||
| Gh_D12G0436 | 13.79 | 1.9 | short-chain dehydrogenase-reductase B |
| Gh_A12G1414 | 56.97 | 16.99 | glutamate decarboxylase 4 |
| Gh_A07G0810 | 110.4 | 46.02 | glyoxylate reductase 1 |
|
| |||
| Gh_A07G0991 | 5.36 | 0.72 | Jojoba acyl CoA reductase-related male sterility protein |
| Gh_D05G1294 | 67.39 | 11.9 | Glucose-methanol-choline (GMC) oxidoreductase family protein |
| Gh_D01G1400 | 172.1 | 49 | Caleosin-related family protein |
|
| |||
| Gh_A01G1563 | 150.26 | 29.07 | 3-ketoacyl-CoA synthase 6 |
| Gh_D01G1810 | 213.82 | 54.5 | 3-ketoacyl-CoA synthase 6 |
| Gh_A09G0749 | 19.87 | 7.06 | 3-ketoacyl-acyl carrier protein synthase I |
| Gh_Sca006141G01 | 23.73 | 8.81 | acetyl-CoA carboxylase carboxyl transferase subunit β |
|
| |||
| Gh_A08G2381 | 635.59 | 71.34 | β-6 tubulin |
| Gh_A11G2095 | 1198.32 | 173.46 | tubulin α-3 |
| Gh_Sca012883G01 | 2274.54 | 359.05 | tubulin β 8 |
| Gh_D08G1960 | 460.89 | 75.32 | β-6 tubulin |
| Gh_D05G1052 | 572.74 | 176.55 | tubulin β 8 |
| Gh_A02G0819 | 357.99 | 136.45 | tubulin α-2 chain |
| Gh_D13G1047 | 259.25 | 106.17 | tubulin α-3 |
|
| |||
| Gh_D12G0607 | 16.71 | 4.38 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein |
| Gh_A07G1171 | 82.82 | 17.27 | basic region/leucine zipper motif 53 |
| Gh_D05G0463 | 37.9 | 13.35 | CCCH-type zinc finger family protein |
| Gh_A12G0561 | 40.16 | 15.17 | B-box zinc finger family protein |
| Gh_D05G2596 | 267.05 | 106.76 | K-box region and MADS-box transcription factor family protein |
| Gh_A12G1244 | 21.19 | 4.2 | MYB-like 102 |
| Gh_D08G0157 | 10.98 | 2.96 | AP2/B3 transcription factor family protein |
| Gh_D02G0043 | 40.69 | 11.26 | WRKY family transcription factor family protein |
FPKMs and functional categories of genes significantly downregulated in L1 on 20DPA.
| Genes Name | L1D20 | L2D20 | Description in |
|---|---|---|---|
|
| |||
| Gh_D06G0096 | 0.12 | 12.6 | cytochrome P450, family 82, subfamily C, polypeptide 4 |
| Gh_D04G0605 | 0.38 | 9.89 | glycerol-3-phosphate acyltransferase 5 |
| Gh_A13G0603 | 1.47 | 10.65 | 3-deoxy-d-arabino-heptulosonate |
| Gh_D05G2554 | 4.26 | 22.71 | acetyl-CoA carboxylase 1 |
|
| |||
| Gh_Sca069862G01 | 18.38 | 189.16 | glutamine synthase clone R1 |
| Gh_D07G1773 | 54.01 | 333.88 | glutamine synthase clone R1 |
| Gh_A01G1586 | 1.4 | 7.56 | nitrate reductase 2 |
| Gh_D01G1872 | 7.83 | 26.87 | nitrate reductase 2 |
|
| |||
| Gh_D10G0473 | 11.3 | 63.26 | 4-coumarate:CoA ligase 1 |
| Gh_D08G1135 | 15.62 | 62.86 | O-methyltransferase 1 |
| Gh_A08G0711 | 32.62 | 105.85 | Peroxidase superfamily protein |
| Gh_A04G1032 | 29.29 | 80.06 | S-adenosyl-L-methionine-dependent methyltransferases |
|
| |||
| Gh_D05G1813 | 9.23 | 43.01 | cytokinin oxidase 5 |
| Gh_A05G0290 | 6.64 | 25.84 | cytokinin oxidase 7 |
| Gh_A05G1631 | 7.6 | 26.27 | cytokinin oxidase 5 |
| Gh_D05G0391 | 12.27 | 39.85 | cytokinin oxidase 7 |
|
| |||
| Gh_A02G1575 | 9.12 | 28.68 | GRAS family transcription factor |
| Gh_D01G0974 | 102.38 | 371.7 | basic leucine-zipper 44 |
| Gh_A11G2875 | 3.42 | 15.35 | myb-like transcription factor family protein |
| Gh_A11G2522 | 3.52 | 19.11 | myb domain protein 120 |
| Gh_A05G3778 | 2.5 | 24.64 | LOB domain-containing protein 13 |
| Gh_D07G1330 | 0.53 | 5.69 | NAC domain transcriptional regulator protein |
| Gh_D06G0254 | 0.52 | 6.37 | GATA-type zinc finger protein with TIFY domain |
| Gh_A10G0516 | 2.06 | 26.94 | myb domain protein 26 |
| Gh_D08G1424 | 0.11 | 6.98 | WRKY DNA-binding protein 3 |
Figure 3(a) Different gene expression profiles in the two recombinant inbred lines (RILs). Each square represents a profile of gene expression trend. The profile ID and gene number in the corresponding profile are shown in the top and bottom left corners, respectively. (b) and (c) Venn diagram showed the different genes set between L1 and L2 in profile 4 and profile 22, respectively. (d) and (e) Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of common genes between L1 and L2 in expression profile 4 and 22. The size of the ball represents the genes number enriched in the pathway; the depth of the color represents the size of the p-value.
Figure 4Gene ontology (GO) enrichment of genes listed in expression profile 4. (a) and (b) show the top 30 terms of GO enrichment for 1112 and 1060 genes unique to L1 and L2, respectively. The main different terms are highlighted in red.
Figure 5GO enrichment of genes listed in expression profile 22. (a) and (b) show the top 30 terms of GO enrichment for 1128 and 748 genes unique to L1 and L2, respectively. The main different terms names are highlighted in red.
Figure 6Weighted gene co-expression network analysis (WGCNA) of differentially expressed genes (DEGs) in L1 and L2 at five time points of fiber development. (a) Hierarchical dendrogram showing co-expression modules identified by WGCNA. Each leaf in the tree represents one gene. The major tree was divided into 13 modules based on calculation of eigengenes; each module is highlighted in a different color. (b) Module–sample relationships. Each row represents a module, and the correlation coefficient and the e-value are shown in each square. Each column corresponds to a sample tissue and replication (R). The number of genes and transcription factors enriched in the module are shown in the left module squares and parentheses, respectively. The modules names in red were highly associated with the high-quality fiber line (L1).
Figure 7Heatmap comparison DEGs associated with fiber developmental stages. (a), (b), (c), and (d) show the DEGs associated to the overrepresented functional categories and transcription factors at 10, 15, 20, and 30 days post-anthesis (DPA), respectively.
Candidate hub genes in 10, 15, 20, and 30 DPA modules.
| Gene Name | KME | Function Description in | |
|---|---|---|---|
| 10 DPA specific darkred module | |||
| Gh_D12G2172 | 0.991 | AT1G12240 | Glycosyl hydrolases family 32 protein |
| Gh_A10G1961 | 0.991 | AT2G37620 | actin 1 |
| Gh_A11G1087 | 0.997 | AT4G03100 | Rho GTPase activating protein with PAK-box [ |
| Gh_D06G0618 | 0.991 | AT4G28950 | RHO-related protein from plants 9 |
| Gh_D05G0591 | 0.992 | AT4G31890 | ARM repeat superfamily protein |
| 15 DPA of L1 specific darkturquoise module | |||
| Gh_A02G1692 | 0.968 | AT3G51430 | Calcium-dependent phosphotriesterase superfamily protein |
| Gh_A07G0225 | 0.983 | AT4G32330 | TPX2 (targeting protein for Xklp2) protein family [ |
| Gh_A12G0226 | 0.971 | AT2G15780 | Cupredoxin superfamily protein |
| Gh_D03G1054 | 0.973 | AT1G01630 | Sec14p-like phosphatidylinositol transfer family protein |
| Gh_D12G0026 | 0.964 | AT3G51895 | sulfate transporter 3;1 |
| 20 DPA of L1 specific violet module | |||
| Gh_A03G0347 | 0.967 | AT2G28790 | Pathogenesis-related thaumatin superfamily protein |
| Gh_A05G1879 | 0.981 | AT4G33580 | beta carbonic anhydrase 5 |
| Gh_A13G1012 | 0.965 | AT4G09510 | cytosolic invertase 2 |
| Gh_D06G1151 | 0.968 | AT4G16130 | arabinose kinase |
| Gh_D10G0295 | 0.974 | AT1G68810 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein 30 [ |
| Gh_D02G0740 | 0.97 | AT1G21460 | Nodulin MtN3 family protein |
| Gh_D07G0144 | 0.969 | AT5G20860 | Plant invertase/pectin methylesterase inhibitor superfamily [ |
| Gh_D08G2725 | 0.973 | AT3G62690 | AtL5 |
| 20&30 DPA of L1 specific pink module | |||
| Gh_A08G1486 | 0.95 | AT5G47120 | BAX inhibitor 1 |
| Gh_A08G2508 | 0.944 | AT4G27000 | RNA-binding (RRM/RBD/RNP motifs) family protein |
| Gh_A11G2726 | 0.961 | AT3G13540 | myb domain protein 5 [ |
| Gh_D03G1843 | 0.958 | AT1G32050 | SCAMP family protein |
| Gh_D09G0347 | 0.962 | AT5G14240 | Thioredoxin superfamily protein |
| 30 DPA of L1 specific darkgrey module | |||
| Gh_A09G1137 | 0.981 | AT5G04160 | UDP-URONIC acid transporter 1 [ |
| Gh_A10G1754 | 0.989 | AT5G06390 | FASCICLIN-like arabinogalactan protein 17 precursor |
| Gh_D01G1697 | 0.982 | AT1G20550 | O-fucosyltransferase family protein |
| Gh_D08G2152 | 0.985 | AT2G33170 | Leucine-rich repeat receptor-like protein kinase family protein [ |
| Gh_D10G1056 | 0.99 | AT5G64030 | S-adenosyl-L-methionine-dependent methyltransferases superfamily protein [ |
Figure 8Correlation networks for the dark red and violet modules. Hub genes are indicated by red circles; green circles indicate transcription factors; yellow circles indicate both a hub gene and a transcription factor.