| Literature DB >> 29848989 |
Richard Odongo Magwanga1,2, Pu Lu3, Joy Nyangasi Kirungu4, Latyr Diouf5, Qi Dong6, Yangguang Hu7, Xiaoyan Cai8, Yanchao Xu9, Yuqing Hou10, Zhongli Zhou11, Xingxing Wang12, Kunbo Wang13, Fang Liu14.
Abstract
Cotton production is on the decline due to ever-changing environmental conditions. Drought and salinity stress contribute to over 30% of total loss in cotton production, the situation has worsened more due to the narrow genetic base of the cultivated upland cotton. The genetic diversity of upland cotton has been eroded over the years due to intense selection and inbreeding. To break the bottleneck, the wild cotton progenitors offer unique traits which can be introgressed into the cultivated cotton, thereby improving their performance. In this research, we developed a BC₂F₂ population between wild male parent, G. tomentosum as the donor, known for its high tolerance to drought and the elite female parent, G. hirsutum as the recurrent parent, which is high yielding but sensitive to drought stress. The population was genotyped through the genotyping by sequencing (GBS) method, in which 10,888 single-nucleotide polymorphism (SNP) s were generated and used to construct a genetic map. The map spanned 4191.3 cM, with average marker distance of 0.3849 cM. The map size of the two sub genomes had a narrow range, 2149 cM and 2042.3 cM for At and Dt_sub genomes respectively. A total of 66,434 genes were mined, with 32,032 (48.2%) and 34,402 (51.8%) genes being obtained within the At and Dt_sub genomes respectively. Pkinase (PF00069) was found to be the dominant domain, with 1069 genes. Analysis of the main sub family, serine threonine protein kinases through gene ontology (GO), cis element and miRNA targets analysis revealed that most of the genes were involved in various functions aimed at enhancing abiotic stress tolerance. Further analysis of the RNA sequence data and qRT-PCR validation revealed 16 putative genes, which were highly up regulated under drought stress condition, and were found to be targeted by ghr-miR169a and ghr-miR164, previously associated with NAC(NAM, ATAF1/2 and CUC2) and myeloblastosis (MYB), the top rank drought stress tolerance genes. These genes can be exploited further to aid in development of more drought tolerant cotton genotypes.Entities:
Keywords: backcross; cotton; drought; genetic diversity; genotyping by sequencing
Mesh:
Year: 2018 PMID: 29848989 PMCID: PMC6032168 DOI: 10.3390/ijms19061614
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Sample Base Composition and the Error Rate Distribution. (A) The x-axis is the base of the reads base, the y-axis represent the order of bases from 5′ to 3′ on reads; the ordinate is all reads on the test. The percentage of bases A, C, G, T, and N, respectively, and different bases are represented by different colors. (B) The x-axis is the reads base sitting position, which represents the order of bases from 5′ to 3′ on reads; the y-axis are all the reads with average error rate in (%).
Figure 2Indel Detection and Annotation. (A) Indel size distribution, the y-axis is the length distribution of the Indel (a positive value indicates the length of the insertion; a negative value indicates the length of the missing segment), and the x-axis is the number of Indels corresponding to the length; (B) Indel location information statistics.
Figure 3High Density Genetic Map Construction. (A) Genetic map. The x-axis: map size in centiMorgans (cM), y-axis: the linkage groups. (B) Recombination map of 200 BC2F2. Red: G. hirsutum blue: G. tomentosum.
Genetic map characteristics.
| Linkages | Markers (P. map) | Markers (G. map) | G. map Size (cM) | Largest Gap (cM) | Smallest Gap (cM) | <1 cM | 1–5 cM | 5–10 cM | 10–20 cM | >20 cM | Ratio |
|---|---|---|---|---|---|---|---|---|---|---|---|
| LG1_chrA01 | 2306 | 772 | 185.4601 | 19.828 | 0.1706 | 767 | 2 | 0 | 1 | 0 | 0.99 |
| LG2_chrA02 | 2368 | 707 | 133.6326 | 2.1472 | 0.1527 | 705 | 1 | 0 | 0 | 0 | 1 |
| LG3_chrA03 | 221 | 70 | 183.6242 | 60.9522 | 0.7464 | 44 | 19 | 4 | 0 | 2 | 0.63 |
| LG4_chrA04 | 2057 | 803 | 184.7308 | 1.3336 | 0.1704 | 800 | 2 | 0 | 0 | 0 | 1 |
| LG5_chrA05 | 778 | 285 | 180.1844 | 10.5478 | 0.3584 | 247 | 35 | 0 | 2 | 0 | 0.87 |
| LG6_chrA06 | 193 | 88 | 135.7902 | 15.3278 | 0.4373 | 48 | 36 | 2 | 1 | 0 | 0.55 |
| LG7_chrA07 | 1572 | 474 | 146.5589 | 4.5868 | 0.2307 | 467 | 6 | 0 | 0 | 0 | 0.99 |
| LG8_chrA08 | 1664 | 542 | 164.4535 | 13.5821 | 0.1979 | 536 | 4 | 0 | 1 | 0 | 0.99 |
| LG9_chrA09 | 1054 | 230 | 176.1339 | 3.8988 | 0.6234 | 190 | 39 | 0 | 0 | 0 | 0.83 |
| LG10_chrA10 | 1139 | 380 | 184.0143 | 2.227 | 0.3827 | 361 | 18 | 0 | 0 | 0 | 0.95 |
| LG11_chrA11 | 1650 | 480 | 182.8387 | 20.4651 | 0.2612 | 474 | 4 | 0 | 0 | 1 | 0.99 |
| LG12_chrA12 | 353 | 113 | 131.5913 | 8.1154 | 0.6518 | 71 | 40 | 1 | 0 | 0 | 0.63 |
| LG13_chrA13 | 1285 | 354 | 159.996 | 9.8116 | 0.3271 | 350 | 1 | 2 | 0 | 0 | 0.99 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| LG14_chrD01 | 198 | 45 | 151.7771 | 145.8727 | 0.104742 | 43 | 0 | 0 | 0 | 1 | 0.96 |
| LG15_chrD02 | 237 | 85 | 129.8661 | 32.0047 | 0.75454 | 60 | 21 | 2 | 0 | 1 | 0.71 |
| LG16_chrD03 | 161 | 79 | 131.4133 | 17.51484 | 0.861499 | 44 | 31 | 2 | 1 | 0 | 0.56 |
| LG17_chrD04 | 365 | 129 | 151.0376 | 8.6138 | 0.841359 | 98 | 27 | 3 | 0 | 0 | 0.76 |
| LG18_chrD05 | 109 | 70 | 173.9945 | 78.08868 | 0.570595 | 33 | 34 | 0 | 1 | 0 | 0.47 |
| LG19_chrD06 | 2419 | 947 | 158.7206 | 0.81108 | 0.12249 | 499 | 0 | 0 | 0 | 0 | 0.53 |
| LG20_chrD07 | 393 | 164 | 159.8483 | 5.9477 | 0.617165 | 96 | 66 | 1 | 0 | 0 | 0.59 |
| LG21_chrD08 | 1918 | 824 | 178.4512 | 0.85601 | 0.164057 | 823 | 0 | 0 | 0 | 1 | 1 |
| LG22_chrD09 | 852 | 293 | 149.2598 | 101.1146 | 0.131356 | 291 | 0 | 0 | 0 | 1 | 0.99 |
| LG23_chrD10 | 1854 | 705 | 137.6682 | 2.3434 | 0.136199 | 700 | 4 | 0 | 0 | 0 | 0.99 |
| LG24_chrD11 | 691 | 218 | 186.8999 | 2.0093 | 0.714943 | 173 | 44 | 0 | 0 | 0 | 0.79 |
| LG25_chrD12 | 1593 | 558 | 133.5851 | 1.5444 | 0.178899 | 555 | 2 | 0 | 0 | 0 | 0.99 |
| LG26_chrD13 | 1230 | 455 | 199.7287 | 17.6884 | 0.300543 | 446 | 7 | 0 | 1 | 0 | 0.98 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
P. map: physical map; G. map: genetic map; Ratio: number of markers less than (<) 1 cM divided by total number of markers within the chromosome.
Figure 4Reorganization and collinearity analysis. (A) Marker Chain Diagram of chr01, each row and column are markers arranged in map order. Each small square in the upper left part of the graph represents two Markers, left bar: the average marker distance, right bar: the LOD values. (B) Syntenic analysis. The x-axis is the physical distance of each chromosome, and the y-axis is the genetic length of each linkage group, where marker is represented in the form of a scatter. Genome and genetic map collinearity. The more diagonal the mark, the better the collinearity between the genetic map and the genome.
Gene mining within the SNP markers for all the linkage groups based on their physical position in mega base pairs (Mb).
| Chr. | Number of Markers | Cover Length (Mb) | Chr. Length (Mb) | Coverage (%) | Density (marker/Mb) | Genes in Reference Genome/chr | Mined Genes/Chro. | % of Mined Genes/Chro. | Number of Domains |
|---|---|---|---|---|---|---|---|---|---|
| LG1_chrA01 | 2306 | 99.85026 | 99.8847 | 100 | 23.1 | 2162 | 1946 | 90 | 800 |
| LG2_chrA02 | 2368 | 83.27669 | 83.447906 | 99.8 | 28.4 | 1824 | 1692 | 92.8 | 673 |
| LG3_chrA03 | 221 | 100.2122 | 100.26305 | 99.9 | 2.21 | 2187 | 1860 | 85 | 771 |
| LG4_chrA04 | 2057 | 62.76193 | 62.913772 | 99.8 | 32.8 | 1491 | 1210 | 81.2 | 102 |
| LG5_chrA05 | 778 | 91.9822 | 92.047023 | 99.9 | 8.46 | 4026 | 2900 | 72 | 997 |
| LG6_chrA06 | 193 | 102.9532 | 103.17044 | 99.8 | 1.87 | 2119 | 1788 | 84.4 | 757 |
| LG7_chrA07 | 1572 | 78.15538 | 78.251018 | 99.9 | 20.1 | 2369 | 2078 | 87.7 | 831 |
| LG8_chrA08 | 1664 | 103.6089 | 103.62634 | 100 | 16.1 | 2571 | 2233 | 86.9 | 869 |
| LG9_chrA09 | 1054 | 74.86152 | 74.999931 | 99.8 | 14.1 | 2532 | 2168 | 85.6 | 873 |
| LG10_chrA10 | 1139 | 100.6926 | 100.8666 | 99.8 | 11.3 | 2357 | 2176 | 92.3 | 897 |
| LG11_chrA11 | 1650 | 93.30573 | 93.316192 | 100 | 17.7 | 3305 | 2947 | 89.2 | 993 |
| LG12_chrA12 | 353 | 87.39659 | 87.484866 | 99.9 | 4.04 | 2733 | 2498 | 91.4 | 938 |
| LG13_chrA13 | 1285 | 78.02031 | 79.961121 | 97.6 | 16.5 | 2356 | 1829 | 77.6 | 777 |
|
|
|
|
|
|
|
|
|
|
|
| LG14_chrD01 | 198 | 60.91516 | 61.456009 | 99.1 | 3.25 | 2383 | 2063 | 86.6 | 800 |
| LG15_chrD02 | 237 | 67.22226 | 67.284553 | 99.9 | 3.53 | 2448 | 2356 | 96.2 | 845 |
| LG16_chrD03 | 161 | 46.6713 | 46.690656 | 100 | 3.45 | 1860 | 1632 | 87.7 | 681 |
| LG17_chrD04 | 365 | 51.27027 | 51.45413 | 99.6 | 7.12 | 2040 | 1765 | 86.5 | 121 |
| LG18_chrD05 | 109 | 60.2436 | 61.933047 | 97.3 | 1.81 | 3942 | 3610 | 91.6 | 1128 |
| LG19_chrD06 | 2419 | 64.09113 | 64.294643 | 99.7 | 37.7 | 2394 | 2231 | 93.2 | 844 |
| LG20_chrD07 | 393 | 55.29315 | 55.312611 | 100 | 7.11 | 2503 | 2191 | 87.5 | 854 |
| LG21_chrD08 | 1918 | 65.83381 | 65.894135 | 99.9 | 29.1 | 2765 | 2621 | 94.8 | 951 |
| LG22_chrD09 | 852 | 50.90761 | 50.995436 | 99.8 | 16.7 | 2493 | 2401 | 96.3 | 921 |
| LG23_chrD10 | 1854 | 62.78548 | 63.374666 | 99.1 | 29.5 | 2646 | 2378 | 89.9 | 888 |
| LG24_chrD11 | 691 | 65.54361 | 66.087774 | 99.2 | 10.5 | 3539 | 3250 | 91.8 | 1038 |
| LG25_chrD12 | 1593 | 58.94278 | 59.109837 | 99.7 | 27 | 2838 | 2581 | 90.9 | 931 |
| LG26_chrD13 | 1230 | 59.8794 | 60.534298 | 98.9 | 20.5 | 2551 | 2367 | 92.8 | 922 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Chr: chromosome; Mb: mega base pair; Dt: Dt_sub genome; At: At_sub genome.
Figure 5The 26 linkage groups of the tetraploid cotton in which all the genes were mapped. Pink: are the various genes, Black: the flanking GBS markers. The positions are in mega base pairs.
Figure 6Relative expression levels of the transcriptome factors found to be targeted by ghr-miR390a. The axis x: the genes, y: the expression levels.
Cis elements detected for the mined genes of the serine threonine domain.
| Factor or Site Name | Signal Sequence | Function | Number of Genes |
|---|---|---|---|
| MYCATERD1 | CATGTG | water-stress responsiveness | 123 |
| AGCBOXNPGLB | AGCCGCC | stress signal-response factors | 32 |
| ASF1MOTIFCAMV | TGACG | Response to abiotic and biotic stress | 123 |
| AUXREPSIAA4 | KGTCCCAT | Response to abiotic and biotic stress | 16 |
| ATHB1ATCONSENSUS | CAATWATTG | Response to abiotic and biotic stress | 10 |
| ATHB5ATCORE | CAATNATTG | Response to abiotic and biotic stress | 10 |
| AUXRETGA1GMGH3 | TGACGTAA | Response to abiotic and biotic stress | 3 |
| ACGTATERD1 | ACGT | Early responsive to dehydration | 214 |
| CCAATBOX1 | CCAAT | Promoter of heat shock protein | 251 |
| MYBST1 | GGATA | Plant MYB binding site | 233 |
| MYBPZM | CCWACC | Plant MYB binding site | 176 |
| MYBPLANT | MACCWAMC | Plant MYB binding site | 79 |
| GCCCORE | GCCGCC | pathogen-responsive genes | 65 |
| CPBCSPOR | TATTAG | NADPH-protochlorophyllide reductase | 127 |
| CCA1ATLHCB1 | AAMAATCT | myb-related transcription factor | 24 |
| MYB2CONSENSUSAT | YAACKG | MYB recognition site/dehydration-responsive | 236 |
| MYBCOREATCYCB1 | AACGG | Myb core/M-phase-specific expression | 207 |
| MYBGAHV | TAACAAA | Myb binding site | 79 |
| LTRECOREATCOR15 | CCGAC | Low-temperature-responsive element | 171 |
| LTRE1HVBLT49 | CCGAAA | Low-temperature-responsive element | 159 |
| LTREATLTI78 | ACCGACA | Low-temperature-responsive element | 30 |
| INRNTPSADB | YTCANTYY | Light-responsive transcription | 229 |
| EBOXBNNAPA | CANNTG | Light-responsive and tissue-specific activation | 260 |
| GT1CORE | GGTTAA | Light-dependent transcriptional activation | 91 |
| IBOXCORE | GATAA | Light regulation | 248 |
| IBOX | GATAAG | Light regulation | 158 |
| IBOXCORENT | GATAAGR | Light regulation | 135 |
| GLMHVCHORD | RTGASTCAT | Involved in the nitrogen response | 10 |
| GMHDLGMVSPB | CATTAATTAG | Involved in the nitrogen response | 2 |
| TAAAGSTKST1 | TAAAG | Guard cell-specific gene expression | 254 |
| GT1CONSENSUS | GRWAAW | GT-1 binding site in many light-regulated genes | 260 |
| AGATCONSENSUS | TTWCCWWWWNNGGWW | Function in flower development | 3 |
| BOXIIPCCHS | ACGTGGC | Essential for light regulation | 9 |
| CRTDREHVCBF2 | GTCGAC | Regulation of low-temperature responsive genes | 33 |
| CTRMCAMV35S | TCTCTCTCT | Regulation of low-temperature responsive genes | 13 |
| ABRELATERD1 | ACGTG | Early responsive to dehydration | 157 |
| ABREOSRAB21 | ACGTSSSC | Early responsive to dehydration | 3 |
| DRECRTCOREAT | RCCGAC | Drought/high-light/cold responsive | 109 |
| DRE1COREZMRAB17 | ACCGAGA | Drought response | 102 |
| BIHD1OS | TGTCA | Disease resistance responses | 242 |
| MYB1AT | WAACCA | Dehydration-responsive gene | 236 |
| MYB2AT | TAACTG | Dehydration-responsive gene | 120 |
| MYB1LEPR | GTTAGTT | Dehydration-responsive gene | 29 |
| MYB26PS | GTTAGGTT | Dehydration-responsive gene | 10 |
| CBFHV | RYCGAC | Dehydration-responsive element (DRE) | 179 |
| MYCCONSENSUSAT | CANNTG | Dehydration-responsive | 260 |
| MYBCORE | CNGTTR | Dehydration/Water stress | 258 |
| MYCATRD22 | CACATG | Dehydration/Water stress | 123 |
| MYBATRD22 | CTAACCA | Dehydration/Water stress | 19 |
| AGMOTIFNTMYB2 | AGATCCAA | Defense-related gene | 49 |
| CARGATCONSENSUS | CCWWWWWWGG | Component of the vernalization (low-temperature) | 32 |
| TBOXATGAPB | ACTTTG | Chloroplast glyceraldehyde-3-phosphate dehydrogenase(GADPH) | 187 |
| GATABOX | GATA | Chlorophyll a/b binding protein/light regulations | 260 |
| CMSRE1IBSPOA | TGGACGG | Carbohydrate Metabolite Signal Responsive Element 1 | 18 |
| TCA1MOTIF | TCATCTTCTT | Salicylic acid/stress induced | 26 |
| ACGTABREMOTIFA2OSEM | ACGTGKC | ABA-responsive expression | 27 |
| ABREATCONSENSUS | YACGTGGC | ABA-responsive elements (ABREs) | 5 |
| ABRECE1HVA22 | TGCCACCGG | ABA-responsive elements (ABREs) | 3 |
| ABREATRD22 | RYACGTGGYR | ABA-responsive elements (ABREs) | 2 |
| ABREBZMRAB28 | TCCACGTCTC | ABA-responsive elements (ABREs) | 2 |
| ABREDISTBBNNAPA | GCCACTTGTC | ABA-responsive elements (ABREs) | 1 |
| WRKY71OS | TGAC | A transcriptional repressor of the gibberellin signaling pathway | 260 |
| E2F1OSPCNA | GCGGGAAA | Involved in transcriptional activation in actively dividing cells and tissue | 5 |
| DPBFCOREDCDC3 | ACACNNG | Abscisic acid response gene | 202 |
R: G A (puRine); Y: T C (pYrimidine); K: G T (Keto); M: A C (aMino); S: G C (Strong); W: A T (Weak) and N: A G C T (aNy).
Figure 7GO functional analysis of the major classes of genes of the family serine/threonine protein kinases. (A) Biological processes (BP), (B) cellular components (CC) and (C) molecular functions (MF).
Figure 8qRT-PCR validation, phylogenetic tree and structural analysis of the 35 highly up regulated genes as determined through RNA sequestration. (A) The heat map was visualized using Mev.exe program. (shown by log2 values) in control, and in treated samples seven and 14 days after drought treatment. BC–BC2F1 (offspring), Gt–Gossypium tomentosum and Gh–Gossypium hirsutum. Red—up regulated, Green—down regulated and black-no expression. (B) Phylogenetic relationship of the 35 genes, values represent percentage similarity. (C) Gene structure.