| Literature DB >> 36122204 |
Renhai Peng1, Yanchao Xu2, Shilin Tian3, Turgay Unver4, Zhen Liu1, Zhongli Zhou2, Xiaoyan Cai2, Kunbo Wang2, Yangyang Wei1, Yuling Liu1, Heng Wang2, Guanjing Hu2,5, Zhongren Zhang3, Corrinne E Grover6, Yuqing Hou2, Yuhong Wang2, Pengtao Li1, Tao Wang1, Quanwei Lu1, Yuanyuan Wang7, Justin L Conover6, Hassan Ghazal8, Qinglian Wang7, Baohong Zhang9, Marc Van Montagu10,11, Yves Van de Peer10,11,12,13, Jonathan F Wendel6, Fang Liu2.
Abstract
Allotetraploid cotton (Gossypium) species represents a model system for the study of plant polyploidy, molecular evolution, and domestication. Here, chromosome-scale genome sequences were obtained and assembled for two recently described wild species of tetraploid cotton, Gossypium ekmanianum [(AD)6, Ge] and Gossypium stephensii [(AD)7, Gs], and one early form of domesticated Gossypium hirsutum, race punctatum [(AD)1, Ghp]. Based on phylogenomic analysis, we provide a dated whole-genome level perspective for the evolution of the tetraploid Gossypium clade and resolved the evolutionary relationships of Gs, Ge, and domesticated G. hirsutum. We describe genomic structural variation that arose during Gossypium evolution and describe its correlates-including phenotypic differentiation, genetic isolation, and genetic convergence-that contributed to cotton biodiversity and cotton domestication. Presence/absence variation is prominent in causing cotton genomic structural variations. A presence/absence variation-derived gene encoding a phosphopeptide-binding protein is implicated in increasing fiber length during cotton domestication. The relatively unimproved Ghp offers the potential for gene discovery related to adaptation to environmental challenges. Expanded gene families enoyl-CoA δ isomerase 3 and RAP2-7 may have contributed to abiotic stress tolerance, possibly by targeting plant hormone-associated biochemical pathways. Our results generate a genomic context for a better understanding of cotton evolution and for agriculture.Entities:
Keywords: adaptive evolution; polyploid dynamics; structure variations; tetraploid cotton
Mesh:
Substances:
Year: 2022 PMID: 36122204 PMCID: PMC9522333 DOI: 10.1073/pnas.2208496119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Features of three tetraploid cotton assemblies
| Genomic features |
|
|
|
|---|---|---|---|
| Assembly | |||
| Genome size (Mb) | 2,341.87 | 2,291.84 | 2,292.48 |
| Scaffold number | 160 | 243 | 277 |
| Scaffold N50 (Mb) | 108.06 | 108.2 | 106.96 |
| Contig size (Mb) | 2,341.51 | 2,291.47 | 2,292.40 |
| Contig number | 3,781 | 3,927 | 1,111 |
| Contig N50 (Mb) | 1.57 | 1.23 | 11.49 |
| Gap number | 3,621 | 3,684 | 834 |
| Gap length (Mb) | 0.36 | 0.37 | 0.08 |
| Pseudochromosomes size (Mb) | 2,337.03 | 2,272.89 | 2,283.07 |
| Annotation | |||
| TE percentage | 64.86 | 63.01 | 64.89 |
| Gene number | 74,178 | 74,970 | 74,520 |
| Genes in pseudochromosomes | 74,038 | 73,324 | 74,283 |
| Complete BUSCOs (%) | 95.50 | 97.10 | 95.40 |
Fig. 1.Genomic feature distribution in the three cotton genomes. The scale unit of each chromosome is 10 M. Track a: Gene density indicated by gene length per megabase. Track b: Gypsy density showed the distribution of Gypsy TE length per megabase across each chromosome. Track c: Copia density showed the distribution of Copia length per megabase across each chromosome. Track d: TE density showed the distribution of DNA transposon length per megabase across each chromosome.
Fig. 2.Phylogenetic analysis of the Gossypium genomes. (A) Maximum-likelihood tree inferred using G. kirkii (Gki) as the outgroup. (B) Distribution of Ks values for orthologous genes among Gossypium genomes. (C) Evolution of the allopolyploid cotton clade, formed following hybridization between an extinct A0 and ancestor of D5.
Fig. 3.Characterization of genomic variations in Gossypium At genome and Dt genome. Genic synteny blocks are connected by gray lines. Translocation blocks and inversion blocks are connected by red and green lines, respectively.
Fig. 4.SVs in tetraploid cotton genomes. (A) A 450-bp fragment SV occurred between Gh-like and Gb-like clades in a synteny block among At10 of all eight tetraploid Gossypium genomes. Coverage of Gh genome by the Illumina reads of eight tetraploid Gossypium genomes (Upper) and gene structure of Ghi_A10G09231 are shown (Lower). The deletion region of Ghi_A10G09231 is outlined in red and marked in gray bar. The yellow bars indicate the coding sequence regions. Evolutionary relationships are shown in the tree to the Left. (B) A 4.5-Mb inversion occurred between Gh-like and Gb-like clades. (C) A 980-kb inversion occurs at both Gb and Gh relative to their wild progenitors.
Fig. 5.Pan-genome analysis for eight tetraploid cotton genomes. (A) Increase in pan-gene families and decrease in core gene families with the addition of tetraploid cotton genomes. (B) Clustering of core and dispensable gene families of tetraploid cotton genomes.
Fig. 6.Abiotic stress adaption of the Ghp. (A) Venn map of DEGs in salt (300 mM NaCl) and drought (17% PEG) treatments at two time periods, respectively. (B) The mature plants of GhP. (C) Significant comparative analysis of the expression level for GhECI3 among different treatments. *P < 0.05. (D) Phenotypes of two treatment groups and a control (water) after salt and drought treatments in VIGS plants. (E) Expression detection of gene GhECI3 in its silenced plants (TRV2::GhECI). (F) Significant comparison analysis of ion leakage and relative leaf water content. ANOVA analysis was performed with the standard t test, with least significant difference used for multiple comparisons. The different letters above the error bars indicate significant differences (P < 0.05) in all combinations.