| Literature DB >> 34784936 |
Wei Hu1,2,3, Changmian Ji4,5,6, Zhe Liang7, Jianqiu Ye8, Wenjun Ou8, Zehong Ding4,5,6, Gang Zhou9, Weiwei Tie4,5,6, Yan Yan4,5,6, Jinghao Yang5,6, Liming Ma9, Xiaoying Yang10, Yunxie Wei11, Zhiqiang Jin5,6, Jianghui Xie5,6, Ming Peng5,6, Wenquan Wang5,6, Anping Guo4,5,6, Biyu Xu5,6, Jianchun Guo5,6, Songbi Chen8, Mingcheng Wang9, Yang Zhou9, Xiaolong Li9, Ruoxi Li12, Xinhui Xiao8, Zhongqing Wan8, Feifei An8, Jie Zhang8, Qingyun Leng8, Yin Li13, Haitao Shi14, Ray Ming15,16, Kaimian Li17.
Abstract
BACKGROUND: Heterozygous genomes are widespread in outcrossing and clonally propagated crops. However, the variation in heterozygosity underlying key agronomic traits and crop domestication remains largely unknown. Cassava is a staple crop in Africa and other tropical regions and has a highly heterozygous genome.Entities:
Keywords: Agronomic traits; Cassava; Heterozygosity; Resequencing; Selection
Mesh:
Substances:
Year: 2021 PMID: 34784936 PMCID: PMC8594203 DOI: 10.1186/s13059-021-02524-7
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Genome-wide variation map and phylogenetic relationship of cassava. a Geographic distribution of resequenced cassava accessions. The area of the circle is proportional to the number of cassava accessions. b SNP density distributions in 10-kb non-overlapping sliding windows of 388 resequenced cassava accessions. c Phylogeny of 388 cassava accessions generated using the neighbor-joining tree method with genome-wide SNPs. d Principal component analysis of 388 cassava accessions using genome-wide SNPs. PC1, first principal component; PC2, second principal component. e,f Phylogenic relationships of wild progenitors and landraces from different continents (e) and countries (f) analyzed using the neighbor-joining tree method with genome-wide SNPs
Fig. 2GWAS identification of candidate genes for variation in heterozygosity associated with key agronomic traits. a–f GWAS identification of Sc10g012040 and Sc10g012050 as candidate genes for SR epidermal type on chromosome 10. a Manhattan plots for SR epidermal type on chromosome 10 using FAST-LMM. Red arrow indicates the significant GWAS peak. b Local manhattan plot (top) and linkage disequilibrium heat map (bottom) surrounding the GWAS signal. Red dashed lines indicate the candidate region. Red dots indicate the core SNPs in the candidate gene. c,d Sc10g012040 and Sc10g012050 gene model, respectively. The dashed red lines represent the position of nonsynonymous SNPs. e Images of rough and smooth types of SR epidermis. f Comparison of SR epidermal types based on the haplotypes in Sc10g012040 (left) and Sc10g012050 (middle), as well as in both Sc10g012040 and Sc10g012050 (right). Hap 1 and hap 3 indicate the homozygous allele combinations, while hap 2 shows the heterozygous allele combinations. The symbol n represents the number of accessions with the same genotype. g–i GWAS identification of Sc03g001750 as a candidate gene for SR epidermal type on chromosome 3. j–m GWAS identification of Sc13g000920 as a candidate gene for SR scar on chromosome 13. n–p GWAS identification of Sc05g013530 as a candidate gene for above-ground weight of cassava plants on chromosome 5. q–s GWAS identification of Sc11g000910 as a candidate gene for SR amylopectin content on chromosome 11. t–v GWAS identification of Sc18g013220 as a candidate gene for cassava resistance to T. cinnabarinus on chromosome 18. For box plots in (p), (s), and (v), the center line represents the median, box limits indicate the upper and lower quartiles, and whiskers denote the range of the data. The significance of difference was derived with two-tailed t-test (*P< 0.05, **P< 0.01). For histograms in (f), (i), and (m), the significance of difference was derived with the chi-square test (*P< 0.05, **P< 0.01)
Fig. 3Selection signatures of heterozygosity in cassava. a–c Comparison of genomic diversity between heterozygous and homozygous regions using the values of π and Tajima’s D in 374 cultivars as well as FST value between 14 wild progenitors and 14 randomly selected cultivars. The significance of difference was derived with two-tailed t-test (*P < 0.05, **P < 0.01). d Selection signals for decrease in heterozygosity and nucleotide diversity from wild cassava progenitors to cultivars. The upper (1.27) and lower (1.86) horizontal blue lines indicate values above the 99% thresholds for selection sweeps in heterozygosity and nucleotide diversity, respectively. Red arrows indicate the positions of growth- and development-associated genes that are differentially expressed in storage roots between wild progenitors and cultivars within selective sweep regions with both decreases in heterozygosity and nucleotide diversity. Annotation of these genes is shown in Additional file 19: Table S19
Fig. 4Selection for decrease in heterozygosity associated with domestication of the large starchy storage root (SR) of cassava. a Morphological differences in SR between wild and cultivated cassavas after 350 d of cultivation. b Differences in starch content of SR between wild progenitors and cultivars after 350 d cultivation. c Differences between wild progenitors and cultivars in the expression of genes identified in selection sweeps (above 99% threshold) and related to growth and development in SR at 350 days after planting. Annotation of these genes is shown in Additional file 19: Table S19. d Selection signal detected in MeTIR1 and MeAHL17. The green rectangle indicates the selection signal region. The dashed red and blue lines represent the position (chr2:11,843,579) of a core SNP in the upstream of MeTIR1 and the position (chr2: 11,671,475) of a core SNP in the upstream of MeAHL17, respectively. e Comparison of MeTIR1 expression levels (FPKM) in SR between wild progenitors and cultivars at 350 days after planting. f MeTIR1 gene model. The dashed red line represents the position of the core SNP in the upstream region of MeTIR1. g Frequency of the SNP variants in MeTIR1 in wild and cultivated cassavas. h Box plots of SR starch content based on the SNPs of CC, CG, and GG. The symbol n represents the number of accessions with the same genotype. i Differences in expression of MeTIR1 alleles in SR at 350 d after planting among CC, CG, and GG containing cultivars. j Comparison of MeAHL17 expression levels in SR at 350 days after planting between wild progenitors and cultivars. k MeAHL17 gene model. The dashed red line represents the position of the core SNP in the promoter region of MeAHL17. l Comparison of activities between A-containing and G-containing MeAHL17 promoter regions (300 bp and 600 bp upstream of transcription start site) in dual luciferase assay. Each sample contains 8 biological replicates. m Frequency of the SNP variants in the MeAHL17 gene in wild and cultivated cassavas. n Box plots of SR weight based on the SNPs of AA, AG, and GG. The symbol n represents the number of accessions with the same genotype. o Differences in expression of MeAHL17 in SR at 350 days after planting among AA, AG, and GG containing cultivars. The data in e and j was detected by RNA-Seq and the data in i and o was examined by qRT-PCR. The symbol n represents the number of accessions. Each accession contains three biological replicates. The data for biological replicates were used to draw the box plots. For e, h, i, j, l, n, and o, the center line represents the median, box limits indicate the upper and lower quartiles, and whiskers denote the range of the data. The significance of difference in this figure was derived with two-tailed t-test (*P < 0.05, **P < 0.01)
Fig. 5Identification of the candidate gene MeAHL17 associated with cassava bacterial blight (CBB). a Manhattan plots for CBB resistance on chromosome 2 using FAST-LMM. b Local manhattan plot (top) and linkage disequilibrium heat map (bottom). Red dashed lines indicate the candidate region. Blue arrow indicates the SNP in the candidate gene. c Comparison of CBB resistance based on the SNPs of AA and AG. The symbol n represents the number of accessions with the same genotype. d Differences in expression of MeAHL17 between CBB-susceptible accessions (carrying AA allele) and CBB-resistant accessions (carrying AG allele) at (0 days post inoculation (dpi)) and 2 dpi of Xam infection, as detected by qRT-PCR (**P < 0.01; two-tailed t-test). The symbol n represents the number of accessions. Each accession contains three biological replicates. The center line represents the median, box limits indicate the upper and lower quartiles, and whiskers denote the range of the data. e Expression of MeAHL17 in cassava leaves transformed with pCAMBIA1304 (vector control, VC1), pCAMBIA1304::MeAHL17 (OE), pTRV (vector control, VC2), or pTRV::MeAHL17 (RNAi) in four cultivars, respectively, as detected by qRT-PCR (*P < 0.05, **P < 0.01; two-tailed t-test). Data are represented as mean ± s.d. (n = 3 biological replicates). f The bacterial number in cassava leaves transformed with pCAMBIA1304 (vector control, VC1), pCAMBIA1304::MeAHL17 (OE), pTRV (vector control, VC2), or pTRV::MeAHL17 (RNAi) in four cultivars at 0 and 6 days post inoculation (*P < 0.05, **P < 0.01; two-tailed t-test). Data are represented as mean ± s.d. (n = 4 biological replicates)
Fig. 6Effects of MeAHL17 and MeTIR1 allele combinations on trait variation. a Tajima’s D distribution surrounding MeAHL17 andMeTIR1 loci. b Linkage disequilibrium heat map surrounding MeAHL17 and MeTIR1 loci. Red lines indicate the candidate region. c Allele distribution of MeAHL17 at position 11,671,475 (Red letters) and MeTIR1 at position 11,843,579 (Black letters) on chromosome 2 in wild and cultivated cassavas. d, e Box plots of SR weight (d) and SR starch content (e) based on indicated allele combinations. The center line represents the median, box limits indicate the upper and lower quartiles, and whiskers denote the range of the data. Pairwise multiple comparisons (median test, adjusted P < 0.05) were used to identify significant differences shown with different letters and no significant differences shown with the same letter. f Comparison of frequency of CBB resistance based on the indicated allele combinations. The black and red letters in d, e, and f indicate the same as in c