| Literature DB >> 35968134 |
Yuhua Fu1, Sirong Jiang2,3, Meiling Zou2,3, Jianjia Xiao2,3, Long Yang1, Chunfang Luo1, Ping Rao1, Wenquan Wang3, Zhengui Ou1, Fanzhi Liu1, Zhiqiang Xia2,3.
Abstract
Canna edulis Ker-Gawl and Canna indica L. are species belonging to the Cannaceae family and both have a very high economic value. Here, we aimed to assemble genomes of C. edulis and C. indica at the chromosome level to generate a reference genome for the Cannaceae family. We also comparatively analyzed the genomes of C. edulis and C. indica and examined the molecular mechanisms responsible for the remarkable differences in plant characteristics in C. edulis varieties. Our results indicated that genome-wide duplication events had recently occurred in C. edulis and C. indica. The comparative analysis of the genomes of C. edulis and C. indica revealed that C. edulis exhibited a remarkable level of replication of genes in the starch and sucrose metabolic pathways, especially during sucrose hydrolysis. This finding is consistent with the fact that the starch content of the C. edulis tuber is higher than that of C. indica. Simplified genome re-sequencing revealed the population structure of 241 C. edulis genes, and a genome-wide association study of leaf traits revealed the location of key genes related to leaf color and morphology. These findings extend our understanding of Cannaceade at the molecular level, and provide an effective theoretical basis for further study and utilization of Cannaceae plants.Entities:
Keywords: Cannaceae; genome; germplasm; starch synthesis; whole-genome duplication
Year: 2022 PMID: 35968134 PMCID: PMC9371203 DOI: 10.3389/fpls.2022.955904
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Statistics for Canna edulis and Canna indica genome assembly.
| Assembly and annotation feature |
|
|
|---|---|---|
| Assembly size(bp) | 823,408,298 | 834,891,454 |
| Contig N50 size(bp) | 17,301,877 | 2,100,650 |
| Number of contigs | 398 | 699 |
| LAI complete percentage in assembly | 10.2 | 12.46 |
| BUSCO complete percentage in assembly (%) | 96.87 | 96.73 |
| Predicted protein-coding genes | 30,578 | 32,957 |
Figure 1(A) Genomic feature display. (a) The collinearity between the Canna edulis and Canna indica genome (b) Gene density (c) RNA-seq expression (d–f) Different repeat densities DNA, LINE, and LTR. (B) Chromosome interaction map for the C. edulis genome. (C) Chromosome interaction map for the C. indica genome. (g) The number and length (Mb) of pseudochromosomes are indicated outside of the ring.
Figure 2(A) Analysis of gene families of five species (Canna edulis, Canna indica, Amborella trichopoda, Musa acuminata, and V. vinifera). (B) Frequency distribution of synonymous substitution rates (Ks) between homologous gene pairs in syntenic blocks of C. edulis-C. edulis, C. indica-C. indica, C. edulis-C. indica, M. acuminata-C. edulis, M. acuminata-C. indica, V. vinifera-C. edulis, and V. vinifera-C. indica. (C) Phylogenetic tree of 14 species constructed using single-copy orthologous genes.
Figure 3(A) Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of unique genes in Canna edulis. (B) KEGG analysis of unique genes in C. indica. (C) Comparison of gene copy numbers of C. edulis, C. indica, and M. acuminata in the pathways for starch and sucrose metabolism. (D) The gray area represents the collinearity block of C. edulis-C. indica and M. acuminata-C. indica. The purple line indicates that C. edulis and M.acuminata exhibited significantly expanded gene pairs, as compared to C. indica.
Figure 4(A) Cv-error value under different K values. (B) PCA analysis based on 241 C. edulis samples. (C) The group structure of C. edulis when K = 2 or 3. Different colors represent different subgroups. (D) The neighbor-joining tree of the C. edulis population was constructed based on genetic distances; the blue color represents subgroup1, and the green color represents subgroup2. (E) Genome-wide association study (GWAS) analysis of two traits of leaf shape and color of C. edulis.