| Literature DB >> 35438173 |
Yan Zhong1, Yong Chen2, Danjing Zheng2, Jingyi Pang1, Ying Liu1, Shukai Luo2, Shiyuan Meng2, Lei Qian2, Dan Wei3, Seping Dai2, Renchao Zhou1.
Abstract
Cercidoideae, one of the six subfamilies of Leguminosae, contains one genus Cercis with its chromosome number 2n = 14 and all other genera with 2n = 28. An allotetraploid origin hypothesis for the common ancestor of non-Cercis genera in this subfamily has been proposed; however, no chromosome-level genomes from Cercidoideae have been available to test this hypothesis. Here, we conducted a chromosome-level genome assembly of Bauhinia variegata to test this hypothesis. The assembled genome is 326.4 Mb with the scaffold N50 of 22.1 Mb and contains 37,996 protein-coding genes. The Ks distribution between gene pairs in the syntenic regions indicates two whole-genome duplications (WGDs): one is B. variegata-specific, and the other is shared among core eudicots. Although Ks between gene pairs generated by the recent WGD in Bauhinia is greater than that between Bauhinia and Cercis, the WGD was not detected in Cercis, which can be explained by an accelerated evolutionary rate in Bauhinia after divergence from Cercis. Ks distribution and phylogenetic analysis for gene pairs generated by the recent WGD in Bauhinia and their corresponding orthologs in Cercis support the allopolyploidy origin hypothesis of Bauhinia. The genome of B. variegata also provides a genomic resource for dissecting genetic basis of its ornamental traits.Entities:
Keywords: zzm321990 Bauhinia variegatazzm321990 ; allopolyploidization; genome assembly; rapid evolution; whole-genome duplication
Mesh:
Year: 2022 PMID: 35438173 PMCID: PMC9052405 DOI: 10.1093/dnares/dsac012
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.477
Sources of genomic and transcriptomic data of other species included in the study
| Species | Sequence type | Source |
|---|---|---|
|
| Genomic | Phytozome |
|
| Genomic | Phytozome |
|
| Genomic | Phytozome |
|
| Genomic | Phytozome |
|
| Genomic | GigaDB |
|
| Genomic | GigaDB |
|
| Transcriptomic |
|
|
| Transcriptomic |
|
|
| Transcriptomic |
|
|
| Transcriptomic |
|
|
| Transcriptomic |
|
Figure 1Genome size estimation and genome assembly assessment. (A) Genome survey of Bauhinia variegata with GenomeScope. (B) BUSCO assessment of the genome assemblies of five legumes with eudicots_odb10 dataset. (C) BUSCO assessment of the genome assemblies of five legumes with fabales_odb10 dataset.
Statistics of the genome assembly for Bauhinia variegata
| Assembly features | |
|---|---|
| Genome size (bp) | 326,375,084 |
| GC content | 34.95% |
| Scaffolds number | 411 |
| Scaffold N50 (bp) | 22,089,475 |
| Scaffold L50 | 7 |
| Contig N50 (bp) | 4,549,988 |
| Contig L50 | 21 |
| Annotation features | |
| Number of predicted gene models | 37,996 |
| Mean of exon number per gene | 5.4 |
| Mean of exon length (bp) | 297.5 |
| Mean of intron length (bp) | 382.5 |
| Repeat content (% of the genome assembly) | 27.22% |
| Functional annotation | |
| Total number of annotated genes | 35,659 |
| Number of genes annotated by InterProScan | 35,189 |
| Number of genes annotated by Eggnog | 34,601 |
| Number of genes annotated by Pannzer2 | 29,589 |
| Number of genes annotated by Mercator4 | 26,311 |
N50: sequence length of the shortest contig/scaffold at 50% of the total genome length.
L50: the smallest number of contigs/scaffolds whose length sum makes up half of genome size.
Figure 2Intra-genomic synteny analysis and other genomic features of Bauhinia variegata. Tracks from outside to inside show 14 pseudo-chromosomes (a), gene density (b), transposable elements (TE) density (c), GC content (d), the density of genes located in syntenic regions (e) and intragenomic synteny (f).
Figure 3Phylogenomic analysis and gene family analysis. (A) Phylogenetic tree of 10 legumes and an outgroup based on concatenated sequences of 129 single-copy genes, with the numbers of expanded (left) and contracted (right) gene families shown on each branch. (B) Venn diagram showing the shared and unique gene families among five legumes. (C) KEGG pathway enrichment analysis for significantly expanded gene families in Bauhinia variegata. Each row represents an enriched pathway, and the length of the bar represents the enrichment ratio, which is calculated as ‘input gene number’/‘background gene number’. Different clusters are shown in different colours for the bar.
Figure 4Identification of whole genome duplication (WGD) in Bauhinia variegata. (A) The histogram of synonymous substitution rate (Ks) between gene pairs on syntenic blocks in the genome of B. variegata. (B) The frequency density distribution of synonymous substitution rate (Ks) between B. variegata and Cercis canadensis. Shown are Ks distribution of gene pairs on syntenic blocks between the two species, and that between each of the WGD-generated duplicated genes in B. variegata and its corresponding ortholog in C. canadensis. (C) Synteny analysis between B. variegata and C. canadensis. Only 11 longest contigs of C. canadensis are shown here.
Figure 5Alternative models for the origin of Bauhinia. (A) Autopolyploidy occurred in the ancestor of Bauhinia after divergence from Cercis. (B) Hybridization between the ancestor of Cercis and an extinct, diverged diploid species and genome doubling produced the allopolyploid ancestor of Bauhinia.