| Literature DB >> 32736519 |
Wenpan Dong1,2, Chao Xu1, Jun Wen1,3, Shiliang Zhou4,5.
Abstract
BACKGROUND: Chloroplast genome sequence data is very useful in studying/addressing the phylogeny of plants at various taxonomic ranks. However, there are no empirical observations on the patterns, directions, and mutation rates, which are the key topics in chloroplast genome evolution. In this study, we used Calycanthaceae as a model to investigate the evolutionary patterns, directions and rates of both nucleotide substitutions and structural mutations at different taxonomic ranks.Entities:
Keywords: Calycanthaceae; Chloroplast genome; Indels; Structural mutations; Substitution rate
Mesh:
Substances:
Year: 2020 PMID: 32736519 PMCID: PMC7393888 DOI: 10.1186/s12862-020-01661-0
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Phylogenetic relationships five species belonging to two subfamilies, three genera of the Calycanthaceae using two species of Magnoliaceae as outgroups
Fig. 2Structural mutation types in chloroplast genomes. a indel; b repeat; (c) and (d) microsatellite; e inversion
Major features of the chloroplast genomes of five species in Calycanthus, Chimonanthus and Idiospermum
| Genome feature | |||||
|---|---|---|---|---|---|
| GenBank accession numbers | AJ428413 | MH377059 | MH377058 | MH377057 | MH377056 |
| Size (bp) | 153,337 | 153,346 | 153,250 | 153,252 | 154,767 |
| LSC length (bp) | 86,948 | 86,983 | 86,882 | 86,912 | 85,482 |
| IR length (bp) | 23,295 | 23,284 | 23,330 | 23,287 | 24,860 |
| SSC length (bp) | 19,799 | 19,795 | 19,708 | 19,766 | 19,565 |
| Total number of genes, including | 131 | 131 | 131 | 131 | 132 |
| Number of genes in IR | 17 | 17 | 17 | 17 | 18 |
| Number of genes with introns | 18 | 18 | 18 | 18 | 18 |
| GC content of the genome (%) | 39.3 | 39.27 | 39.27 | 39.25 | 39.23 |
| GC content of protein-coding genes, tRNAs and rRNAs (%) | 59.04 | 59.24 | 59.07 | 59.07 | 58.99 |
Fig. 3Percentage of each substitution type in the chloroplast genomes of five species belonging to different subfamilies and genera in Calycanthaceae
Taxonomic and genomic distribution of the biallelic single nucleotide polymorphic loci in the five chloroplast genomes of Calycanthaceae
| Subfamily | Genus | Species | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Genome region | Length (bp) | Value | % | Value | % | % | % | Total | % | ||
| Total substitutions | 130,345 | 1996 | 1.53 | 269 | 0.21 | 283 | 0.22 | 233 | 0.18 | 2781 | 2.13 |
| Coding region | 68,763 | 980 | 1.43 | 118 | 0.17 | 116 | 0.17 | 118 | 0.18 | 1332 | 1.94 |
| Nonsynonymous | / | 489 | 0.71 | 53 | 0.08 | 61 | 0.09 | 64 | 0.09 | 667 | 0.97 |
| Synonymous | / | 491 | 0.71 | 65 | 0.09 | 55 | 0.08 | 54 | 0.08 | 665 | 0.97 |
| dN/dS | / | 1 | / | 0.82 | / | 1.11 | / | 1.19 | / | 1 | / |
| Intron | 14,104 | 185 | 1.31 | 30 | 0.21 | 36 | 0.26 | 24 | 0.17 | 275 | 1.95 |
| Intergenic spacer | 43,139 | 831 | 1.93 | 121 | 0.28 | 131 | 0.3 | 91 | 0.21 | 1174 | 2.72 |
Fig. 4The ratios of Ts/Tv and dN/dS in different functional gene groups
Numbers, directions and locations of indels and repeats in the five chloroplast genomes of Calycanthaceae
| Subfamily | Genus | Species | Total | |||
|---|---|---|---|---|---|---|
| Mutation type | ||||||
| Indel | Total | 143 | 11 | 11 | 13 | 178 |
| Size | 1 ~ 10 bp | 112 | 6 | 5 | 12 | 135 |
| 11 ~ 50 bp | 19 | 2 | 3 | 1 | 25 | |
| 50 ~ 100 bp | 5 | 3 | 2 | 0 | 10 | |
| >100 bp | 7 | 0 | 1 | 0 | 8 | |
| Location | exon | 8 | 1 | 1 | 0 | 10 |
| intron | 23 | 1 | 2 | 1 | 27 | |
| space | 112 | 9 | 8 | 12 | 141 | |
| Direction | insertion | 14 | 1 | 1 | 1 | 17 |
| deletion | 113 | 10 | 10 | 12 | 145 | |
| uncertain | 16 | 0 | 0 | 0 | 16 | |
| Repeat | Total | 89 | 17 | 7 | 3 | 116 |
| Size | 3 bp | 4 | 0 | 0 | 0 | 4 |
| 4 bp | 25 | 3 | 0 | 0 | 28 | |
| 5 bp | 40 | 9 | 2 | 1 | 52 | |
| 6 bp | 15 | 2 | 2 | 2 | 21 | |
| 7 bp | 1 | 0 | 1 | 0 | 2 | |
| 9 bp | 3 | 0 | 0 | 0 | 3 | |
| 10 bp | 1 | 1 | 0 | 0 | 3 | |
| >10 bp | 0 | 2 | 2 | 0 | 4 | |
| Location | exon | 4 | 2 | 0 | 0 | 6 |
| intron | 18 | 2 | 1 | 0 | 21 | |
| space | 67 | 13 | 6 | 3 | 89 | |
| Direction | insertion | 75 | 15 | 6 | 3 | 99 |
| deletion | 11 | 2 | 1 | 0 | 14 | |
| uncertain | 3 | 0 | 0 | 0 | 3 | |
The genomic location, length and taxonomic distributions of 13 inversions. uc: direction uncertain; Cc: C. chinensis; Cf: C. floridus; Chn: Ch.nitens; Chp: Ch. praecox; Ia: I. australiense; Lt: Liriodendron tulipifera; Mk: Magnolia kwangsiensis
| Location | Length | Inversions | Taxonomic rank | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| loop | stem | Cf | |||||||||
| IR | 2 | 14 | no | no | no | yes | yes | no | no | Genus | |
| IR | 92 | 17 | no | no | no | no | yes | yes | no | Species | |
| LSC | 5 | 6 | uc | uc | no | yes | yes | yes | yes | Subfamily | |
| LSC | 16 | 36 | no | no | yes | no | no | no | no | Subfamily | |
| LSC | 7 | 11 | no | no | yes | yes | no | no | yes | Species | |
| LSC | 16 | 17 | no | no | no | yes | yes | yes | yes | Subfamily | |
| LSC | 10 | 9 | no | no | yes | no | no | no | no | Subfamily | |
| LSC | 2 | 7 | no | no | no | yes | yes | yes | yes | Subfamily | |
| LSC | 3 | 12 | no | no | no | yes | yes | yes | yes | Subfamily | |
| LSC | 4 | 7 | no | no | yes | yes | yes | no | no | Genus | |
| SSC | 8 | 11 | no | yes | yes | no | no | no | no | Subfamily | |
| SSC | 4 | 9 | uc | uc | no | yes | no | no | no | Species | |
| SSC | 2 | 8 | no | uc | no | yes | yes | no | yes | Subfamily, Species | |
Associations of nucleotide mutations (N) to structural mutations (S). gdN: the genetic distance of the regions without structural mutations; gdS: the genetic distance of the regions with structural mutations. The distances are based on SNPs. Ia: Idiospermum australiense; Cc: Calycanthus chinensis; Cf: Calycanthus floridus; Chn: Chimonanthus nitens; Chp: Chimonanthus praecox
| Level | Comparison | Whole genome | Noncoding region | Coding region | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | S | N/S | gdS | gdN | gdS/gdN | gdS | gdN | gdS/gdN | gdS | gdN | gdS/gdN | ||
| Subfamily | 2664 | 250 | 10.66 | 0.0317 | 0.0130 | 2.44 | 0.0315 | 0.0145 | 2.17 | 0.0323 | 0.0126 | 2.56 | |
| Subfamily | 2614 | 249 | 10.50 | 0.0312 | 0.0129 | 2.42 | 0.0307 | 0.0137 | 2.24 | 0.0325 | 0.0127 | 2.56 | |
| Subfamily | 2647 | 264 | 10.03 | 0.0313 | 0.0129 | 2.43 | 0.0307 | 0.0145 | 2.12 | 0.0329 | 0.0126 | 2.61 | |
| Subfamily | 2587 | 260 | 9.95 | 0.0307 | 0.0127 | 2.42 | 0.0301 | 0.0137 | 2.20 | 0.0325 | 0.0125 | 2.60 | |
| Genus | 715 | 46 | 15.54 | 0.0076 | 0.0037 | 2.05 | 0.0081 | 0.0059 | 1.37 | 0.0063 | 0.0032 | 1.97 | |
| Genus | 667 | 41 | 16.27 | 0.0072 | 0.0033 | 2.18 | 0.0077 | 0.0048 | 1.60 | 0.0059 | 0.0030 | 1.97 | |
| Genus | 687 | 49 | 14.02 | 0.0075 | 0.0035 | 2.14 | 0.0077 | 0.0049 | 1.57 | 0.0070 | 0.0032 | 2.19 | |
| Genus | 617 | 44 | 14.02 | 0.0069 | 0.0032 | 2.16 | 0.0070 | 0.0040 | 1.75 | 0.0065 | 0.0030 | 2.17 | |
| Species | 383 | 17 | 22.53 | 0.0043 | 0.0018 | 2.39 | 0.0044 | 0.0026 | 1.69 | 0.0040 | 0.0016 | 2.50 | |
| Species | 334 | 16 | 20.88 | 0.0033 | 0.0019 | 1.74 | 0.0035 | 0.0028 | 1.25 | 0.0030 | 0.0017 | 1.76 | |