| Literature DB >> 34107868 |
Wenpan Dong1, Chao Xu2, Yanlei Liu2,3, Jipu Shi4, Wenying Li5, Zhili Suo6.
Abstract
BACKGROUND: Crape myrtles, belonging to the genus Lagerstroemia L., have beautiful paniculate inflorescences and are cultivated as important ornamental tree species for landscaping and gardening. However, the phylogenetic relationships within Lagerstroemia have remained unresolved likely caused by limited sampling and the insufficient number of informative sites used in previous studies.Entities:
Keywords: Chloroplast genome; Divergence time; Lagerstroemia; Phylogeny
Mesh:
Year: 2021 PMID: 34107868 PMCID: PMC8191006 DOI: 10.1186/s12864-021-07769-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of newly sequenced plastomes
| Species | LSC length (bp) | IR length (bp) | SSC length (bp) | Plastome size (bp) | GC content (%) | Gene number | Protein coding genes | tRNA | rRNA |
|---|---|---|---|---|---|---|---|---|---|
| 84,058 | 25,631 | 16,729 | 152,049 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,008 | 25,726 | 16,798 | 152,258 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,025 | 25,625 | 16,919 | 152,194 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,047 | 25,625 | 16,917 | 152,214 | 37.6 % | 112 | 78 | 30 | 4 | |
| 83,920 | 25,625 | 16,904 | 152,074 | 37.6 % | 112 | 78 | 30 | 4 | |
| 83,919 | 25,625 | 16,904 | 152,073 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,000 | 25,716 | 16,793 | 152,225 | 37.7 % | 112 | 78 | 30 | 4 | |
| 84,026 | 25,625 | 16,729 | 152,005 | 37.6 % | 112 | 78 | 30 | 4 | |
| 83,809 | 25,625 | 16,909 | 151,968 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,060 | 25,625 | 16,919 | 152,229 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,058 | 25,625 | 16,919 | 152,227 | 37.6 % | 112 | 78 | 30 | 4 | |
| 83,997 | 25,732 | 16,850 | 152,311 | 37.6 % | 112 | 78 | 30 | 4 | |
| 83,951 | 25,651 | 16,905 | 152,158 | 37.6 % | 112 | 78 | 30 | 4 | |
| 83,982 | 25,726 | 16,800 | 152,234 | 37.7 % | 112 | 78 | 30 | 4 | |
| 84,008 | 25,721 | 16,795 | 152,245 | 37.7 % | 112 | 78 | 30 | 4 | |
| 84,084 | 25,625 | 16,920 | 152,254 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,183 | 25,714 | 16,832 | 152,443 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,188 | 25,804 | 16,833 | 152,629 | 37.6 % | 112 | 78 | 30 | 4 | |
| 84,009 | 25,726 | 16,797 | 152,258 | 37.7 % | 112 | 78 | 30 | 4 | |
| 84,003 | 25,705 | 16,795 | 152,208 | 37.7 % | 112 | 78 | 30 | 4 |
Fig. 1General chloroplast genome map of Lagerstroemia. Specific sizes for the chloroplast genomes of each species are presented in Table 1. Genes drawn outside of the map circle are transcribed clockwise, while those drawn inside are transcribed counterclockwise. Genes belonging to different functional groups are color-coded. The darker gray in the inner circle corresponds to GC, while the lighter gray corresponds to AT content
Fig. 2Sliding window analysis of nucleotide variability (Pi) across 35 complete chloroplast genome sequences of Lagerstroemia
Variability of four hyper-variable markers and the universal chloroplast DNA barcodes (rbcL, matK and trnH-psbA) in Lagerstroemia
| Markers | Length | Variable sites | Information sites | Nucleotide diversity | ||
|---|---|---|---|---|---|---|
| Numbers | % | Numbers | % | |||
| 1,051 | 38 | 3.62 % | 36 | 3.43 % | 0.01953 | |
| 1,229 | 56 | 4.56 % | 51 | 4.15 % | 0.02040 | |
| 947 | 56 | 5.91 % | 53 | 5.60 % | 0.01848 | |
| 827 | 41 | 4.96 % | 36 | 4.35 % | 0.01890 | |
| Combine four variable markers | 4,054 | 191 | 4.71 % | 176 | 4.34 % | 0.01941 |
| 1,428 | 14 | 0.98 % | 14 | 0.98 % | 0.00395 | |
| 1,500 | 24 | 1.60 % | 24 | 1.60 % | 0.00636 | |
| 138 | 17 | 12.32 % | 16 | 11.59 % | 0.06441 | |
| Combine three universial markers | 3,066 | 55 | 1.79 % | 54 | 1.76 % | 0.00770 |
Characteristics of the six different data sets
| Dataset | Composition | Total number of characters (bp) | Variable sites (bp) | Parsimony-informative sites (bp) | Singleton sites (bp) |
|---|---|---|---|---|---|
| 1 | Nucleotide sequences of all 82 genes | 71,424 | 628 | 514 | 114 |
| 2 | Complete chloroplast genome sequences (deletion some sites according to the outgroups ) | 151,431 | 1,910 | 1,694 | 216 |
| 3 | Complete chloroplast genome sequences | 154,185 | 2,029 | 1,821 | 205 |
| 4 | LSC | 85,125 | 1,199 | 1,058 | 141 |
| 5 | IR | 25,998 | 170 | 156 | 14 |
| 6 | SSC | 17,065 | 488 | 434 | 54 |
Fig. 4Molecular phylogeny of Lagerstroemia resulting from ML and BI analyses using whole chloroplast genome sequences (dataset-3). Maximum likelihood bootstrap values (BS) and posterior probabilities (PP) are shown at nodes. Branches with * indicate 100% BS and a PP of 1.0
Prior setting for calibration evidence for different calibration combinations. All values are given in Ma and prior distributions are given as mean and standard deviation (stdev). Normal (N) prior distributions are applied to the secondary calibration. Lognormal (logN) prior distributions are applied to each of the fossil-calibrated nodes and are constrained to be older than the highest bound of the fossil age (offset). Priors labelled ’none’ may be interpreted as uniform, uninformative priors
| Analysis | Prior distributions | Posterior distribution (95% HPD) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Root | Onagraceae Crown | Lythraceae Crown | Lythrum + Peplis | Sonneratia + Trapa | Lagerstroemia + Duabanga | Punica + Pemphis | Lawsonia + Rotala | Lagerstroemia Stem ~N 56.34 (4.78) Ma | Lagerstroemia Crown ~N 31.06 (2.82) Ma | |
| 104.6 Ma | 85.4 Ma | 95.5 Ma | >81 Ma | >63.8 Ma | >56 Ma | >40.4 Ma | >16 Ma | |||
| 1 | N 104.6 (1.0) | N 85.4 (1.0) | N 95.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 60.12 (56.20-66.27) | 31.60 (14.93-49.16) |
| 2 | none | N 85.4 (1.0) | N 95.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 60.02 (56.18-65.88) | 34.03 (16.83-51.40) |
| 3 | N 104.6 (1.0) | none | N 95.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 60.17 (56.22-66.28) | 31.69 (15.61-50.68) |
| 4 | N 104.6 (1.0) | N 85.4 (1.0) | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 60.10 (56.22-66.23) | 31.54 (14.69-50.03) |
| 5 | N 104.6 (1.0) | N 85.4 (1.0) | N 95.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 49.29 (28.47-69.35) | 27.27 (11.18-44.64) |
| 6 | none | none | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 57.19 (35.71-76.81) | 34.46 (17.65-52.61) |
| 7 | none | none | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 60.79 (56.19-68.21) | 35.71 (19.35-51.33) |
| 8 | N 104.6 (1.0) | none | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 49.76 (26.61-71.72) | 27.78 (12.63-46.89) |
| 9 | N 104.6 (1.0) | none | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 60.23 (56.21-66.58) | 33.02 (15.88-49.95) |
| 10 | N 104.6 (1.0) | N 85.4 (1.0) | N 95.5 (1.0) | none | none | LogN 1.5 (1.0) | none | none | 58.64 (56.14-62.53) | 30.23 (13.90-48.40) |
| 11 | N 104.6 (1.0) | N 85.4 (1.0) | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 49.87 (27.83-69.97) | 27.67 (12.09-46.04) |
| 12 | N 104.6 (1.0) | none | N 95.5 (1.0) | LogN 1.5 (1.0) | LogN 1.5 (1.0) | none | LogN 1.5 (1.0) | LogN 1.5 (1.0) | 49.88 (29.13-69.74) | 27.72 (12.80-45.44) |
Fig. 5Maximum clade credibility (MCC) tree of Lythraceae obtained from BEAST analysis. Mean divergence time estimates are shown with 95% highest posterior density (HPD; blue bars). Black circles indicate the eight calibration points
Fig. 3Molecular phylogeny of Lagerstroemia from ML (maximum likelihood) and BI (Bayesian inference) analyses using different data sets. A. Eighty-three coding genes (dataset-1); B. the chloroplast genome sequences (dataset-2). Maximum likelihood bootstrap values (BS) and posterior probabilities (PP) are shown at nodes. Branches with * indicate 100% BS and a PP of 1.0