| Literature DB >> 26950701 |
Cuihua Gu1,2, Luke R Tembrock2, Nels G Johnson3, Mark P Simmons2, Zhiqiang Wu2.
Abstract
Lagerstroemia (crape myrtle) is an important plant genus used in ornamental horticulture in temperate regions worldwide. As such, numerous hybrids have been developed. However, DNA sequence resources and genome information for Lagerstroemia are limited, hindering evolutionary inferences regarding interspecific relationships. We report the complete plastid genome of Lagerstroemia fauriei. To our knowledge, this is the first reported whole plastid genome within Lythraceae. This genome is 152,440 bp in length with 38% GC content and consists of two single-copy regions separated by a pair of 25,793 bp inverted repeats. The large single copy and the small single copy regions span 83,921 bp and 16,933 bp, respectively. The genome contains 129 genes, including 17 located in each inverted repeat. Phylogenetic analysis of genera sampled from Geraniaceae, Myrtaceae, and Onagraceae corroborated the sister relationship between Lythraceae and Onagraceae. The plastid genomes of L. fauriei and several other Lythraceae species lack the rpl2 intron, which indicating an early loss of this intron within the Lythraceae lineage. The plastid genome of L. fauriei provides a much needed genetic resource for further phylogenetic research in Lagerstroemia and Lythraceae. Highly variable markers were identified for application in phylogenetic, barcoding and conservation genetic applications.Entities:
Mesh:
Year: 2016 PMID: 26950701 PMCID: PMC4780714 DOI: 10.1371/journal.pone.0150752
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Map of the L. fauriei plastid genome.
Genes shown outside the outer circle are transcribed clockwise and genes inside the outer circle are transcribed counterclockwise. Genes in different functional groups are color coded. The shaded area inside the inner circle indicates the GC content, with dark shading indicating percent CG.
Comparison of plastid genome size among seven Myrtales species.
| Region | |||||||
|---|---|---|---|---|---|---|---|
| LSC | |||||||
| Length (bp) | 83,923 | 88,511 | 88,768 | 88,522 | 88,925 | 88,247 | 89,081 |
| GC content (%) | 36 | 37 | 35 | 35 | 35 | 35 | 35 |
| Length percentage (%) | 55 | 54 | 55 | 55 | 56 | 55 | 56 |
| SSC | |||||||
| Length (bp) | 16,933 | 19,000 | 18,772 | 18,672 | 18,468 | 18,544 | 18,508 |
| GC Content (%) | 31 | 35 | 30 | 31 | 31 | 31 | 31 |
| Length Percentage (%) | 11 | 12 | 12 | 12 | 12 | 12 | 12 |
| IR | |||||||
| Length (bp) | 25,792 | 28,772 | 26,392 | 26,409 | 26,378 | 26,385 | 26,392 |
| GC Content (%) | 43 | 43 | 43 | 43 | 43 | 43 | 43 |
| Length Percentage (%) | 34 | 35 | 33 | 33 | 33 | 33 | 33 |
| Total | |||||||
| Length (bp) | 152,440 | 165,055 | 160,326 | 160,012 | 160,149 | 159,561 | 160,373 |
| GC Content (%) | 38 | 39 | 37 | 37 | 37 | 37 | 37 |
Comparison of coding and non-coding region size among seven Myrtales species.
| Region | Species | |||||||
|---|---|---|---|---|---|---|---|---|
| Protein coding | Length (bp) | 68,477 | 70,706 | 68,257 | 68,889 | 68,085 | 68,746 | 68,448 |
| GC content (%) | 45 | 43 | 43 | 43 | 43 | 43 | 43 | |
| Length percentage (%) | 38 | 40 | 37 | 37 | 37 | 37 | 38 | |
| tRNA | length (bp) | 2,373 | 2,303 | 3,184 | 2,199 | 2,270 | 2,387 | 2,310 |
| GC content (%) | 54 | 53 | 49 | 53 | 53 | 52 | 53 | |
| Length percentage (%) | 2 | 1 | 2 | 1 | 1 | 2 | 1 | |
| rRNA | length (bp) | 4,517 | 4,551 | 4,510 | 4,528 | 4,528 | 4,528 | 4,525 |
| GC content (%) | 56 | 55 | 55 | 55 | 55 | 55 | 55 | |
| Length percentage (%) | 3 | 3 | 3 | 3 | 3 | 3 | 3 | |
| Intron | length (bp) | 14,503 | 13,311 | 15,514 | 15,499 | 14,720 | 15,465 | 15,496 |
| GC content (%) | 36 | 38 | 35 | 36 | 36 | 36 | 36 | |
| Length percentage (%) | 10 | 8 | 10 | 10 | 9 | 10 | 10 | |
| Intergenic | length (bp) | 62,570 | 70,706 | 68,861 | 68,897 | 70,546 | 68,435 | 69,594 |
| GC content (%) | 36 | 37 | 35 | 35 | 35 | 35 | 35 | |
| Length percentage (%) | 41 | 43 | 43 | 43 | 44 | 43 | 43 |
List of genes in the L. fauriei plastid genome.
| Gene category | Group of genes | Name of genes |
|---|---|---|
| Self-replication | Transfer RNA genes | |
| Small subunit of ribosome | ||
| Ribosomal RNA genes | ||
| Large subunit of ribosome | ||
| DNA dependent RNA polymerase | ||
| Photosynthesis | Subunits of photosystem I | |
| Subunits of photosystem II | ||
| Subunits of cytochrome | ||
| Subunits of ATP synthase | ||
| ATP-dependent protease subunit p gene | ||
| Large subunit of Rubisco | ||
| Subunits of NADH dehydrogenase | ||
| Other genes | Maturase | |
| Envelop membrane protein | ||
| Subunit of acetyl-CoA-carboxylase | ||
| c-type cytochrome synthesis gene | ||
| Genes of unknown function | Conserved open reading frames |
a: Genes containing introns;
b: Duplicated gene (Genes present in the IR regions).
Fig 2Comparison of junctions between the LSC, SSC, and two IR regions among seven Myrtales species.
ψ means pseudogene; distance in the figure is not to scale.
Fig 3The structure and sequence variation for rpl2 gene with and without intron.
(A) The structural components of rpl2 gene from Arabidopsis thaliana (NC_000932) and the other six Myrtales species. (B) The boundary sequences of the two exons: the dashed lines represent the intron sequences; the sequence from the maple shade is from first exon and from the green shade is from the second intron. (C) Exons borders of rpl2 sequences from genus Lagerstroemia and other species from Lythraceae: the maple and green shades mean the sequence from exon 1 and 2.
Top ten coding regions ordered with respect to their potential phylogenetic signal.
| No. | Region | Length (bp) | Aligned length (bp) | Conserved sites | Pars. Inf. | Pars. Inf.% | RI |
|---|---|---|---|---|---|---|---|
| 1 | 1002 | 1101 | 989 | 77 | 6.99 | 0.96 | |
| 2 | 1500 | 1593 | 1295 | 98 | 6.15 | 0.92 | |
| 3 | 273 | 297 | 253 | 15 | 5.05 | 0.86 | |
| 4 | 471 | 486 | 396 | 24 | 4.94 | 0.82 | |
| 5 | 174 | 306 | 271 | 15 | 4.90 | 0.77 | |
| 6 | 960 | 966 | 827 | 47 | 4.87 | 0.80 | |
| 7 | 2244 | 2349 | 1967 | 113 | 4.81 | 0.82 | |
| 8 | 5613 | 7356 | 5102 | 350 | 4.76 | 0.70 | |
| 9 | 531 | 555 | 496 | 24 | 4.32 | 0.88 | |
| 10 | 96 | 96 | 91 | 4 | 4.17 | 0.75 |
a: Length: refers to sequence length in L.fauriei;
b: Aligned length: refers to the alignment of seven Myrtales species considered in the comparative analysis (see Materials and Methods);
c: Number of parsimony informative sites;
d: Percentage of parsimony informative sites;
e: RI-Ensemble retention index.
Fig 4Phylogenetic tree inferred by Bayesian inference, maximum likelihood, and parsimony using 73 shared protein-coding genes among 10 plastid genomes (1 Lythraceae, 1 Onagraceae, 5 Myrtaceae, 3 Geraniaceae).
Numbers above nodes indicate posterior probability followed by bootstrap values.