| Literature DB >> 29065181 |
Jong-Hwa Kim1, Sung-Il Lee2,3, Bo-Ram Kim1, Ik-Young Choi4, Peter Ryser5, Nam-Soo Kim2,6.
Abstract
Chloroplast (cp) genomes of Lilium amabile, L. callosum, L. lancifolium, and L. philadelphicum were fully sequenced. Using these four novel cp genome sequences and five other previously sequenced cp genomes, features of the cp genomes were characterized in detail among species in the genus Lilium and other related genera in the order Liliales. The lengths and nucleotide composition showed little variation. No structural variation was found among the cp genomes in Liliales. Gene contents were conserved among four newly sequenced cp genome in Lilium species, the only differences being in two pseudogenes. We identified 112 genes in 13 functional categories, 18 of which carried introns that were conserved among the species in Liliales. There were 16-21 SSR loci (>12 bp, >3 repeats) in the cp genomes in Lilium and the genomic locations of these loci were highly variable among the species. Average mutations were 15 SNPs per 1kb and 5 indels per 1kb, respectively, in the cp genomes of the newly sequenced four Lilium species. Phylogenetic classifications revealed some discrepancies between trees based on the cp genomes and previous classifications based on the morphology and geographic distributions.Entities:
Mesh:
Year: 2017 PMID: 29065181 PMCID: PMC5655457 DOI: 10.1371/journal.pone.0186788
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Chloroplast genome length and A+T contents among eight Lilium species.
| species | ||||||||
|---|---|---|---|---|---|---|---|---|
| Total length (bp) | 152,567 | 152,626 | 152,574 | 152,175 | 152,655 | 152,793 | 153,235 | 152,598 |
| LSC (bp) | 82,001 | 82,040 | 82,005 | 81,580 | 82,051 | 82,230 | 82,217 | 82,031 |
| SSC (bp) | 17,582 | 17,604 | 17,585 | 17,521 | 17,620 | 17,523 | 17,038 | 17,487 |
| IRs (bp) | 26,492 | 26,491 | 26,492 | 26,537 | 26,492 | 26,520 | 26,990 | 26,540 |
| % of AT | ||||||||
| LSC | 65.16 | 63.92 | 65.15 | 65.07 | 65.18 | 62.98 | 65.28 | 65.08 |
| SSC | 69.26 | 69.30 | 69.26 | 69.41 | 69.31 | 69.21 | 69.26 | 69.32 |
| IRs | 57.52 | 57.51 | 57.52 | 57.50 | 57.52 | 57.56 | 57.80 | 57.52 |
| Total | 62.97 | 62.99 | 62.97 | 62.93 | 63.00 | 62.98 | 63.01 | 62.94 |
Fig 1Chloroplast genome maps of four Lilium species.
The gene orders of the cp genomes of L. amabile, L. callosum, and L. lancifolium were identical but different from the cp genome of L. philadelphicum by two pseudogenes (red arrows). The former three cp genomes have a pseudogene ndhG in SSC region, but this pseudo gene was absent in L. philadelphicum. The pseudogene cemA in LSC was present in the cp genome in L. philadelphicum, but absent in the former three cp genomes.
Gene products of the cp genomes in L. amabile, L. callosum, L. lancifolium and L. philadelphicum.
| Photosystem I | psaA, B, C, I, J, ycf3 |
| Photosystem II | psbA, B, C, D, E, F, H, I, J, K, L, M, N, T, Z |
| Cytochrome b6/f | petA, B |
| ATP synthase | atpA, B, E, F |
| Rubisco | rbcL |
| NADH oxidoreductase | ndhA |
| Large subunit ribosomal proteins | rpl2 |
| Small subunit ribosomal proteins | rps2, 3, 4, 7 |
| RNA polymerase | rpoA, B, C1 |
| Unknown function protein coding gene | ycf1, 2 |
| Other genes | accD, ccsA, cemA |
| Ribosomal RNAs | rrn16 |
| Transfer RNAs | trnA-UGC |
1) Gene containing a single intron
2) Gene containing two introns
3) Two gene copies in IRs
4) Trans-splicing gene
5) Pseudogene in L. philadelphicum
6) Pseudogene in L. amabile, L. callosum and L. lancifolium
Presence or absence of introns in 18 genes in 13 species in the order Liliales and Allium cepa.
| Genes | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| petB | o | o | o | o | o | o | o | o | o | o | o | o | o | - |
| petD | o | o | o | o | o | o | o | o | o | o | o | o | o | - |
| rpl2 | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| rpl16 | o | o | o | o | o | o | o | o | o | o | o | o | o | - |
| rps16 | o | o | o | o | o | o | o | o | o | o | o | o | o | - |
| atpF | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| rpoC1 | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| ndhA | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| ndhB | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| ycf3 | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| rps12 | o | o | o | o | o | o | o | o | o | o | o | o | o | - |
| clpP | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| trnA-UGC | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| trnG-UCC | o | o | o | o | o | o | o | o | - | - | - | o | o | - |
| trnI-GAU | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| trnK-UUU | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| trnL-UAA | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
| trnV-UAC | o | o | o | o | o | o | o | o | o | o | o | o | o | o |
Note: L am; L. amabile: L. ca; L. callosum: L la; L. lancifolium: L. ha; L. hannai: L. su; L. superbum: L. di; L. distichum: L. fa; L. fargesii: L. ci; L. cirrnum: F. ci; Fritillaria cirrhosa: F. ta; F. taipaiensis: S. ch; Smilax china: A. au; Altroemeria aurea: A. ce; Allium cepa.
Distribution of SSR sequences in the cp genomes of Lilium species.
| SSR type | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| AT | - | - | - | - | - | - | - | 6 | 6 | LSC(trnK-UUU) |
| AT | 6 | 6 | 6 | 7 | 6 | 6 | 6 | 6 | - | LSC(trnS-GCU- trnG-UCC) |
| AT | - | - | - | - | - | 6 | - | - | - | LSC(trnS-GCU- trnG-UCC) |
| AT | - | - | - | - | - | - | 6 | - | 6 | LSC(rpoB-trnC-GCA) |
| AT | - | - | - | - | - | 6 | 6 | 7 | 6 | LSC(trnL-UAA) |
| AT | - | - | - | - | - | 6 | - | - | - | LSC(petB-petD) |
| AT | 10 | 10 | 8 | - | 8 | - | - | - | - | LSC(trnL-UAA) |
| AC | - | - | - | - | - | - | 6 | - | - | LSC(psbK-psbI) |
| AAT | - | - | - | - | 4 | - | - | - | - | LSC(accD-psaI) |
| ATA | 4 | 4 | 4 | - | 4 | 4 | - | 4 | 4 | LSC(petD-rpoA) |
| ATT | - | - | - | - | - | 4 | - | - | - | LSC(ycf1) |
| TTG | - | - | - | - | - | 6 | - | - | - | LSC(matK-rps16) |
| TTG | 4 | 4 | - | - | - | - | - | - | - | LSC(rps16) |
| TAT | 4 | 4 | 4 | 4 | 4 | - | 4 | - | 4 | LSC(trnT-UGU-trnL-UAA) |
| TAT | - | - | - | - | - | - | 4 | 4 | 4 | LSC(petD-rpoA) |
| TTA | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | LSC(trnV-UAC) |
| TTA | - | - | - | 4 | - | - | - | - | - | LSC(psaJ-rpl33) |
| TTA | 4 | 4 | 4 | 4 | 5 | 4 | - | - | - | SSC(rps15-ycf1) |
| GAA | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | LSC(accD-psaI) |
| AAAT | - | - | - | - | - | 3 | - | - | - | LSC(rps16-rtnQ-UUG) |
| AAAT | - | - | - | 4 | - | - | - | - | - | LSC(psbM-trnD-GUC) |
| AAAT | - | - | - | - | - | - | - | 3 | 3 | LSC(psaJ-rpl33) |
| AAAT | 4 | 4 | 4 | 3 | 4 | 3 | 3 | 3 | 4 | SSC(ndhG-ndhI) |
| AATA | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | LSC(rpl22) |
| AATA | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | SSC(ndhD) |
| AATT | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | LSC(atpI-rps2) |
| ATTT | - | - | - | - | - | - | - | - | 3 | LSC(rpoC1) |
| AGAA | - | - | - | - | - | - | - | 3 | 3 | LSC(trnS-GCU-trnR-UCU) |
| AGAA | 3 | 3 | 3 | 3 | 3 | 3 | 3 | - | - | LSC(trnG-UCC-trnR-UCU) |
| TTTA | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | - | LSC(rpoC1) |
| TTTA | 3 | 3 | 3 | 3 | 3 | 3 | - | - | - | LSC(rpoC1) |
| TTTA | - | - | - | 3 | - | - | 4 | 3 | 3 | LSC(psaA-ycf3) |
| TTAT | - | - | - | 3 | - | - | 3 | 3 | 3 | LSC(ycf3) |
| TTAT | - | - | - | 3 | - | - | - | 3 | 3 | LSC(trnT-UGU-trnL-UAA) |
| TTCT | - | - | - | 3 | - | - | 3 | 3 | 3 | LSC(rpl16) |
| TAAT | 3 | 3 | 3 | - | 3 | 3 | - | - | - | LSC(psbM-trnD-GUC) |
| AAATA | - | - | - | - | - | - | - | 3 | - | LSC(atpH-atpI) |
| AATTA | - | - | - | - | - | 3 | - | - | - | LSC(rbcL-accD) |
| AAATA | - | - | - | - | - | - | 3 | 3 | - | LSC(atpH-atpI) |
| TTTAC | 3 | 3 | 3 | - | 3 | - | - | 3 | - | LSC(rpoC1) |
| TTAAG | - | - | - | - | 3 | - | - | - | 3 | LSC(accD-psaI) |
* Intron region
** Coding region
*** Intergenic region
Numbers of SNPs and indels in different regions of cp genomes in L. amabile, L. callosum, L. lancifolium and L. philadelphicum.
| Intergenic | Genic | Total | ||
|---|---|---|---|---|
| 51,365 bp | Exon | Intron | 152,567 bp | |
| 84,026 bp | 17,176 bp | |||
| No. of SNPs | 1,305 | 966 | 1,047 | 2,271 |
| Frequency (%) | 2.54 | 1.15 | 6.1 | 1.49 |
| No. of InDels | 558 | 73 | 116 | 747 |
| Frequency (%) | 1.09 | 0.09 | 0.68 | 0.49 |
| Total | 1,863 | 1,039 | 1,163 | 3,018 |
| Frequency (%) | 3.63 | 1.24 | 6.77 | 1.98 |
Fig 2Numbers of SNPs and indels in 82 genes among nine cp genomes in Lilium species.
Fig 3Sequence identity plots among 13 species in the order Liliales and Allium cepa.
Fig 4Phylogenetic trees based on the whole cp genome sequences (A) and functional genes (B) among 13 species in the order Liliales and Allium cepa. The trees were made using maximum likelihood algorithm and the numbers on the nods designate the bootstrap values.