| Literature DB >> 23950788 |
Abstract
Monocots are one of the most diverse, successful and economically important clades of angiosperms. We attempt to analyse the complete plastid genome sequences of two lilies and their lengths were 152,793bp in Lilium longiflorum (Liliaceae) and 155,510bp in Alstroemeria aurea (Alstroemeriaceae). Phylogenetic analyses were performed for 28 taxa including major lineages of monocots using the sequences of 79 plastid genes for clarifying the phylogenetic relationship of the order Liliales. The sister relationship of Liliales and Asparagales-commelinids was improved with high resolution. Comparative analyses of inter-familial and inter-specific sequence variation were also carried out among three families of Liliaceae, Smilacaceae, and Alstroemeriaceae, and between two Lilium species of L. longflorum and L. superbum. Gene content and order were conserved in the order Liliales except infA loss in Smilax and Alstroemeria. IR boundaries were similar in IRa, however, IRb showed different extension patterns as JLB of Smilax and JSB in Alstroemeria. Ka/Ks ratio was high in matK among the pair-wise comparison of three families and the most variable genes were psaJ, ycf1, rpl32, rpl22, matK, and ccsA among the three families and rps15, rpoA, matK, and ndhF between Lilium.Entities:
Mesh:
Year: 2013 PMID: 23950788 PMCID: PMC3688979 DOI: 10.1371/journal.pone.0068180
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of taxa used for the phylogenetic analysis of major lineage of monocots.
|
|
|
|
|
|
|---|---|---|---|---|
| NC010093 |
|
|
| 153,819 |
| NC007407 |
|
|
| 153,821 |
| NC010109 |
|
|
| 165,955 |
| NC009601 |
|
|
| 152,609 |
| HQ180687-HQ183091 |
|
|
| unknown |
| NC014056 |
|
|
| 146,484 |
| NC007499 |
|
|
| 148,964 |
| DQ069347-DQ069702 |
|
|
| unknown |
| HM536959 |
|
|
| 157,878 |
| HQ180423-HQ183692 |
|
|
| unknown |
| This study |
|
|
| 152,789 |
| This study |
|
|
| 155,506 |
| NC013991 |
|
|
| 158,462 |
| NC015830 |
|
|
| 139,493 |
| NC015831 |
|
|
| 139,467 |
| NC011713 |
|
|
| 136,048 |
| NC009950 |
|
|
| 135,282 |
| NC005973 |
|
|
| 134,494 |
| NC008155 |
|
|
| 134,496 |
| NC015826 |
|
|
| 139,839 |
| NC002762 |
|
|
| 134,545 |
| NC001666 |
|
|
| 140,384 |
| NC013823 |
| Typaceae |
| 161,572 |
| NC005086 |
|
|
| 162,686 |
| NC006050 |
|
|
| 159,930 |
| NC009618 |
|
|
| 163,403 |
| NC004677 |
|
|
| 117,190 |
| NC010654 |
|
|
| 119,726 |
Figure 1Map of the complete plastid genome of and represented as a circular molecule.
List of genes found in chloroplast genome.
| Group of gene | Name of gene | No. | |||
|---|---|---|---|---|---|
|
| Ribosomal RNAs | rrn4.5(x2), rrn5(x2), rrn16(x2), rrn23(x2) | 8 | ||
| Transfer RNAs | trnA-UGC[ | 37 | |||
|
| Photosynthesis | ||||
| Photosystem I | psaA, psaB, psaC, psaI, psaJ | 5 | |||
| Photosystem II | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ | 15 | |||
| Cytochrome | petA, petB[ | 6 | |||
| ATP synthase | atpA, atpB, atpE, atpF[ | 6 | |||
| Rubisco | rbcL | 1 | |||
| NADH dehydrogenease | ndhA[ | 12 | |||
| ATP-dependent protease subunit P | clpP[ | 1 | |||
| Chloroplast envelope membrane protein | cemA | 1 | |||
|
| large units | rpl2[ | 11 | ||
| small units | rps2, rps3, rps 4, rps7(x2), rps8, rps11, rps12[ | 15 | |||
|
| RNA polymerase | rpoA, rpoB, rpoC1[ | 4 | ||
|
| Initiation factor | infA | 1 | ||
| Miscellaneous proteins | accD, ccsA, matK | 3 | |||
| Hypothetical proteins & Conserved reading frame | ycf1(x2), ycf2(x2), ycf3[ | 10 | |||
| Total | 136 | ||||
* (x2): duplicated genes, genes having introns
Figure 2RAxML tree monocot orders using 79 protein-coding genes.
Support values for ML, BI and MP are provided at the nodes. A branch with dotted end indicated a high supporting values BP100 in RAxML / PP1.0 in BI, and BP 100 in MP tree and gene loss (sexangle with number) were described on the branch.
Comparison of the plastid genome sequences of 3 families in Liliales.
| Species(Family) |
|
|
|
|---|---|---|---|
| Total length (bp) | 152,793 | 157,878 | 155,510 |
| LSC (bp) | 82,230 | 84,608 | 84,241 |
| SSC (bp) | 17,523 | 18,536 | 17,867 |
| IRs (bp) | 26,520 | 27,367 | 26,701 |
| % of G+C | 37.02 | 37.26 | 38.05 |
| % of A+T | 62.98 | 62.74 | 61.95 |
| Conserved region compare to | 123,888bp (88.0%) | 124,907 (88.4%) | 127,738bp (87.5%) |
Figure 3Schemes describing the IR junction in three plastid genomes represent of the family Liliaceae, Smilacaceae, and Alstroemeriaceae of Liliales.
The colour of each gene was identical to Figure 1.
Figure 4Ka/Ks ratio of pair-wise comparison among the three families of Liliales according to the function of each gene.
A) genes encoding cytochrome, ATP synthase, and NADH hydrogenease, B) genes encoding Photosynthesis, C) genes encoding ribosomal protein, D) the other genes.
Most variable 10 genes in comparison of vs .
| gene name | total length | no. of variable sites | % of variable site | indel (type) | |
|---|---|---|---|---|---|
|
| 273 | 10 | 3.66 | ||
|
| 114 | 4 | 3.51 | ||
|
| 114 | 3 | 2.63 | ||
|
| 231 | 5 | 2.16 | CTTTTA (del/repeat) | |
|
| 657 | 12 | 1.83 | ||
|
| 303 | 5 | 1.65 | ||
|
| 1008 | 16 | 1.59 | ||
|
| 1539 | 24 | 1.56 | ||
|
| 2223 | 32 | 1.44 | ||
|
| 279 | 4 | 1.43 | ||