| Literature DB >> 26132382 |
Wei Zhang1, Yuan Yuan2, Shuo Yang3, Jianjun Huang3, Luqi Huang2.
Abstract
DNA barcoding is a promising species identification method, but it has proved difficult to find a standardized DNA marker in plant. Although the ITS/ITS2 RNA transcript has been proposed as the core barcode for seed plants, it has been criticized for being too conserved in some species to provide enough information or too variable in some species to align it within the different taxa ranks. We selected 30 individuals, representing 16 species and four families, to explore whether ITS2 can successfully resolve species in terms of secondary structure. Secondary structure was predicted using Mfold software and sequence-structure was aligned by MARNA. RNAstat software transformed the secondary structures into 28 symbol code data for maximum parsimony (MP) analysis. The results showed that the ITS2 structures in our samples had a common four-helix folding type with some shared motifs. This conserved structure facilitated the alignment of ambiguous sequences from divergent families. The structure alignment yielded a MP tree, in which most topological relationships were congruent with the tree constructed using nucleotide sequence data. When the data was combined, we obtained a well-resolved and highly supported phylogeny, in which individuals of a same species were clustered together into a monophyletic group. As a result, the different species that are often referred to as the herb "Mu tong" were successfully identified using short fragments of 250 bp ITS2 sequences, together with their secondary structure. Thus our analysis strengthens the potential of ITS2 as a promising DNA barcode because it incorporates valuable secondary structure information that will help improve discrimination between species.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26132382 PMCID: PMC4488503 DOI: 10.1371/journal.pone.0131185
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Samples and their ITS2 Genbank accession numbers included in this study.
| Species | Herb name | Family | Source | Genbank No. |
|---|---|---|---|---|
|
| Mu tong | Lardizabalaceae | Jinggangshan, Jiangxi, China | KR025497 |
|
| Mu tong | Lardizabalaceae | Weihai, Shandong, China | KR025499 |
|
| Mu tong | Lardizabalaceae | Laojun shan, Sichuan, China | KR025493 |
|
| Mu tong | Lardizabalaceae | Genbank | GQ434615 |
|
| Mu tong | Lardizabalaceae | Genbank | GQ434616 |
|
| Mu tong | Lardizabalaceae | Huangshan, Anhui, China | KR025494 |
|
| Mu tong | Lardizabalaceae | Jinggangshan, Jiangxi, China | KR025495 |
|
| Mu tong | Lardizabalaceae | Genbank | KF022325 |
|
| Mu tong | Lardizabalaceae | Tongcheng, Anhui, China | KR025496 |
|
| Mu tong | Lardizabalaceae | Huating, Gansu, China | KR025498 |
|
| —— | Lardizabalaceae | Binzhou, Hunan, China | KR025500 |
|
| —— | Lardizabalaceae | Genbank | AY029795 |
|
| —— | Lardizabalaceae | Huangshan, Anhui, China | KR025501 |
|
| —— | Lardizabalaceae | Genbank | AY029787 |
|
| —— | Lardizabalaceae | Huangshan, Anhui, China | KR025502 |
|
| Huai Mu tong | Aristolochiaceae | Genbank | AM501928 |
|
| Huai Mu tong | Aristolochiaceae | Genbank | AM501930 |
|
| Guang Mu tong | Aristolochiaceae | Ji’an, Jilin, China | KR025503 |
|
| Guang Mu tong | Aristolochiaceae | Jingyu, Jilin, China | KR025504 |
|
| —— | Aristolochiaceae | Genbank | EF427951 |
|
| Chuan Mu tong | Ranunculaceae | Kaixian, Chongqing, China | KR025505 |
|
| Chuan Mu tong | Ranunculaceae | Genbank | GQ434613 |
|
| Chuan Mu tong | Ranunculaceae | Midu, Yunnan, China | KR025506 |
|
| Chuan Mu tong | Ranunculaceae | Zhangmu,Xizang | KR025507 |
|
| Chuan Mu tong | Ranunculaceae | Dali, Yunnan, China | KR025508 |
|
| Chuan Mu tong | Ranunculaceae | Genbank | GU732593 |
|
| Chuan Mu tong | Ranunculaceae | Genbank | JF714642 |
|
| —— | Actinidiaceae | Genbank | AF443211 |
|
| —— | Actinidiaceae | Genbank | JF980325 |
|
| —— | Actinidiaceae | Genbank | AY216736 |
Fig 1The predicted ITS2 secondary structure of the “Mu tong” taxa.
(a) Consensus secondary structure model of the ITS2 region based on 30 sequences covering four families (Actinidiaceae, Aristolochiaceae, Lardizabalaceae and Ranunculaceae). The four helices, each with a stem–loop, are labeled I–IV. Compatible base pairs are colored and the degree of conservation over the whole alignment is indicated with different degrees of color saturation. (b) One of the example secondary structures of Akebia quinata. (c) The position of the strictly conserved base pair sites (highlight in blue circle) found in the 70% consensus ITS2 secondary structure model.
Fig 2Comparison of multiple sequence alignments from different methods.
a. alignment from MARNA with secondary structure guiding; b. alignment from Clustal X without secondary structure guiding.
Statistics of phylogenetic analysis from different alignments.
| Statistic | Sequence alignment 1 (guiding without structure) | Sequence alignment 2 (guiding with structure) | Secondary structure | Sequence 2+structure |
|---|---|---|---|---|
| Aligned length | 321 | 314 | 315 | 629 |
| Indel character | 37 | 140 | 0 | 140 |
| Total length | 358 | 454 | 315 | 769 |
| No. of variable characters | 262 | 325 | 234 | 559 |
| No. of parsimony information character | 207 | 222 | 134 | 355 |
| No. of shortest trees | 128 | 8 | 90 | 61 |
| Tree length | 474 | 536 | 403 | 941 |
| CI | 0.8143 | 0.8190 | 0.8362 | 0.8236 |
| RI | 0.9364 | 0.9222 | 0.9241 | 0.9241 |
Fig 3Comparison of MP trees inferred from sequences with different align methods.
a. MP tree inferred from sequence alignment by Clustal X; b. MP tree inferred from sequence alignment by secondary structure information. Numbers on the branches indicate the bootstrap values of MP above 50%, numbers following a species name represent individual numbers.
Fig 4Comparison of MP trees inferred from different data sets.
a. MP tree inferred from secondary structure coding information; b. MP tree inferred from the combined nucleotide sequences and secondary structure information. Numbers on the branches indicate the bootstrap values of MP above 50%, numbers following a species name represent individual numbers.