| Literature DB >> 20644717 |
Barbara A Whitlock1, Amanda M Hale, Paul A Groff.
Abstract
BACKGROUND: The chloroplast trnH-psbA spacer region has been proposed as a prime candidate for use in DNA barcoding of plants because of its high substitution rate. However, frequent inversions associated with palindromic sequences within this region have been found in multiple lineages of Angiosperms and may complicate its use as a barcode, especially if they occur within species. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 20644717 PMCID: PMC2903610 DOI: 10.1371/journal.pone.0011533
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Portion of 35 trnH-psbA sequences including 25–27bp inversion and 18bp flanking inverted repeats.
Groups of conspecific sequences are shaded. Asterisks mark sequences with the A inversion configuration. Plus signs at the top of the alignment mark invariant sites. Inversion region is in box.
Figure 2Neighbor-joining trees resulting from two treatments of trnH-psbA inversion.
(A) NJ tree resulting from analysis of unedited raw sequence matrix, in which different taxa have either one or the other of two inversion configurations, and (B) NJ tree resulting from analysis of unedited uniform inversion matrix, in which the loop region of sequences with the A configuration has been replaced with its reverse complement. Conspecific sequences are indicated in colored boxes. Gs. = Gentianopsis, F. = Frasera, S. = Swertia, L. = Lomatogonium, C. = Comastoma, Gl. = Gentianella, G. = Gentiana. Two subtribes of Gentianaceae and two sections of Gentiana are indicated in gray boxes. Sequences with the A configuration are noted with an asterisk. Bootstrap values >50% are given above branches. The scale bar refers to both trees.
Characteristics of conspecific trnH-psbA sequences of Gentianaceae polymorphic for the inversion regiona.
| Raw sequence matrix | Uniform inversion matrix | ||||||||
| Taxon | N | Sequence length (bp) | Inversion length (bp) | # Variable sites | Maximum intraspecific distance | Minimum interspecific distance | # Variable sites | Maximum intraspecific distance | Minimum interspecific distance |
|
| 4 | 378−397 | 27 (6.8−7.1%) | 25 (6.8%) | 0.06424 | 0.04852 | 4 (1.1%) | 0.04788 | 0.04854 |
|
| 3 | 444 | 25 (5.6%) | 17 (3.8%) | 0.01798 | 0.01117 | 0 | 0.00225 | 0.01117 |
|
| 3 | 284 | 25 (8.8%) | 17 (6.0%) | 0.02807 | 0 | 0 | 0 | 0 |
|
| 3 | 424−425 | 25 (5.9%) | 17 (4.0%) | 0.01887 | 0.00937 | 0 | 0 | 0.00939 |
|
| 3 | 343−374 | 27 (7.2−7.9%) | 23 (6.7%) | 0.04053 | 0.1147 | 2 (0.6%) | 0.0114 | 0.11615 |
|
| 4 | 226 | 27 (12%) | 21 (9.3−9.7%) | 0.09292 | 0.08445 | 0 | 0 | 0.08445 |
Calculations were performed on unedited alignments. Similar results were obtained from manually adjusted, edited alignments. Gapped characters due to insertions and deletions were excluded from calculations. All distances were measured as uncorrected p-distances. Minimum interspecific distances were calculated by comparing the sequences of the species of interest to sequences of all other species in the dataset.