| Literature DB >> 21637336 |
Peter M Hollingsworth1, Sean W Graham, Damon P Little.
Abstract
The main aim of DNA barcoding is to establish a shared community resource of DNA sequences that can be used for organismal identification and taxonomic clarification. This approach was successfully pioneered in animals using a portion of the cytochrome oxidase 1 (CO1) mitochondrial gene. In plants, establishing a standardized DNA barcoding system has been more challenging. In this paper, we review the process of selecting and refining a plant barcode; evaluate the factors which influence the discriminatory power of the approach; describe some early applications of plant barcoding and summarise major emerging projects; and outline tool development that will be necessary for plant DNA barcoding to advance.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21637336 PMCID: PMC3102656 DOI: 10.1371/journal.pone.0019254
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic timeline of the consideration of different markers as plant barcodes.
Colours (red = warm; blue = cool) represent an informal measure of enthusiasm among DNA barcoding researchers in the systematics community for CBOL and iBOL adoption of different markers. The different shading of trnL (P6) reflects the parallel use of the P6 loop for DNA profiling of degraded DNAs in ecological studies (see text). * = the two markers that form the core-barcode for land plants. rbcL a is used in this figure to distinguish this shorter barcode region of the gene proposed by Kress and Erikson [7] and the full length (ca. 1400 bp) gene sequence of rbcL. Elsewhere in the text, when we refer to rbcL we are referring to the short barcode region. The dashed lines indicate the timing of three international barcoding conferences in London (2005), Taipei (2007) and Mexico City (2009). The consideration of the different markers as barcodes are from the following sources: Kress et al. [5], Chase et al. [102], Chen et al. [54], Kew consortium [4], [6], Kim et al. see [103], Lahaye et al. [8], Newmaster et al. [104], Kress and Erickson [7], Taberlet et al. [44], Presting et al. [105].
Characteristics of different markers that have routinely been included in plant barcoding studies.
| Marker | Genomic source | Type | Approx number of GenBank accessions | Approx number of GenBank genera | Approx number of GenBank species | Median amplicon length (bases) in completely sequenced plastid genomes | IQR amplicon length (bases) | Amplicon length range (bases) | Number of samples used to estimate amplicon length | Freq. of amplicons with mono-nculeotide repeats ≥10 bases |
|
| Nuclear | Transcribed spacers and 5.8S gene | 102684 | 13307 | 52450 | 705 | 683–724 | 407–1630 | 5020 | 0.013 |
|
| Nuclear | Transcribed spacer | 111370 | 15817 | 57579 | 494 | 492–506 | 157–670 | 646 | 0.005 |
|
| Plastid | Inter-genic spacer | 1180 | 274 | 664 | 669 | 578–707 | 390–918 | 134 | 0.440 |
|
| Plastid | Protein coding | 34647 | 7454 | 22701 | 889 | 880–889 | 862–910 | 132 | 0.235 |
|
| Plastid | Inter-genic spacer | 1241 | 208 | 626 | 468 | 444–492 | 112–1253 | 134 | 0.500 |
|
| Plastid | Protein coding | 27725 | 8959 | 20374 | 654 | 654–654 | 654–654 | 134 | 0.000 |
|
| Plastid | Protein coding | 3341 | 751 | 1970 | 548 | 548–548 | 536–590 | 132 | 0.008 |
|
| Plastid | Protein coding | 5314 | 1110 | 3075 | 616 | 616–616 | 610–622 | 132 | 0.000 |
|
| Plastid | Inter-genic spacer | 23526 | 2833 | 11539 | 509 | 401–617 | 226–934 | 135 | 0.296 |
|
| Plastid | Intron and inter-genic spacer | 59197 | 9129 | 35130 | 994 | 907–1037 | 201–2114 | 132 | 0.280 |
|
| Plastid | Intron | 70811 | 10561 | 38329 | 87 | 83–91 | 51–135 | 130 | 0.054 |
The number of sequences deposited in GenBank for each marker was estimated from sequence annotations and should be considered only as an approximation. Estimates of amplicon size (including priming sites) and frequencies of mononucleotide repeats were made from all annotated land plant whole plastid genomes deposited in GenBank and for nrITS from a curated set of nrITS sequences. Mononucleotide repeats less than 10 bp in length were excluded because these generally do not affect sequence quality [36], [37].
Figure 2Schematic representation of the impacts of intra-specific gene flow on species discrimination success.
Parts (A) and (B) each represent two species (one shades of red, one shades of blue), each consisting of three populations. The black line between the species indicates a barrier to gene flow, with the thickness of the line indicating the strength of the barrier. In (A) intra-specific gene flow among populations is high (indicated by the vertical arrows). Thus, where gene flow occurs between species (wavy arrow), there is a barrier to extensive neutral introgression because establishment of immigrant alleles is prevented by a regular influx of conspecific alleles from other populations. In (B) intra-specific gene flow among populations is low. Thus populations are more differentiated from one another and are less likely to show taxon-specific barcode markers. In addition, the flux preventing establishment of introgressed alleles is lower because it involves only alleles in the (middle) recipient population and not the other populations of the ‘blue’ species.
Key factors likely to lead to lower levels of success in species discrimination in DNA barcoding studies.
| Factor | Situations where lower species discrimination success is expected |
| Hybridization | Groups in which hybridization is frequent and hybrids show some fertility |
| Polyploidy | Groups in which speciation frequently involves polyploidy |
| Life history | Groups of long lived organisms and/or those with slow mutation rates |
| Breeding system | Species groups consisting of closely related agamospermous or autogamous lineages |
| Species history | Species groups where speciation has been recent and rapid, or where continuously large historical population sizes lead to maintenance of ancestral polymorphism |
| Level of taxonomic ‘splitting’ | Groups in which the species limits have been very narrowly defined |
| Seed dispersal | Angiosperm species groups in which seed dispersal is poor (plastid barcodes) |
Applications of DNA barcoding in plants.
| Application | Barcode markers used | Notes | Ref. |
| Identification of cryptic orchid species |
|
| |
| Identification of cryptic |
|
| |
| Identification of cryptic |
|
| Bell unpub. |
| Identification of cryptic | nrITS, | nrITS, |
|
| Identification of seedlings in tropical forest plots | nrITS, |
|
|
| Community phylogenetics of a tropical forest plot |
|
| |
| Identification of Chinese medicinal plants: Polygonaceae |
|
| |
| Identification of Chinese medicinal plants: Fabaceae | nrITS2, |
| |
| Identification of |
|
| |
| Identification of |
|
| |
| Identification of medicinal plant species | nrITS2, nrITS, | nrITS2 rated as the best performing marker |
|
| Identification among berry species in foods | nrITS |
|
|
| Assessing the plant components of honey |
|
| |
| Identification of invasive species ( | nrITS, |
| |
| Identification of invasive species ( |
|
|
|
| Molecular identification of roots in grassland communities |
|
| |
| Identification of |
|
| |
| Identification of an aquatic fern gametophyte |
|
|
|
| Identification of traded fern species |
|
| |
| Identification of CITES listed ramin timber and products |
|
| |
| Identification of poisonous plants |
|
|
|
| Identification of plant components of herbivore diet |
|
| |
| Identification of plant components of herbivore diet |
|
| |
| Identification of plant components of herbivore diet |
|
| |
| Proof-of-concept for application to forensics |
|
|
The table includes studies using a range of barcoding markers (beyond the rbcL+matK core barcode) to capture the broad spectrum of current applications.
Examples of plant DNA barcoding projects underway or in the planning stage in 2011.
| Project | Lead Institute |
| TreeBOL: Barcoding the world's tree species | The New York Botanic Garden |
| GrassBOL: Barcoding grasses and grass-like plants | Adelaide University and University of British Columbia |
| Flora of the Kruger National Park | University of Johannesburg |
| Flora of the Area de Conservacion Guanacaste Costa Rica | University of Pennsylvania |
| Flora of Korea | Korea University |
| Plant Barcoding China: DNA barcoding of 5000 Chinese plant species | Kunming Institute of Botany |
| All-genera: DNA barcoding of representatives of all angiosperm genera | The New York Botanic Garden |
| DNA barcoding of Centre for Tropical Forestry Plots | Smithsonian Institute |
| DNA barcoding Chinese medicinal plants | Institute of Medicinal Plant Development Beijing |
| DNA barcoding the flora of Wales | National Botanic Garden of Wales |
| DNA barcoding British bryophytes | Royal Botanic Garden Edinburgh |