| Literature DB >> 21060687 |
Sribash Roy1, Antariksh Tyagi, Virendra Shukla, Anil Kumar, Uma M Singh, Lal Babu Chaudhary, Bhaskar Datt, Sumit K Bag, Pradhyumna K Singh, Narayanan K Nair, Tariq Husain, Rakesh Tuli.
Abstract
BACKGROUND: The concept of DNA barcoding for species identification has gained considerable momentum in animals because of fairly successful species identification using cytochrome oxidase I (COI). In plants, matK and rbcL have been proposed as standard barcodes. However, barcoding in complex genera is a challenging task. METHODOLOGY AND PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 21060687 PMCID: PMC2965122 DOI: 10.1371/journal.pone.0013674
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The geographical distribution of different species of Berberis, Ficus and Gossipium.
Circles indicate the regions from where species were collected. Exact GPS data are shown in Table S1. Region A (western Himalayas): B. aristata, B. asiatica, B. chitria, B. glaucocarpa, B. jaeschkeana, B. lycium, B. pachyacantha, B. umbellata; Region B (Eastern Himalayas): B. angulosa, B. griffithiana, B. insignis, B. macrosepala, B. replicata; Region C (Satpura - Central India): B. Hainesii; Region D (Nilgiri Hills-Southern India): B. tinctoria, B. wightiana; Region E (Gujrat): G. herbaceum, G. arboreum Region F (Assam): G. barbadense; Region G (Karnataka): G. herbaceum, G. arboretum, G. hirsutum Region H (Andhra Pradesh): G. hirsutum; Region I (Tamil Nadu): G. barbadense; Region J (Uttar Pradesh): Ficus species.
Comparison of four loci tested on Berberis, Ficus and Gossypium.
|
|
|
| ||||||||||
| Locus | ITS |
|
|
| ITS |
|
|
| ITS |
|
|
|
| Number of species | 16.0 | 13.0 | 15.0 | 14.0 | 11.0 | 8.0 | 11.0 | 11.0 | 4.0 | 4.0 | 4.0 | 4.0 |
| Number of sequences | 129.0 | 78.0 | 97.0 | 83.0 | 33.0 | 25.0 | 32.0 | 31.0 | 51.0 | 46.0 | 42.0 | 37.0 |
| Mean sequence length | 602.2 | 488.1 | 479.0 | 410.0 | 619.7 | 704.5 | 630.1 | 386.0 | 551.1 | 628.5 | 626.1 | 418.3 |
| % PCR success | 97.0 | 76.0 | 97.0 | 92.0 | 100.0 | 85.0 | 100.0 | 91.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| % Sequencing success | 94.0 | 85.0 | 95.0 | 90.0 | 100.0 | 85.0 | 98.0 | 91.0 | 100.0 | 100.0 | 100.0 | 100.o |
| % PIC | 4.9 | 1.8 | 0.6 | 2.6 | 21.7 | 1.9 | 0.6 | 6.9 | 8.0 | 0.9 | 0.3 | 1.8 |
| % Variable sites | 6.1 | 3.0 | 0.8 | 3.2 | 30.0 | 3.6 | 1.2 | 7.1 | 8.7 | 3.2 | 1.0 | 3.2 |
| Mean inter-specific distances (K2P) | 0.011 (0.002) | 0.005 (0.001) | 0.001 (0.0005) | 0.009 (0.003) | 0.0824 (0.053) | 0.008 (0.004) | 0.001 (0.001) | 0.017 (0.009) | 0.051 (0.031) | 0.002 (0.004) | 0.002 (0.001) | 0.002 (0.002) |
| Mean intra-specific distances (K2P) | 0.003 (0.002) | 0.001 (0.001) | 0.0001 (0.0003 | 0.0009 (0.001) | 0.004 (0.003) | 0.004 (0.004) | 0.0 | 0.0 | 0.002 (0.002) | 0.002 (0.0039) | 0.001 (0.001) | 0.001 (0.001) |
*Parsimony informative characters.
Figures in parenthesis indicate standard deviation.
Percent sequencing success refers to the fraction of sequences having QV above 30 and at least 70% overlap between sequence reads using forward and reverse primers (except in some cases of matK) of total number of PCR products.
Figure 2Presence/Absence of barcode gap.
Sequence divergence for different gene regions in different species of Berberis. X-axis: maximum intraspecific, Y-axis: minimum interspecific K2P distances.
Proportion (%) of monophyletic species recovered with different phylogenetic methods using four individual loci and their combinations.
| Genus | Locus | NJ | MP | UPGMA |
|
| ITS | 6.3 | 6.3 | 6.3 |
|
| 7.7 | 7.7 | 7.7 | |
|
| 0.0 | 0.0 | 0.0 | |
|
| 7.1 | 7.1 | 7.1 | |
|
| ITS | 7.7 | 23.1 | 15.4 |
|
| 7.7 | 15.4 | 15.4 | |
|
| 7.7 | 7.7 | 7.7 | |
|
| 15.4 | 15.4 | 15.4 | |
| ITS | 30.8 | 30.8 | 30.8 | |
| ITS | 15.4 | 15.4 | 15.4 | |
| ITS | 30.8 | 23.1 | 23.1 | |
|
| 23.1 | 15.4 | 15.4 | |
|
| 15.4 | 15.4 | 15.4 | |
|
| 23.1 | 15.4 | 23.1 | |
| ITS | 23.1 | 30.8 | 15.4 | |
| ITS | 23.1 | 23.1 | 23.1 | |
|
| 15.4 | 23.1 | 15.4 | |
| ITS | 30.8 | 23.1 | 23.1 | |
| ITS | 30.8 | 23.1 | 15.4 | |
|
| ITS | 100.0 | 90.9 | 100.0 |
|
| 25.0 | 25.0 | 0.0 | |
|
| 9.1 | 0.0 | 9.1 | |
|
| 63.6 | 81.8 | 81.8 | |
|
| ITS | 100.0 | 100.0 | 75.0 |
|
| 0.0 | 0.0 | 0.0 | |
|
| 0.0 | 0.0 | 0.0 | |
|
| 25.0 | 0.0 | 0.0 |
*When all the sequences for individual locus are considered.
When data set was reduced to 58 sequences representing at least three common accessions per species for each locus.
Diagnostic characters for each species in ITS, matK, rbcL and trnH-psbA.
| ITS |
|
|
| ||
| Ref. genbank accession No. | AF403383.1 | AB038184.1 | GQ435419.1 | ||
|
| 29-T, 107-T, 526-G, 561-A, 562-T | 824-T, 1055-G | 476-C, 399-A, 248-A, 219-G | ||
| Ref. genbank accession No. | AY730065.1 | EU002177.1 | EU002278.1 | GQ982218.1 | |
|
| 73-A | ||||
|
| 62-C | 379-G | |||
|
| 65-T, 161-G, 202-T, 413-G, 443-T, 447-T, 480-T, 485-T, 507-C | 964-T | 433-A, 684-T | 279-T | |
|
| 78-G+181-G | ||||
|
| 450-T | ||||
|
| 42-T, 45-A, 56-A, 65-A, 74-T, 75-A, 78-T, 81-T, 106-A, 107-T, 129-T, 132-T, 138-A, 157-T, 159-A, 200-G, 219-T, 259-A, 285-C, 288-A, 289-T, 297-T, 308-A, 367-T, 372-T, 379-T, 382-T, 383-A, 385-C, 386-C, 418-A, 440-A, 442-T, 448-A, 453-A, 471-T, 474-T, 486-A, 492-T, 502-G, 509-A | 600-T, 601-T | 255-G | ||
|
| 167-C, 188-C, 200-A, 201-C, 408-G, 415-A, 510-G, 515-A | 583-G, | 188-T | ||
|
| 51-A | ||||
|
| 66-A, 86-T | 280-G, 156-T | |||
|
| 66-C, 71-G, 181-C, 466-A | 228-A, 212-C | |||
|
| 36-G+ (66-T/86-C), 160-T+(66-T or 86-C) | 690-A | |||
| Ref. genbank accession No. | AF057757.1 | DQ345959.1 | |||
|
| 154-C | ||||
|
| 103-C | 235-G | |||
|
| 193-T, 569-T | ||||
|
| 80-G+274-G |
Diagnostic characters for each locus were identified with reference to GenBank sequence for the locus of particular species or related species (if not available in the GenBank data base). For ITS, the start of ITS1 of reference sequence was considered as position 1, for matK and rbcL, the first nucleotide of start codon of reference sequence was considered as position 1 and for trnH-psbA, the first position of intergenic spacer of reference sequence was considered as position 1.
Figure 3Ratios of mean inter-and intraspecific K2P distances of different species of Berberis (A), Ficus (B) and Gossypium (C).
Ratios could not be determined in some species where there was either single accession or sequencing failure in some loci.
Figure 4Presence/Absence of barcode gap.
Sequence divergence for different gene regions for different species of Ficus (A) and Gossypium (B). X-axis: maximum intraspecific, Y-axis: minimum interspecific K2P distances.
Level of polymorphism, fingerprinting patterns and unique bands of AFLP markers in different species of Berberis.
| Primer pair | No. of Bands | No. of Polymorphic Bands | % Polymorphic Bands | Unique Bands (species) |
| P1. | 85 | 84 | 98.8 | 1 ( |
| P2. | 97 | 97 | 100 | 0 |
| P3. | 88 | 88 | 100 | 0 |
| P4. | 64 | 64 | 100 | 0 |
| P5. | 82 | 82 | 100 | 0 |
| P6. | 76 | 72 | 94.74 | 4 ( |
| P7. | 88 | 88 | 100 | 0 |
| P8. | 94 | 93 | 98.94 | 1 ( |
| P9. | 64 | 62 | 96.88 | 2 ( |
| P10. | 46 | 46 | 100 | 0 |
Distribution of total bands of AFLP in different species using different primer combinations.
| Primer pair | ||||||||||
| Species | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9 | P10 |
|
| 48 | 7 | 33 | 0 | 52 | 40 | 38 | 33 | 33 | 0 |
|
| 30 | 54 | 60 | 44 | 9 | 18 | 29 | 35 | 6 | 17 |
|
| 39 | 45 | 65 | 42 | 6 | 21 | 31 | 49 | 7 | 26 |
|
| 23 | 46 | 54 | 30 | 5 | 6 | 29 | 37 | 14 | 4 |
|
| 29 | 58 | 64 | 43 | 36 | 39 | 50 | 54 | 14 | 21 |
|
| 23 | 33 | 41 | 27 | 22 | 23 | 22 | 30 | 9 | 0 |
|
| 29 | 3 | 7 | 0 | 24 | 26 | 13 | 11 | 11 | 0 |
|
| 35 | 48 | 53 | 34 | 34 | 29 | 44 | 48 | 20 | 26 |
|
| 22 | 24 | 46 | 28 | 13 | 18 | 29 | 36 | 6 | 19 |
|
| 27 | 7 | 32 | 0 | 42 | 31 | 30 | 24 | 20 | 0 |
|
| 24 | 26 | 37 | 35 | 16 | 19 | 30 | 34 | 7 | 7 |
|
| 37 | 4 | 25 | 0 | 37 | 22 | 30 | 30 | 14 | 0 |
|
| 19 | 17 | 48 | 27 | 9 | 15 | 26 | 29 | 4 | 15 |
*Primer name as mentioned in Table 4.
Figure 5Results of principal coordinate analysis of AFLP markers for Berberis species showing separation of two clusters as revealed by phenograms.
The three vectors, one, two and three contribute 10.2%, 4.9 and 3.6% of total variability respectively. Colors represent regions from where species were collected. Green: Western Himalaya, Red: Eastern Himalaya, Blue: Central India and Dark Red: Southern India.
Figure 6UPGMA cluster analysis thirteen species of Berberis based on Jaccard's coefficient similarities of AFLP molecular banding patterns.
The numbers preceding the abbreviated species name denotes the DNA number. Abbreviations are: B.ang: Berberis angulosa, B.ari: Berberis aristata, B.asi: Berberis asiatica, B.chi: Berberis chitria, B.glu: Berberis glaucocarpa, B.hai: Berberis hainesii, B.ins: Berberis insignis, B.lyc: Berberis lycium, B.pac: Berberis pachyacantha, B.rep: Berberis replicata, B.tin: Berberis tinctoria, B.umb: Berberis umbellata, B.wit: Berberis wightiana.