| Literature DB >> 20957043 |
Hui Yao1, Jingyuan Song, Chang Liu, Kun Luo, Jianping Han, Ying Li, Xiaohui Pang, Hongxi Xu, Yingjie Zhu, Peigen Xiao, Shilin Chen.
Abstract
BACKGROUND: The internal transcribed spacer 2 (ITS2) region of nuclear ribosomal DNA is regarded as one of the candidate DNA barcodes because it possesses a number of valuable characteristics, such as the availability of conserved regions for designing universal primers, the ease of its amplification, and sufficient variability to distinguish even closely related species. However, a general analysis of its ability to discriminate species in a comprehensive sample set is lacking. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 20957043 PMCID: PMC2948509 DOI: 10.1371/journal.pone.0013102
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Box plots of the ITS2 sequence length of plants and animals.
In a box plot, the box shows the interquartile range (IQR) of the data. The IQR is defined as the difference between the 75th percentile and the 25th percentile. The solid and dotted line through the box represent the median and the average length, respectively.
Figure 2Box plots of GC contents of ITS2 of plants and animals.
In a box plot, the box shows the IQR of the data. The IQR is defined as the difference between the 75th percentile and the 25th percentile. The solid and dotted line through the box represent the median and the average GC contents, respectively.
Analysis of intra- and inter-specific divergences of congeneric species in plants and animals.
| Taxa | Animals | Dicotyledons | Monocotyledons | Gymnosperms | Mosses | Ferns |
|
| 0.3761±0.5982 | 0.1042±0.1393 | 0.1829±0.1940 | 0.0537±0.0892 | 0.1007±0.0913 | 0.4758±0.3547 |
|
| 0.2820±0.4257 | 0.0999±0.1118 | 0.1127±0.1310 | 0.0573±0.0744 | 0.1874±0.1792 | 0.4995±0.2906 |
|
| 0.1361±0.2254 | 0.0370±0.0667 | 0.0386±0.0809 | 0.0195±0.0576 | 0.0838±0.1466 | 0.2399±0.3173 |
|
| 0.0522±0.1150 | 0.0214±0.0809 | 0.0309±0.0712 | 0.0170±0.0413 | 0.0114±0.0456 | 0.0082±0.0160 |
|
| 0.0274±0.0809 | 0.0231±0.0781 | 0.0244±0.0764 | 0.0255±0.0511 | 0.0289±0.0792 | 0.0262±0.0254 |
|
| 0.0596±0.1962 | 0.0363±0.1739 | 0.0360±0.1213 | 0.0368±0.0653 | 0.0452±0.1087 | 0.0336±0.0256 |
Identification efficiency of ITS2 regions in plants and animals using BLAST1 method.
| Taxa | Taxa level | Correct identification (%) | Ambiguous identification (%) |
| Animals | Species | 91.7 | 8.3 |
| Genus | 99.7 | 0.3 | |
| Dicotyledons | Species | 76.1 | 23.9 |
| Genus | 99.1 | 0.9 | |
| Monocotyledons | Species | 74.2 | 25.8 |
| Genus | 97.9 | 2.1 | |
| Gymnosperms | Species | 67.1 | 32.9 |
| Genus | 99.5 | 0.5 | |
| Mosses | Species | 77.4 | 22.6 |
| Genus | 98.6 | 1.4 | |
| Ferns | Species | 88.1 | 11.9 |
| Genus | 100.0 | 0 |
Figure 3Identification efficiency when using ITS2 regions to distinguish between closely related species in different families of plants and animals using the BLAST1 method.
The ITS2 sequences of all animal phyla, dicotyledon, gymnosperm, and mosses families with more than 10 genera, monocotyledon families with more than 5 genera, and all fern families are shown in this figure.
Success rates of ITS2 for species identification in genera with more than 80 species in dicotyledons.
| Family name | Genus name | No. of species | No. of samples | Success rate at the species level (%) |
| Fabaceae |
| 322 | 381 | 65.9 |
| Fabaceae |
| 234 | 266 | 95.5 |
| Fabaceae |
| 223 | 334 | 70.1 |
| Melastomataceae |
| 206 | 223 | 66.4 |
| Brassicaceae |
| 199 | 452 | 27.2 |
| Asteraceae |
| 185 | 284 | 58.5 |
| Plantaginaceae |
| 178 | 264 | 90.2 |
| Oxalidaceae |
| 176 | 201 | 80.6 |
| Moraceae |
| 174 | 215 | 85.6 |
| Solanaceae |
| 162 | 248 | 83.9 |
| Asteraceae |
| 161 | 219 | 77.6 |
| Fabaceae |
| 138 | 165 | 55.8 |
| Fabaceae |
| 127 | 151 | 72.8 |
| Rosaceae |
| 124 | 199 | 72.9 |
| Begoniaceae |
| 124 | 236 | 97.9 |
| Polygalaceae |
| 123 | 128 | 89.8 |
| Asteraceae |
| 118 | 159 | 63.5 |
| Rosaceae |
| 118 | 151 | 67.5 |
| Acanthaceae |
| 117 | 151 | 79.5 |
| Euphorbiaceae |
| 117 | 168 | 86.9 |
| Balsaminaceae |
| 117 | 137 | 97.8 |
| Apiaceae |
| 113 | 136 | 62.5 |
| Myrtaceae |
| 106 | 135 | 61.5 |
| Euphorbiaceae |
| 104 | 142 | 59.9 |
| Calceolariaceae |
| 99 | 103 | 74.8 |
| Convolvulaceae |
| 98 | 261 | 74.7 |
| Caryophyllaceae |
| 97 | 141 | 40.4 |
| Lamiaceae |
| 96 | 213 | 81.2 |
| Berberidaceae |
| 94 | 164 | 55.5 |
| Ericaceae |
| 86 | 233 | 21.9 |
| Euphorbiaceae |
| 84 | 127 | 66.9 |
| Sapindaceae |
| 83 | 745 | 81.5 |
| Rosaceae |
| 82 | 222 | 78.8 |
| Urticaceae |
| 81 | 88 | 97.7 |
| Rubiaceae |
| 81 | 111 | 72.1 |
Success rates of ITS2 for species identification in genera with more than 30 species in monocotyledons.
| Family name | Genus name | No. of species | No. of samples | Success rate at the species level (%) |
| Alliaceae |
| 273 | 717 | 72.7 |
| Amaryllidaceae |
| 43 | 57 | 86.0 |
| Amaryllidaceae |
| 34 | 34 | 52.9 |
| Arecaceae |
| 49 | 161 | 95.7 |
| Asphodelaceae |
| 52 | 99 | 16.2 |
| Costaceae |
| 50 | 94 | 52.1 |
| Cyperaceae |
| 318 | 506 | 80.6 |
| Cyperaceae |
| 52 | 122 | 90.2 |
| Hyacinthaceae |
| 31 | 50 | 70.0 |
| Juncaceae |
| 45 | 56 | 51.8 |
| Juncaceae |
| 42 | 51 | 68.6 |
| Liliaceae |
| 79 | 228 | 56.1 |
| Liliaceae |
| 78 | 124 | 79.0 |
| Liliaceae |
| 49 | 58 | 82.8 |
| Musaceae |
| 37 | 63 | 82.5 |
| Orchidaceae |
| 227 | 482 | 62.9 |
| Orchidaceae |
| 139 | 215 | 65.1 |
| Orchidaceae |
| 121 | 160 | 91.9 |
| Orchidaceae |
| 120 | 143 | 79.7 |
| Orchidaceae |
| 100 | 260 | 22.7 |
| Orchidaceae |
| 85 | 192 | 76.6 |
| Orchidaceae |
| 56 | 232 | 65.9 |
| Orchidaceae |
| 48 | 49 | 79.6 |
| Orchidaceae |
| 46 | 55 | 49.1 |
| Orchidaceae |
| 42 | 59 | 98.3 |
| Orchidaceae |
| 42 | 52 | 71.2 |
| Orchidaceae |
| 41 | 75 | 69.3 |
| Orchidaceae |
| 38 | 46 | 76.1 |
| Orchidaceae |
| 36 | 66 | 81.8 |
| Orchidaceae |
| 33 | 61 | 31.1 |
| Orchidaceae |
| 33 | 40 | 100.0 |
| Orchidaceae |
| 30 | 58 | 74.1 |
| Poaceae |
| 115 | 178 | 46.1 |
| Poaceae |
| 66 | 80 | 76.3 |
| Poaceae |
| 54 | 155 | 74.2 |
| Poaceae |
| 51 | 69 | 72.5 |
| Poaceae |
| 31 | 36 | 80.6 |
| Poaceae |
| 31 | 481 | 81.7 |
| Potamogetonaceae |
| 33 | 211 | 72.5 |
| Zingiberaceae |
| 60 | 103 | 57.3 |
| Zingiberaceae |
| 46 | 85 | 68.2 |
| Zingiberaceae |
| 37 | 52 | 94.2 |
Success rates of ITS2 for species identification in genera with more than 20 species in animals.
| Family name | Genus name | No. of species | No. of samples | Success rate at the species level (%) |
| Aphelenchoididae |
| 32 | 86 | 81.4 |
| Camaenidae |
| 27 | 122 | 100.0 |
| Ceratopogonidae |
| 39 | 134 | 100.0 |
| Chrysomelidae |
| 42 | 183 | 97.3 |
| Chrysomelidae |
| 23 | 45 | 73.3 |
| Clausiliidae |
| 25 | 31 | 96.8 |
| Clausiliidae |
| 20 | 23 | 95.7 |
| Conidae |
| 23 | 23 | 100.0 |
| Culicidae |
| 23 | 241 | 98.8 |
| Culicidae |
| 21 | 154 | 93.5 |
| Dolichopodidae |
| 38 | 65 | 73.8 |
| Drosophilidae |
| 40 | 43 | 81.4 |
| Enidae |
| 24 | 44 | 95.5 |
| Gyrodactylidae |
| 49 | 135 | 99.3 |
| Heteroderidae |
| 41 | 211 | 93.8 |
| Longidoridae |
| 25 | 52 | 100.0 |
| Lycaenidae |
| 75 | 111 | 90.1 |
| Nesticidae |
| 26 | 51 | 100.0 |
| Nitidulidae |
| 79 | 82 | 87.8 |
| Planorbidae |
| 22 | 91 | 95.6 |
| Poritidae |
| 20 | 206 | 89.3 |
| Pratylenchidae |
| 22 | 154 | 97.4 |
| Psychodidae |
| 24 | 129 | 100.0 |
| Reduviidae |
| 28 | 127 | 94.5 |
| Sarcophagidae |
| 24 | 33 | 100.0 |
| Simuliidae |
| 22 | 177 | 80.8 |
| Steinernematidae |
| 46 | 140 | 96.4 |
| Trichogrammatidae |
| 59 | 278 | 99.3 |
Figure 4The secondary structure of ITS2 in different species of dicotyledons.
Figure 5The workflow diagram for the construction of ITS2 sequences libraries.