| Literature DB >> 26900844 |
Sámed I I A Hadi1,2,3, Hugo Santana1,4, Patrícia P M Brunale1, Taísa G Gomes2, Márcia D Oliveira5, Alexandre Matthiensen6, Marcos E C Oliveira7, Flávia C P Silva1, Bruno S A F Brasil1.
Abstract
This study evaluated the feasibility of using the Ribulose Bisphosphate Carboxylase Large subunit gene (rbcL) and the Internal Transcribed Spacers 1 and 2 of the nuclear rDNA (nuITS1 and nuITS2) markers for identifying a very diverse, albeit poorly known group, of green microalgae from neotropical inland waters. Fifty-one freshwater green microalgae strains isolated from Brazil, the largest biodiversity reservoir in the neotropics, were submitted to DNA barcoding. Currently available universal primers for ITS1-5.8S-ITS2 region amplification were sufficient to successfully amplify and sequence 47 (92%) of the samples. On the other hand, new sets of primers had to be designed for rbcL, which allowed 96% of the samples to be sequenced. Thirty-five percent of the strains could be unambiguously identified to the species level based either on nuITS1 or nuITS2 sequences' using barcode gap calculations. nuITS2 Compensatory Base Change (CBC) and ITS1-5.8S-ITS2 region phylogenetic analysis, together with morphological inspection, confirmed the identification accuracy. In contrast, only 6% of the strains could be assigned to the correct species based solely on rbcL sequences. In conclusion, the data presented here indicates that either nuITS1 or nuITS2 are useful markers for DNA barcoding of freshwater green microalgae, with advantage for nuITS2 due to the larger availability of analytical tools and reference barcodes deposited at databases for this marker.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26900844 PMCID: PMC4767179 DOI: 10.1371/journal.pone.0149284
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of primers used in this study, including the primer sequences, amplicon length, annealing temperature and the sequencing success rate for a total of 51 strains tested.
| Primer pair | Molecular marker | Sequence | Amplicon length (Nucleotides span) | Annealing temperature | Sequencing success rate | Reference |
|---|---|---|---|---|---|---|
| Fw_ITS1/Rv_ITS4 | ITS1-5.8S-ITS2 | Fw_ITS1: 5’–AGGAGAAGTCGTAACAAGGT– 3’ Rv_ITS4: 5’–TCCTCCGCTTATTGATATGC– 3’ | ≈ 650 pb | 52°C | 92,15% | [ |
| Fw_rbcL_192/Rv_rbcL_657 | rbcL | Fw_rbcL_192: 5’–GGTACTTGGACAACWGTWTGGAC– 3’ Rv_rbcL_657: 5’–GAAACGGTCTCKCCARCGCAT– 3’ | 465 pb (position 192 to 657) | 52°C | 82,35% | This study |
| Fw_rbcL_375/Rv_rbcL_1089 | rbcL | Fw_rbcL_375: 5’–TTTGGTTTCAAAGCIYTWCGTGC– 3’ Rv_rbcL_1089: 5’–ATACCACGRCTACGRTCTTT– 3’ | 714 pb (position 375 to 1089) | 52°C | 50,98% | This study |
| Fw_rbcL_192/Rv_rbcL_1089 | rbcL | Fw_rbcL_192: 5’–GGTACTTGGACAACWGTWTGGAC– 3’ Rv_rbcL_1089: 5’–ATACCACGRCTACGRTCTTT– 3’ | 897 pb (position 192 to 1089) | 52°C | 37,25% | This study |
| Fw_rbcLa_f/Rv_rbcL_ajf634R | rbcL | Fw_rbcLa_f 5’–ATGTCACCACAAACAGAAACTAAAGC– 3’ Rv_rbcL_ajf634R: 5’–GAAACGGTCTCTCCAACGCAT– 3’ | 654 pb (position 1 to 654) | 54°C | 15,69% | [ |
| Fw_rbcL_109/Rv_rbcL_657 | rbcL | Fw_rbcL_109: 5’–TTCTTGCTGCITTYCGTATG– 3’ Rv_rbcL_657: 5’–GAAACGGTCTCKCCARCGCAT– 3’ | 548 pb (position 109 to 657) | 52°C | 13,75% | This study |
| Fw_rbcLa_f/rbcLA_rev | rbcL | Fw_rbcLa_f: 5’–ATGTCACCACAAACAGAGACTAAAGC– 3’ rbcLA_rev: 5’–GTAAAATCAAGTCCACCRCG– 3’ | 599 pb (position 1 to 599) | 54°C | 7,84% | [ |
| Fw_rbcL_109/Rv_rbcL_1089 | rbcL | Fw_rbcL_109: 5’–TTCTTGCTGCITTYCGTATG– 3’ Rv_rbcL_1089: 5’–ATACCACGRCTACGRTCTTT– 3’ | 980 pb (position 109 to 1089) | 52°C | 1,96% | This study |
| Fw_rbcL_RH1/rbcL_724R | rbcL | Fw_rbcL_RH1: 5’–ATGTCACCACAAACAGAAACTAAAGC– 3’ rbcL_724R: 5’–TCGCATGTACCTGCAGTAGC– 3’ | 743 pb (position 1 to 743) | 54°C | 1,96% | [ |
| Fw_rbcL_RH1/rbcL_1385R | rbcL | Fw_rbcL_RH1: 5’–ATGTCACCACAAACAGAAACTAAAGC– 3’ rbcL_1385R: 5’–AATTCAAATTTAATTTCTTTCC– 3’ | 1406 pb (position 1 to 1406) | 48°C | 0% | [ |
Molecular identification of the strains used in this study, including the percentual of identity, accession number and the name of the identified species on the Barcode of Life Database (based on rbcL marker sequence) and GenBank (based on nuITS2 marker sequence).
| Strain | ITS1 (GenBank) | ITS2 (GenBank) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Closest match species | Identity | GenBank access | Closest match species | Identity | Number of CBCs / hCBCs | GenBank access | Closest match species | Identity | GenBank access | |
| LBA#1 | 95% | 91% | 1 / 7 | 90% | ||||||
| LBA#2 | 100% | 100% | 0 / 0 | 93% | ||||||
| LBA#3 | 100% | 100% | 0 / 0 | 90% | ||||||
| LBA#4 | 70% | 91% | 1 / 2 | 91% | ||||||
| LBA#5 | 79% | 84% | 4 / 2 | 93% | ||||||
| LBA#6 | 70% | 94% | - | 91% | ||||||
| LBA#7 | 79% | 84% | 4 / 2 | 93% | ||||||
| LBA#8 | 94% | 87% | 0 / 1 | 95% | ||||||
| LBA#9 | 90% | 93% | 0 / 1 | 92% | ||||||
| LBA#10 | 90% | 93% | 0 / 1 | 92% | ||||||
| LBA#11 | 90% | 93% | 0 / 1 | 92% | ||||||
| LBA#12 | 90% | 93% | 0 / 1 | - | - | |||||
| LBA#13 | 96% | 95% | 0 / 2 | 90% | ||||||
| LBA#14 | 90% | 93% | 0 / 1 | 92% | ||||||
| LBA#15 | 90% | 93% | 0 / 1 | 92% | ||||||
| LBA#16 | - | - | - | - | - | 90% | ||||
| LBA#17 | 90% | 93% | 0 / 1 | 92% | ||||||
| LBA#18 | 90% | 93% | 0 / 1 | 92% | ||||||
| LBA#19 | - | - | - | - | - | 90% | ||||
| LBA#20 | 76% | 81% | 0 / 6 | 93% | ||||||
| LBA#21 | 96% | 95% | 0 / 2 | 94% | ||||||
| LBA#22 | 100% | 99% | 0 / 0 | 93% | ||||||
| LBA#23 | 100% | 99% | 0 / 0 | 94% | ||||||
| LBA#24 | 94% | 94% | 0 / 3 | 94% | ||||||
| LBA#25 | 94% | 94% | 0 / 3 | 94% | ||||||
| LBA#26 | 100% | 100% | 0 / 0 | 92% | ||||||
| LBA#27 | 100% | 100% | 0 / 0 | 99% | ||||||
| LBA#28 | - | - | - | - | - | 94% | ||||
| LBA#29 | 92% | 91% | 0 / 2 | 93% | ||||||
| LBA#30 | 100% | 100% | 0 / 0 | 93% | ||||||
| LBA#31 | 92% | 91% | 0 / 2 | 93% | ||||||
| LBA#32 | 99% | FM205863.1 | 100% | 0 / 0 | FM205863.1 | 99% | FM205863.1 | |||
| LBA#33 | 99% | FM205863.1 | 100% | 0 / 0 | FM205863.1 | 99% | FM205863.1 | |||
| LBA#34 | 99% | FM205863.1 | 100% | 0 / 0 | FM205863.1 | 99% | FM205863.1 | |||
| LBA#35 | 100% | 99% | 0 / 1 | 93% | ||||||
| LBA#36 | 100% | 100% | 0 / 0 | 93% | ||||||
| LBA#37 | 90% | 92% | 1 / 2 | 94% | ||||||
| LBA#38 | 90% | 92% | 1 / 2 | 94% | ||||||
| LBA#39 | 100% | 100% | 0 / 0 | 99% | ||||||
| LBA#40 | 79% | 95% | 2 / 0 | 96% | ||||||
| LBA#41 | 94% | 87% | 0 / 3 | 95% | ||||||
| LBA#42 | 99% | FM205863.1 | 100% | 0 / 0 | FM205863.1 | 99% | FM205863.1 | |||
| LBA#43 | 99% | FM205863.1 | 100% | 0 / 0 | FM205863.1 | 99% | FM205863.1 | |||
| LBA#44 | 99% | FM205863.1 | 100% | 0 / 0 | FM205863.1 | 99% | FM205863.1 | |||
| LBA#45 | 82% | 86% | - | 92% | ||||||
| LBA#46 | 98% | 100% | 0 / 0 | - | - | |||||
| LBA#47 | 95% | 85% | 2 / 0 | 91% | ||||||
| LBA#48 | - | - | - | - | - | 93% | ||||
| LBA#49 | 83% | 90% | 0 / 1 | 90% | ||||||
| LBA#50 | 99% | 96% | 0 / 0 | 99% | ||||||
| LBA#51 | 74% | 84% | 2 / 1 | 95% | ||||||
The compensatory and hemi-compensatory base changes (CBCs/hemi-CBCs) between the indicated sequence and its closest match in the ITS2 Database are shown. An hyphen (-) is indicated for samples that could not be amplified and/or sequenced, and for the nuITS2 sequences for which secondary structure predictions and CBCs/Hemi-CBCs analysis were not possible.
Fig 1Representative DIC microscopic images of Embrapa|LBA strains assigned to not formally described species.
(A and D) Embrapa|LBA#36. (B and E) Embrapa|LBA#32. (C and F) Embrapa|LBA#35. Scale bars = 5 μm.
Fig 2Phylogenetic tree for Chlorella and Micractinium genera inferred based on ITS1-5.8S-ITS2 sequences.
Chlorella sp. and Micractinium sp. ITS1-5.8S-ITS2 reference barcode sequences reported by Luo et al. (2010) [39] and Bock et al. (2011) [14] were included in the analysis together with Embrapa|LBA#27, #32–34, #39, #42–44 and #50 strains sequences and their respectively closest sequences at GenBank. Identical sequences were omitted for simplification. The phylogenetic tree was inferred using the Maximum Composite Likelihood method based on dataset of 472 aligned positions of 31 nucleotide sequences. For the analysis, the GTR+G+I model was chosen. For the analysis, the GTR model with invariable sites (I) and gamma distribution shape parameter (G) was chosen. The bootstrap values (1000 replicates) are shown next to the branches.
Fig 3Phylogenetic tree for Desmodesmus genus inferred based on ITS1-5.8S-ITS2 sequences.
Demodesmus sp. ITS1-5.8S-ITS2 reference barcode sequences reported by Fawley et al. (2011) [53] and Gorelova et al. (2014) [54] were included in the analysis together with Embrapa|LBA#2–3, #22–23, #26, #30 and #35–36 strains sequences and their respectively closest sequences at GenBank. Identical sequences were omitted for simplification. The phylogenetic tree was inferred using the Maximum Composite Likelihood method based on a dataset of 470 aligned positions of 34 nucleotide sequences. For the analysis, the GTR+G+I model was chosen. The bootstrap values (1000 replicates) are shown next to the branches.