| Literature DB >> 28719667 |
Lo'ai Alanagreh1, Caitlin Pegg1, Amritha Harikumar1, Mark Buchheim1.
Abstract
Primary and secondary structural data from the internal transcribed spacer two (ITS2) have been used extensively for diversity studies of many different eukaryotic organisms, including the green algae. Ease of amplification is due, at least in part, to the fact that ITS2 is part of the tandemly-repeated rRNA array. The potential confounding influence of intragenomic variability has yet to be addressed except in a few organisms. Moreover, few of the assessments of intragenomic variation have taken advantage of the deep sequencing capacity of sequence-by-synthesis protocols. We present results from our adaptation of the 16S Metagenomics Sequencing Library Preparation/Illumina protocol for deep sequencing of the ITS2 genes in selected isolates of the green algal genus, Haematococcus. Deep sequencing yielded from just under 20,000 to more than 500,000 merged reads, outpacing results from recent pyrosequencing efforts. Furthermore, a conservative evaluation of these data revealed a range of three to six ITS2 sequence haplotypes (defined as unique sets of nucleotide polymorphisms) across the taxon sampling. The frequency of the dominant haplotype ranged from 0.35 to 0.98. In all but two cases, the haplotype with the greatest frequency corresponded to a sequence obtained by the Sanger method using PCR templates. Our data also show that results from the sequencing-by-synthesis approach are reproducible. In addition to advancing our understanding of ribosomal RNA variation, the results of this investigation will allow us to begin testing hypotheses regarding the maintenance of homogeneity across multi-copy genes.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28719667 PMCID: PMC5515447 DOI: 10.1371/journal.pone.0181491
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Electropherogram from Sanger sequencing of rRNA amplicons for a clonal isolate of Haematococcus pluvialis.
Arrows indicate sites of putative intragenomic variation in a relatively short stretch of primary sequence.
List of isolates studied and the quantitative tallies from results of deep sequencing for each isolate and replicate runs (R1, R2, or R3) for selected isolates.
| Isolate of | Filtered reads | Merged |
|---|---|---|
| 2,247,678 | 591,186 | |
| 3,197,440 | 1,512,414 | |
| 3,230,078 | 1,644,994 | |
| 531,328 | 44,766 | |
| 3,012,714 | 1,176,852 | |
| 2,072,814 | 799,258 | |
| 993,916 | 111,240 | |
| 3,781,386 | 1,725,154 | |
| 660,812 | 35,178 | |
| 2,186,362 | 764,202 | |
| 601,442 | 31,258 | |
| 2,185,128 | 936,544 | |
| 987,942 | 82,684 | |
| 4,348,136 | 2,057,670 | |
| 720,308 | 19,396 | |
| 1,525,506 | 167,520 | |
| 3,513,038 | 1,296,446 | |
| 1,332,554 | 685,584 |
P-distances for within-group (lineage) comparison of ITS2 haplotypes.
| Lineage | Within Group Mean |
|---|---|
| 0.011461282 | |
| 0.00763203 | |
| 0.022749559 | |
| 0.040381791 | |
| 0.007980508 | |
| 0.009270965 | |
| 0.016579356 |
P-distances for between-group (lineage) comparison of ITS2 haplotypes.
| Lineage 1 | Lineage 2 | Between Group Mean |
|---|---|---|
| 0.101 | ||
| 0.088 | ||
| 0.042 | ||
| 0.043 | ||
| 0.091 | ||
| 0.076 | ||
| 0.167 | ||
| 0.126 | ||
| 0.118 | ||
| 0.155 | ||
| 0.116 | ||
| 0.104 | ||
| 0.093 | ||
| 0.097 | ||
| 0.153 | ||
| 0.105 |
P-distances for within-group (isolate) comparison of ITS2 haplotypes for isolates affiliated with the H. pluvialis (A) lineage.
| Isolate | Within Group Mean |
|---|---|
| 0.011758569 | |
| 0.008810573 | |
| 0.011804383 | |
| 0.010791175 |
P-distances for between-group (isolate) comparison of ITS2 haplotypes for isolates affiliated with the H. pluvialis (A) lineage.
| Isolate 1 | Isolate 2 | Between Group Mean |
|---|---|---|
| 0.012467350 | ||
| 0.010637853 | ||
| 0.011981126 | ||
| 0.011695443 |
Variants of ITS2 (cloned and deep sequenced) with number of haplotypes from cloning and deep sequencing and total number of nucleotide polymorphisms.
| Lineage & Isolate | Number of Cloned Sequences | Total Number of Cloned ITS2 Haplotypes | Total Number of Deep Sequenced ITS2 Haplotypes | Total Number of Deep Sequenced Nucleotide Polymorphisms |
|---|---|---|---|---|
| 9 | 5 | 6 | 9 | |
| N/A | N/A | 4 | 5 | |
| 11 | 3 | 3 | 10 | |
| 18 | 6 | 3 | 14 | |
| 10 | 4 | 4 | 6 | |
| 10 | 4 | 5 | 7 | |
| 11 | 3 | 3 | 3 | |
| 8 | 3 | 6 | 6 | |
| N/A | N/A | 4 | 3 |
Fig 2Results from phylogenetic analysis of data from intragenomic ITS2 variants (i.e., haplotypes) of Haematococcus.
Reconstruction is from ML analysis of alignment guided by secondary structure. Branch lengths are drawn proportional to evolutionary change as recorded by the ML analysis. Bootstrap values from NJ, ML and SS are mapped (in that order) to corresponding branches. Lineage labels are from Allewaert et al. [37] and Buchheim et al. [35]. Haplotypes from deep sequencing are denoted with “H” followed by a number. Haplotypes from cloning are denoted with “CS” followed by a number. Published ITS2 sequences that were used as references are denoted by a double arrowhead (<<). The four, haplotypes of SAG 34-1h (H. rubens lineage) that are allied with the H. pluvialis (A) lineage are highlighted with stars. The cloned haplotype of SAG 34-1f (Lineage C) that is allied with the H. pluvialis (A) lineage is highlighted with a pentagon. Haplotypes with unique combinations of nucleotide variants (relative to the reference) are presented in red (clone and sequence) or blue (deep sequenced) boldface.
Comparison of haplotype frequency calculations from replicate deep-sequencing for three isolates from the H. pluvialis (A) lineage.
| SAG 49.94 | Mean | SD | SE | SAG 34-1b | Mean | SD | SE | HP111 | Mean | SD | SE | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Haplotype Frequency | Haplotype Frequency | Haplotype Frequency | ||||||||||||||
| 0.57# | 0.60# | 0.586 | 0.022 | 0.016 | 0.39# | 0.35# | 0.35* | 0.363 | 0.024 | 0.0137 | 0.43# | 0.39# | 0.408 | 0.0304 | 0.0215 | |
| 0.28# | 0.26# | 0.271 | 0.015 | 0.011 | 0.34# | 0.3# | 0.32* | 0.322 | 0.023 | 0.0130 | 0.42# | 0.45# | 0.436 | 0.0247 | 0.0175 | |
| 0.07# | 0.07# | 0.069 | 0.005 | 0.004 | 0.17# | 0.28# | 0.27* | 0.238 | 0.060 | 0.0346 | 0.14# | 0.14# | 0.143 | 0.0021 | 0.0015 | |
| 0.05# | 0.05# | 0.048 | 0.002 | 0.002 | 0.05# | 0.04# | 0.04* | 0.045 | 0.008 | 0.0046 | 0.01# | 0.02# | 0.015 | 0.0028 | 0.002 | |
| 0.03# | 0.03# | 0.027 | 0.000 | 0.000 | 0.03# | 0.02# | 0.01* | 0.020 | 0.009 | 0.0049 | x | x | x | x | x | |
| x | x | x | x | x | 0.01# | 0.01# | 0.01* | 0.012 | 0.002 | 0.0012 | x | x | x | x | x | |
Replicates from the same template are indicated by #.
Replicates derived from a new template are indicated by *.
H = haplotype. SD = standard deviation. SE = standard error of the mean.
Comparison of haplotype frequency calculations from replicate deep-sequencing for two isolates from the H. rubicundus lineage.
| SAG 34-1m | Mean | SD | SE | SAG 34-1c | Mean | SD | SE | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Haplotyype Frequency | Haplotype Frequency | ||||||||||
| 0.67# | 0.762# | 0.716 | 0.065 | 0.046 | 0.67# | 0.65# | 0.63* | 0.649 | 0.019 | 0.0112 | |
| 0.288# | 0.245# | 0.267 | 0.030 | 0.022 | 0.27# | 0.28# | 0.32* | 0.290 | 0.029 | 0.0166 | |
| 0.028# | 0.033# | 0.031 | 0.004 | 0.003 | 0.03# | 0.03# | 0.02* | 0.027 | 0.007 | 0.0043 | |
| 0.015# | 0.018# | 0.017 | 0.002 | 0.002 | 0.02# | 0.02# | 0.02* | 0.020 | 0.004 | 0.0021 | |
| x | X | x | x | x | 0.01# | 0.01# | 0.02* | 0.014 | 0.002 | 0.0009 | |
Replicates from the same template are indicated by #.
Replicates derived from a new template are indicated by *.
H = haplotype. SD = standard deviation. SE = standard error of the mean.