| Literature DB >> 22278883 |
V G Fonseca1, B Nichols, D Lallias, C Quince, G R Carvalho, D M Power, S Creer.
Abstract
Eukaryotic diversity in environmental samples is often assessed via PCR-based amplification of nSSU genes. However, estimates of diversity derived from pyrosequencing environmental data sets are often inflated, mainly because of the formation of chimeric sequences during PCR amplification. Chimeras are hybrid products composed of distinct parental sequences that can lead to the misinterpretation of diversity estimates. We have analyzed the effect of sample richness, evenness and phylogenetic diversity on the formation of chimeras using a nSSU data set derived from 454 Roche pyrosequencing of replicated, large control pools of closely and distantly related nematode mock communities, of known intragenomic identity and richness. To further investigate how chimeric molecules are formed, the nSSU gene secondary structure was analyzed in several individuals. For the first time in eukaryotes, chimera formation proved to be higher in both richer and more genetically diverse samples, thus providing a novel perspective of chimera formation in pyrosequenced environmental data sets. Findings contribute to a better understanding of the nature and mechanisms involved in chimera formation during PCR amplification of environmentally derived DNA. Moreover, given the similarities between biodiversity analyses using amplicon sequencing and those used to assess genomic variation, our findings have potential broad application for identifying genetic variation in homologous loci or multigene families in general.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22278883 PMCID: PMC3351157 DOI: 10.1093/nar/gks002
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Mean numbers of OTUs, denoised sequences, chimera percentages and reads for the pools of close and distantly related nematodes with 48, 24 and 12 individuals, respectively
| Species phylogeny | Species number | OTUs at 99% | Denoised sequences | Chimera (%) | Read number |
|---|---|---|---|---|---|
| Close | 48 | 87.6 | 138.40 | 35.60 | 13 882.20 |
| Close | 24 | 40.4 | 63.20 | 34.55 | 3809.00 |
| Close | 12 | 35.8 | 42.80 | 14.57 | 6159.80 |
| Distant | 48 | 63.2 | 161.00 | 58.98 | 5657.80 |
| Distant | 24 | 53.6 | 119.00 | 53.57 | 10 134.20 |
| Distant | 12 | 34.4 | 58.20 | 39.93 | 7638.20 |
Figure 1.Chimera percentage and Shannon Index of closely and distantly related pools of nematodes.
Figure 2.(a) Nucleotide diversity (Shannon Index) and (b) breakpoint frequencies occurrence in single nematodes and parental chimeric sequences, respectively.
Figure 3.Most frequent predicted secondary structures found on the 18S rDNA amplicon at (a) 55°C and (b) 65°C folding temperatures, using as an example two single nematodes. Arrows indicate where the most frequent breakpoints occur generally matching hairpin-loops. 79TN11 consensus and 25LBFB8F04 indicate the labels given for each nematode. dG: free energy necessary for sequence stability at a given temperature.