| Literature DB >> 21151906 |
Karen Dawson1, Roger S Thorpe, Anita Malhotra.
Abstract
Genetic variation is the driving force of evolution and as such is of central interest for biologists. However, inadequate discrimination of errors from true genetic variation could lead to incorrect estimates of gene copy number, population genetic parameters, phylogenetic relationships and the deposition of gene and protein sequences in databases that are not actually present in any organism. Misincorporation errors in multi-template PCR cloning methods, still commonly used for obtaining novel gene sequences in non-model species, are difficult to detect, as no previous information may be available about the number of expected copies of genes belonging to multi-gene families. However, studies employing these techniques rarely describe in any great detail how errors arising in the amplification process were detected and accounted for. Here, we estimated the rate of base misincorporation of a widely-used PCR-cloning method, using a single copy mitochondrial gene from a single individual to minimise variation in the template DNA, as 1.62×10(-3) errors per site, or 9.26×10(-5) per site per duplication. The distribution of errors among sequences closely matched that predicted by a binomial distribution function. The empirically estimated error rate was applied to data, obtained using the same methods, from the Phospholipase A(2) toxin family from the pitviper Ovophis monticola. The distribution of differences detected closely matched the expected distribution of errors and we conclude that, when undertaking gene discovery or assessment of genetic diversity using this error-prone method, it will be informative to empirically determine the rate of base misincorporation.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21151906 PMCID: PMC2997787 DOI: 10.1371/journal.pone.0015204
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Deviations from the consensus sequence in the fully sequenced test samples for Phospholipase A2 genes from Ovophis monticola.
| Sample code (no. of sequences) | Seq. length | Number of samples showing given no. of base differences from group consensus sequence: observed (expected) | X2 value |
| |||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | ||||
| B664 | 1754 | 4 (2) | 5 (7) | 11 (10) | 5 (8) | 7 (7) | 8 (4) | 2 (2) | 0 (1) | 8.78 | 0.19 |
| B664 | 1724 | 1 (2) | 6 (4) | 9 (6) | 6 (6) | 2 (4) | 1 (2) | 0 (1) | 0 (0) | 5.33 | 0.38 |
| ROM 39382 group A (44) | 1847 | 3 (2) | 7 (7) | 7 (10) | 11 (10) | 11 (7) | 2 (4) | 3 (2) | 0 (1) | 4.65 | 0.59 |
| ROM 39382 group B (23) | 1694 | 1 (1) | 2 (4) | 3 (6) | 8 (5) | 5 (3) | 3 (2) | 1 (1) | 0 (0) | 5.82 | 0.44 |
*a further 10 samples from B664 did not fall into either group or show sufficient similarity to one another to form a third group.
The expected frequencies are calculated based on the error rates calculated for MT-CYB (see text), under a binomial probability distribution using the average length of the sequence in that group. χ2 values for test groups are derived from a comparison of the number of sequences showing given numbers of deviations from the consensus sequence with the expected distribution of errors in a set of sequences of the same length.
PCR and sequencing primers for pitviper Phospholipase A2 toxin genes.
| Name | Direction | Location (approx) | Sequence (5′-3′) |
| New PLA5′ | Forward | 1–19 |
|
| PLA2 Up1 | Forward | Upstream of position 1 |
|
| iF | Forward | 575–594 |
|
| R for 5′ | Reverse | 710–729 |
|
| F for 3′ | Forward | 1402–1423 |
|
| iR | Reverse | 1650–1668 |
|
| New PLA3′ | Reverse | 2102–2129 |
|
Samples used various combinations of the following primers to obtain full length sequence. Asterisks indicate primers used for PCR. PLA2 Primers were developed in previous PLA2 sequencing work (Anita Malhotra, unpublished data) apart from Up1, which is a novel primer designed as an alternative to New PLA5′ in some amplifications. Sequence locations are derived from the aligned gene sequence data used in the analyses.