| Literature DB >> 31803173 |
Jun-Liang Zhou1,2, Jianping Xu3,4, An-Guo Jiao1, Li Yang1, Jie Chen5, Philippe Callac6, Yu Liu1, Shou-Xian Wang1.
Abstract
The polymerase chain reaction (PCR) is widely used in modern biology and medicine. However, PCR artifacts can complicate the interpretation of PCR-based results. The internal transcribed spacer (ITS) region of the ribosomal RNA gene cluster is the consensus fungal barcode marker and suspected PCR artifacts have been reported in many studies, especially for the analyses of environmental fungal samples. At present, the patterns of PCR artifacts in the whole fungal ITS region (ITS1+5.8S+ITS2) are not known. In this study, we analyzed the error rates of PCR at three template complexity levels using the divergent copies of ITS from the mushroom Agaricus subrufescens. Our results showed that PCR using the Phusion® High-Fidelity DNA Polymerase has a per nucleotide error rate of about 4 × 10-6 per replication. Among the detected mutations, transitions were much more frequent than transversions, insertions, and deletions. When divergent alleles were mixed as templates in the same reaction, a significant proportion (∼30%) of recombinant molecules were detected. The in vitro mixed-template results were comparable to those obtained from using the genomic DNA of the original mushroom specimen as template. Our results indicate that caution should be in place when interpreting ITS sequences from individual fungal specimens, especially those containing divergent ITS copies. Similar results could also happen to PCR-based analyses of other multicopy DNA fragments as well as single-copy DNA sequences with divergent alleles in diploid organisms.Entities:
Keywords: ITS; chimera sequence; mutation spectrum; polymerase chain reaction; recombination
Year: 2019 PMID: 31803173 PMCID: PMC6877668 DOI: 10.3389/fmicb.2019.02686
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Experimental design for assessing in vitro PCR artifacts of the consensus fungal barcode marker ITS.
Mutation and recombination frequencies detected within each of the 11 PCR treatments.
| A (PCR) | 110 | 110 (100%) | – | – | – | – | – | – | – | – | – | 100 ± 0 a |
| B (PCR) | 109 | – | 109 (100%) | – | – | – | – | – | – | – | – | 100 ± 0 a |
| C (PCR) | 110 | – | – | 110 (100%) | – | – | – | – | – | – | – | 100 ± 0 a |
| A (Clone) | 653 | 633 (96.94%) | – | – | – | – | – | – | – | 20 (3.06%) | 20 (3.06%) | 96.94 ± 1.06 b |
| B (Clone) | 641 | – | 618 (96.41%) | – | – | – | – | – | – | 23 (3.59%) | 23 (3.59%) | 96.41 ± 1.47 b |
| C (Clone) | 648 | – | – | 624 (96.30%) | – | – | – | – | – | 24 (3.70%) | 24 (3.70%) | 96.30 ± 1.12 b |
| A+B | 621 | 243 (39.13%) | 190 (30.60%) | – | 101 (16.27%) | 43 (6.92%) | 20 (3.22%) | 3 (0.48%) | 8 (1.29%) | 13 (2.09%) | 188 (30.27%) | 69.51 ± 2.52 c |
| B+C | 630 | – | 204 (32.38%) | 221 (35.08%) | 108 (17.14%) | 50 (7.94%) | 23 (3.65%) | 3 (0.48%) | 5 (0.79%) | 16 (2.54%) | 205 (32.54%) | 67.33 ± 3.36 c |
| A+C | 635 | 218 (34.33%) | – | 231 (36.38%) | 105 (16.54%) | 44 (6.93%) | 16 (2.52%) | 1 (0.16%) | 7 (1.10%) | 13 (2.05%) | 186 (29.29%) | 69.91 ± 2.20 c |
| A+B+C | 636 | 150 (23.58%) | 122 (19.21%) | 136 (21.38%) | 112 (17.61%) | 56 (8.80%) | 28 (4.40%) | 4 (0.63%) | 11 (1.73%) | 17 (2.67%) | 228 (35.85%) | 64.65 ± 3.38 c |
| Original DNA | 636 | 169 (26.57%) | 131 (20.60%) | 138 (21.70%) | 103 (16.20%) | 47 (7.39%) | 15 (2.36%) | 2 (0.31%) | 11 (1.73%) | 20 (3.14%) | 198 (31.13%) | 68.07 ± 3.55 c |
| Total | 5429 | 1523 (28.05%) | 1374 (25.31%) | 1460 (26.89%) | 529 (16.75%)2 | 240 (7.60%)2 | 102 (3.23%)2 | 13 (0.41%)2 | 42 (1.33%)2 | 146 (3.69%)3 | 1072 (21.02%)3 | |
Summary mutations and error rate of each PCR treatment.
| A (PCR) | 110 | – | – | – | – | – | – | – | – | – | – | – | – | – | 0 | 0 a |
| B (PCR) | 109 | – | – | – | – | – | – | – | – | – | – | – | – | – | 0 | 0 a |
| C (PCR) | 110 | – | – | – | – | – | – | – | – | - | – | – | – | – | 0 | 0 a |
| A (Clone) | 653 | 2 | 6 | 5 | 3 | – | 1 | 1 | – | – | 2 | – | – | – | 20 | 3.53 ± 0.22 b |
| B (Clone) | 641 | 1 | 8 | 6 | 1 | – | 2 | 1 | 1 | – | 2 | 1 | – | – | 23 | 4.05 ± 0.31 bc |
| C (Clone) | 648 | 2 | 7 | 11 | 1 | – | – | 1 | – | – | 1 | 1 | – | – | 24 | 3.96 ± 0.46 bc |
| A + B | 621 | – | 7 | 12 | 1 | – | 1 | – | 1 | – | 2 | – | – | – | 24 | 4.5 ± 0.55 c |
| B + C | 630 | 2 | 6 | 7 | 2 | 1 | 1 | – | 2 | – | 1 | 1 | 1 (G) | 1 (T) | 25 | 4.51 ± 0.41 c |
| A + C | 635 | 2 | 7 | 8 | 2 | 1 | – | 1 | 1 | – | 1 | – | – | – | 23 | 4.2 ± 0.29 c |
| A + B + C | 636 | 2 | 10 | 8 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | – | 1 (T) | 31 | 5.43 ± 0.33 d |
| Original DNA | 636 | 3 | 9 | 11 | 3 | 1 | 2 | 3 | 2 | – | 1 | 1 | – | 1 (T) | 37 | 6.47 ± 0.39 d |
| Total | 5429 | 14 | 60 | 68 | 14 | 4 | 8 | 8 | 9 | 1 | 11 | 5 | 2 | 3 | 207 | |
Relative mutation frequencies in different regions of the ITS marker.
| A | 40 | 20 | 40 | 3.43 | 3.35 | 4.08 |
| B | 30.4 | 26.1 | 43.5 | 2.8 | 5.02 | 5.07 |
| C | 37.5 | 25 | 37.5 | 3.41 | 4.51 | 4.12 |
| A+B | 33.3 | 20.8 | 45.8 | 3.4 | 4.32 | 5.73 |
| A+C | 30.4 | 17.4 | 52.2 | 3.23 | 3.47 | 6.22 |
| B+C | 32 | 20 | 48 | 3.46 | 4.29 | 6.25 |
| A+B+C | 35.5 | 25.8 | 38.7 | 4.45 | 6.51 | 6.2 |
| Original | 29.7 | 24.3 | 45.9 | 4.14 | 7.01 | 8.03 |
| G to A | 25 | 16.7 | 58.3 | 0.76 | 1.06 | 2.25 |
| A to G | 35.7 | 14.3 | 50 | 0.25 | 0.21 | 0.45 |
| T to C | 28.6 | 28.6 | 42.8 | 0.2 | 0.42 | 0.39 |
| C to T | 29.4 | 22.1 | 48.5 | 1.01 | 1.59 | 2.12 |
| T to A | 66.7 | – | 33.3 | 0.4 | – | 0.26 |
| A to C | 50 | – | 50 | 0.1 | – | 0.13 |
| C to A | 33.3 | 22.2 | 44.4 | 0.15 | 0.21 | 0.26 |
| C to G | – | – | 100 | – | – | 0.06 |
| G to T | 57.1 | 14.3 | 28.6 | 0.2 | 0.11 | 0.13 |
| T to G | 60 | 20 | 20 | 0.15 | 0.11 | 0.06 |
Base compositions of one single strand of the whole ITS region and its three constituent parts (ITS1, 5.8S and ITS2) in the three ITS sequence types of A. subrufescens strain CA487.
| Total length | 662 | 292 | 154 | 216 | 661 | 291 | 154 | 216 | 660 | 291 | 154 | 215 |
| G | 154 | 68 | 37 | 49 | 156 | 70 | 38 | 48 | 154 | 68 | 37 | 49 |
| C | 126 | 50 | 33 | 43 | 127 | 51 | 33 | 43 | 128 | 51 | 33 | 44 |
| A | 151 | 63 | 41 | 47 | 149 | 60 | 40 | 49 | 149 | 62 | 41 | 46 |
| T | 231 | 111 | 43 | 77 | 229 | 110 | 43 | 76 | 229 | 110 | 43 | 76 |
| G content | 23.3% | 23.3% | 24% | 22.7% | 23.6% | 24.1% | 24.7% | 22.2% | 23.3% | 23.4% | 24% | 22.8% |
| C content | 19% | 17.1% | 21.4% | 19.9% | 19.2% | 17.5% | 21.4% | 19.9% | 19.4% | 17.5% | 21.4% | 20.5% |
| A content | 22.8% | 21.6% | 26.6% | 21.8% | 22.5% | 20.6% | 26% | 22.7% | 22.6% | 21.3% | 26.6% | 21.4% |
| T content | 34.9% | 38% | 27.9% | 35.6% | 34.6% | 37.8% | 27.9% | 35.2% | 34.7% | 37.8% | 27.9% | 35.3% |
| GC content | 42.3% | 40.4% | 45.5% | 42.6% | 42.8% | 41.6% | 46.1% | 42.1% | 42.7% | 40.9% | 45.5% | 43.3% |
| AT content | 57.7% | 59.6% | 54.5% | 57.4% | 57.2% | 58.4% | 53.9% | 57.9% | 57.3% | 59.1% | 54.5% | 56.7% |
FIGURE 2Distribution of in vitro PCR recombination events along the sequenced ITS DNA fragment. In the present study, base positions 1–32 are part of the 18S; positions 33–324 (type A), and 33–323 (types B and C) represent ITS1; positions 325–478 (type A) and 324–477 (types B and C) represent 5.8S; positions 479–694 (type A), 478–693 (type B), and 478–692 (type C) represent ITS2; and positions 695–731 (type A), 694–730 (type B), and 693–729 (type C) are part of the 26S. Because of the different lengths of types A, B, and C sequences, their aligned dataset was used to infer the regions where recombination occurred. Type A and C have the same 5.8S gene sequence, while type B has a different base at the position 366, adenine for type A and C, guanine for type B.