| Literature DB >> 25202592 |
Simon Uribe-Convers1, Justin R Duke2, Michael J Moore3, David C Tank1.
Abstract
PREMISE OF THE STUDY: We present an alternative approach for molecular systematic studies that combines long PCR and next-generation sequencing. Our approach can be used to generate templates from any DNA source for next-generation sequencing. Here we test our approach by amplifying complete chloroplast genomes, and we present a set of 58 potentially universal primers for angiosperms to do so. Additionally, this approach is likely to be particularly useful for nuclear and mitochondrial regions. • METHODS ANDEntities:
Keywords: angiosperms; chloroplast enrichment; long PCR; next-generation sequencing; plastome; universal chloroplast PCR primers
Year: 2014 PMID: 25202592 PMCID: PMC4104715 DOI: 10.3732/apps.1300063
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
List of species included in this study, with voucher information, tissue sources, and NGS assembly statistics when available.
| Species | Order/Family | Collection no. | Herbarium | Type of tissue | Collection date | No. of amplified regions | Region no. not amplified | Base pairs sequenced | No. of contigs | CAL bp (min–max) | Ave. assembly depth | No. of masked bp | % of masked bp | N50 | % called bases | No. of ambiguous bases | % of ambiguous bases |
| Lamiales/Orobanchaceae | Uribe-Convers 2010-22 | ID | Silica gel–dried | 5 July 2010 | 16 | n/a | 125,283 | 25 | 5011 (204–28,257) | 656 | 2126 | 1.7 | 19,294 | 99.9729 | 34 | 0.02714 | |
| Lamiales/Orobanchaceae | Tank 1046 | ID | Silica gel–dried | 13 July 2009 | 16 | n/a | 133,595 | 10 | 13,360 (1222–48,767) | 641 | 101 | 0.08 | 37,107 | 99.9948 | 7 | 0.00524 | |
| Lamiales/Orobanchaceae | Olmstead 2001-78 | WTU | Silica gel–dried | 4 July 2001 | 16 | n/a | 122,614 | 11 | 11,147 (464–34,602) | 664 | 440 | 0.36 | 33,049 | 99.9976 | 3 | 0.00245 | |
| Lamiales/Orobanchaceae | Tank 2001-49 | WTU | Silica gel–dried | 21 July 2001 | 16 | n/a | 122,046 | 8 | 15,256 (819–50,680) | 642 | 260 | 0.21 | 28,529 | 99.9984 | 2 | 0.00164 | |
| Lamiales/Orobanchaceae | Tank 1048-b | ID | Silica gel–dried | 13 July 2009 | 16 | n/a | 134,704 | 4 | 33,676 (6157–75,123) | 844 | 35 | 0.03 | 75,123 | 99.9970 | 4 | 0.00297 | |
| Lamiales/Orobanchaceae | Tank 2009-8 | ID | Silica gel–dried | 6 June 2009 | 16 | n/a | 125,490 | 4 | 31,372 (3039–73,629) | 764 | 29 | 0.02 | 73,629 | 99.9984 | 2 | 0.00159 | |
| Lamiales/Orobanchaceae | Uribe-Convers 2010-24 | ID | Silica gel–dried | 7 July 2010 | 15 | 13 | 119,828 | 14 | 8559 (425–67,195) | 707 | 1045 | 0.87 | 67,195 | 99.9967 | 4 | 0.00334 | |
| Lamiales/Orobanchaceae | Tank 2001-35 | WTU | Silica gel–dried | 24 June 2001 | 15 | 10 | 119,647 | 14 | 8546 (204–28,559) | 642 | 394 | 0.33 | 18,856 | 99.9983 | 2 | 0.00167 | |
| Lamiales/Orobanchaceae | Olmstead 2009-22 | WTU | Silica gel–dried | 5 Mar. 2009 | 15 | 7 | 113,650 | 21 | 5412 (178–39,914) | 698 | 1525 | 1.34 | 14,541 | 99.9938 | 7 | 0.00616 | |
| Lamiales/Orobanchaceae | Egger 1213 | WTU | Silica gel–dried | 22 Feb. 2002 | 15 | 13 | 108,071 | 3 | 36,024 (269–97,615) | 925 | 198 | 0.18 | 97,615 | 99.9991 | 1 | 0.00093 | |
| Lamiales/Orobanchaceae | Tank 2002-04 | WTU | Silica gel–dried | 27 Apr. 2004 | 14 | 9, 10 | 122,182 | 23 | 5312 (179–36,972) | 540 | 810 | 0.66 | 11,656 | 99.9844 | 19 | 0.01555 | |
| Lamiales/Orobanchaceae | Fairbarns s.n. | WTU | Silica gel–dried | 21 July 2005 | 14 | 10, 14 | 111,371 | 10 | 11,137 (616–44,011) | 688 | 547 | 0.49 | 18,398 | 99.9982 | 2 | 0.00180 | |
| Lamiales/Orobanchaceae | Zak & Jaramillo, 3387 | F | Herbarium | 16 Jan. 1988 | 14 | 9, 10 | 108,767 | 30 | 3626 (214–36,850) | 652 | 2255 | 2.07 | 11,012 | 99.9669 | 36 | 0.03310 | |
| Lamiales/Orobanchaceae | Tank 2001-27 | WTU | Silica gel–dried | 19 June 2001 | 10 | 6, 9, 10, 13, 14, 16 | 83,384 | 20 | 4169 (222–36,830) | 717 | 1544 | 1.85 | 9986 | 99.9676 | 27 | 0.03238 | |
| Lamiales/Orobanchaceae | Tank 2005-27 | WTU | Silica gel–dried | 16 Apr. 2005 | 6 | 4, 6, 7, 8, 9, 10, 13, 14, 15, 16 | 73,378 | 15 | 4892 (186–36,621) | 701 | 1187 | 1.62 | 9803 | 99.9877 | 9 | 0.01227 | |
| Lamiales/Plantaginaceae | Brunsfeld 4159 | ID | Herbarium | 14 June 2001 | 16 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Asterales/Asteraceae | Willard 2013-42 | ID | Silica gel–dried | 3 July 2013 | 15 | 5 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Apiales/Apiaceae | Poor 21 | ID | Herbarium | 27 May 2004 | 15 | 14 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Nymphaeales/Nymphaeaceae | Morales-Briones 412 | ID | Silica gel–dried | 8 July 2013 | 15 | 5 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Malpighiales/Salicaeae | Brunsfeld 7213 | ID | Herbarium | 11 June 2008 | 15 | 9 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Rosales/Rosaceae | Hetrick 1005 | ID | Herbarium | 10 Apr. 1996 | 13 | 9, 14, 17 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Caryophyllales/Polygonaceae | Smith 8040 | ID | Herbarium | 23 June 2005 | 12 | 5, 6, 9, 15 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Laurales/Lauraceae | Halse 6901 | ID | Herbarium | 28 Mar. 2002 | 12 | 6, 8, 9, 10 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Poales/Poaceae | Clippinger 2 | ID | Herbarium | 1 May 2004 | 11 | 5, 6, 9, 11, 17 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Fagales/Betulaceae | Gray 52 | ID | Herbarium | 7Aug. 1989 | 10 | 5, 6, 8, 9, 10, 14 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Poales/Poaceae | Willard 2013-26 | ID | Silica gel–dried | 3 July 2013 | 10 | 5, 6, 9, 12, 13, 14 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Asterales/Asteraceae | Willard 2013-21 | ID | Silica gel–dried | 3 July 2013 | 10 | 3, 5, 6, 8, 9, 11 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Pinales/Pinaceae | 1419-46 | WA Park Arb. | Silica gel–dried | 24 May 2009 | 9 | 4, 6, 7, 9, 10, 11, 12 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Brassicales/Brassicaceae | Brunsfeld 6313 | ID | Herbarium | 1 June 2005 | 8 | 4, 6, 8, 9, 10, 13, 14, 17 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Fabales/Fabaceae | Willard 2013-03 | ID | Silica gel–dried | 3 July 2013 | 8 | 1, 6, 8, 9, 10, 12, 13, 14 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Pinales/Pinaceae | 1005-47 | WA Park Arb. | Silica gel–dried | 24 May 2009 | 7 | 4, 5, 6, 7, 8, 9, 10, 11, 12 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Asterales/Asteraceae | Smith 9421 | ID | Herbarium | 4 June 2007 | 7 | 4, 5, 6, 7, 8, 9, 10, 11, 13 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Pinales/Pinaceae | 1084-49 | WA Park Arb. | Silica gel–dried | 24 May 2009 | 6 | 1, 3, 4, 6, 7, 8, 9, 10, 11, 12 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | |
| Average | 11,493 | 14.13 | 13,166.60 | 698.73 | 833.07 | 0.79 | 35,052.87 | 99.99 | 10.6 | 0.01 |
Note: CAL = contig average length; F = Field Museum of Natural History Herbarium; ID = University of Idaho Stillinger Herbarium; WA Park Arb. = Washington Park Arboretum; WTU = University of Washington Herbarium.
All data from the 16 chosen primer combinations.
The number of the regions is the same as the order in Fig. 1.
Base pairs (bp) sequenced is the sum of all contigs when including only one copy of the inverted repeat.
Number of bases masked because the minimum sequencing depth of 5× was not achieved.
Percentage of unambiguously called bases.
Universal angiosperm primers used for chloroplast genome amplifications. The 16 primer combinations chosen for this study are in bold with approximate amplicon sizes in kilobases (kb) indicated.
| Region no. | Approx. size (kb) | Primer (F/R) | Primer sequence (5′–3′) | Overlap between regions in bp |
| 1 | 8 | CCTTRATCCACTTGGCTACAT | Regions 1 & 2 = 542 | |
| 1 | ACTTACAGCAGCTTGCCAAAC | Regions 1 & 2a = 542 | ||
| 2/2a | 10.3/6.3 | GGACGGAAGGATTCGAACC | Regions 2a & 2b = 627 | |
| 2a | CTGCYGCTTCYGTTATTGCT | Regions 2b & 3 = 2059 | ||
| 2b | 4 | CGGTATTAAACCCGAAACTCC | Regions 2 & 3 = 2059 | |
| 2/2b | GYCGTATYGATTGGTTRAAAGG | Regions 3 & 4 = 1274 | ||
| 3 | 7 | CRGCTAAAGTTGCAAAAATAAGAGCT | Regions 4 & 5 = 860 | |
| 3 | GRGATCAAATGGCTGTTCAT | Regions 5 & 6 = 618 | ||
| 4 | 9 | GTTCGTACAGCAGTATCYACAAC | Regions 6 & 7 = 764 | |
| 4 | GCCCAAGCRAGACTTACTATATCC | Regions 7 & 8 = 153 | ||
| 5 | 10.5 | CCCAGTTCAAATCCGGGT | Regions 8 & 9 = 1216 | |
| 5 | GCRGCTTTCTTGATTGCYTC | Regions 9 & 10 = 135 | ||
| 6 | 10 | GGTTATGAGCCTTGCGAGCTA | Regions 10 & 11 = 771 | |
| 6 | GGTTAGAGCATCGCATTTGTAATG | Regions 11 & 12 = 2781 | ||
| 7 | 10.3 | GGTTTGCARCGATAACTTGGKATATC | Regions 12 & 13 = 142 | |
| 7 | GTCCATGTACCAGTAGARGATTC | Regions 13 & 14 = 392 | ||
| 8 | 9.2 | TGTCACCACAAACAGARACTAAAG | Regions 14 & 15 = 1911 | |
| 8 | GGCYGATACTACTGGAAGRAT | Regions 16 & 1 = 840 | ||
| 9 | 9.8 | CTTCAAGAYCCATTACGTGTHCAAG | ||
| 9 | TRCCYTGTCTCCACATTGGAT | |||
| 10 | 10.9 | GGGTTTRCCTTGGTATCGTGT | ||
| 10 | ATCCACTTGGTTTYMGACTTGG | |||
| 11 | 8.7 | AACCAACGAGTCACACACTAAGC | ||
| 11/16 | CAGATCATGAATGTTTGGAATCCAT | |||
| 12 | 10 | TCGGGATCCTRATGCATATAGATAC | ||
| 12 | GTTGCCAGAGTACGMTTAACCT | |||
| 13 | 11 | CCCTTGTTGACGATCCTTTACTC | ||
| 13 | CCGACCACAACGACCGAAT | |||
| 14/15 | 11.2 | CCGCTCTACCACTGAGCTAC | ||
| 14 | GCTGCTCAATCDATTAGTTATGAA | |||
| 15 | 10.5 | CGAACRCATACTTCACAAGCAA | ||
| 16 | 8.2 | GCTATGCATGGTTCYTTGGTAAC | ||
| rps16.50R | CGAACATCAATTGCAACGATTCGATA | |||
| rps16.50F | TATCGAATCGTTGCAATTGATGTTCG | |||
| psbK.200F | GGCAAGCTGCTGTAAGTTTTCGA | |||
| atpF.70F | GGGTTTAATACCGATATTTTAGCAAC | |||
| trnR.UCU.45F | GGTATAGGTTCAAATCCTATTGGAC | |||
| trnQ.UUG.47F | CGGAGGTTCGAATCCTTCC | |||
| trnK.UUU.3R | GAGATGGCAACTCAATCGTTG | |||
| trnK.UUU.3F | CAACGATTGAGTTGCCATCTC | |||
| atpA.430F | CGTTCYGTATATGARCCTCTTCAAAC | |||
| atpA.820F | ATCGMCAAATGTCTCTTCTATTAMG | |||
| ccsA.890R | TCCAAGTAATAAANGCCCAAGTTTC | |||
| trnR.ACG.15F | GAGGATTAGAGCACGTGG | |||
| ycf1.70F | GTGGTCGGACTCTATTATGGAT | |||
| trnL.UAG.18F | GGTAGACACGCTGCTCTTAGG | |||
| trnL.UAG.19F | GTAGACACGCTGCTCTTAGGAAG | |||
| rps12.320R | GGGTTCCTCGAACAATGTGATATC | |||
| rpl2.550F | GTGCTGTAGCGAAACTGATTG | |||
| rpl2.640F | TCAGCAACAGTCGGACARGT | |||
| psbT.3F | TGGAAGCATTGGTTTATACATTYCT | |||
| atpB.1290R | ARGGTTGTGATAAGAAACGYTCAA | |||
| trnT.UGU.42F | GATGGTCATCGGTTCGATTC | |||
| psbC.3R | AGTTCCATTAAAGAGCGTTTCC | |||
| psbD.860F | CYGGTTTATGGATGAGYGCT | |||
| rpoB.900R | CGTCGACCAATCYTTCCTAATTC | |||
| rpoB.470R | CCRGGRCTTTGCAATATTTGATTG | |||
| rpoC2.430R | ATRGGTAAATCAATCATTTGYCCTTG |
All primers are shown in the 5′ to 3′ direction; the name of each primer consists of three parts: the gene in which the primer is anchored, the approximate position of the primer within that gene, and either an “F” or an “R.” It is important to note that the F and R designations do not indicate that the primer should be used as a forward or reverse primer; rather, they indicate the 5′ to 3′ orientation of the primer with respect to the gene—i.e., a primer that is designated as an “F” primer has its 5′ to 3′ orientation in the same orientation as the gene (i.e., on the forward strand), whereas an “R” primer is oriented in the direction opposite to the 5′ to 3′ orientation of the gene (i.e., on the reverse strand).
Overlap between regions is given in number of base pairs (bp), without taking the length of the primers into consideration.
Fig. 1.The final annotated chloroplast genome assembly of Bartsia inaequalis with the 16 overlapping primer combinations indicated. Note that the primer combinations for regions 11, 12, 13, and 16 amplify both inverted repeat A and B in a single reaction. Photos by Simon Uribe-Convers.
Protocol for long PCR for amplification of 4–20-kb targets. Developed by the Tank Laboratory, University of Idaho; published January 2014.
| Product | Contents | Catalog no. |
| QIAGEN | 250 units | 201205 |
| QIAGEN HotStar HiFidelity DNA Polymerase | 100 units HotStar HiFidelity DNA Polymerase | 202602 |
Almost any high-quality Taq polymerase should work; however, cheap Taq polymerases (e.g., QIAGEN TopTaq or Promega GoTaq) do not work and result in large smears, rather than discrete bands.
QIAGEN HotStar HiFidelity DNA Polymerase was the only high-fidelity polymerase used in this study.
Q-solution does seem to be an important additive, thus the use of QIAGEN Taq. However, this does work using Q-solution with other high-quality Taq polymerases such as Promega’s or New England Biolab’s standard Taq (i.e., if you have a stock of Q-solution, but no QIAGEN Taq).
| Reagents to prepare the HotStar | Volumes for 25 reactions (total 12.5 µL) | Volumes for 50 reactions (total 25 µL) | Volumes for 100 reactions (total 50 µL) |
| 5× HotStar HiFidelity PCR buffer | 2.5 μL | 5.0 μL | 10 μL |
| H2O | 9.0 μL | 18 μL | 36 μL |
| QIAGEN HotStar | 1.0 μL | 2.0 μL | 4.0 μL |
| Cocktail | ×1 (25 μL reaction) |
| 10× PCR buffer (QIAGEN CoralLoad PCR Buffer or colorless, 15 mM MgCl2) | 2.5 μL |
| MgCl2 (25 mM) | 1.0 μL (3 mM final conc.; adjustable) |
| dNTP (10 mM each) | 0.75 μL (3 μL of 2.5 mM each) |
| Q solution (5×) | 5.0 μL |
| 5′ primer (5 μM) | 2.5 μL (0.5 μM final conc.) |
| 3′ primer (5 μM) | 2.5 μL (0.5 μM final conc.) |
| 0.25 μL (1.25 units) | |
| QIAGEN HotStar DNA polymerase (diluted) | 0.50 μL |
| H2O | to 25 μL (9 μL if using 1.0 μL DNA) |
The success rate was lower when a smaller quantity was used, but the best DNAs work with ≥0.125 μL.
Primer combinations for long PCR amplification of the chloroplast genome.,
| Region no. | Approx. size (kb) | Primers (F/R) | Primer sequence (5′–3′) |
| 1 | 8 | trnH.GUG.6R | CCTTRATCCACTTGGCTACAT |
| psbK.195R | ACTTACAGCAGCTTGCCAAAC | ||
| 2 | 10.3 | trnQ.UUG.50R | GGACGGAAGGATTCGAACC |
| rpoC2.4805F | GYCGTATYGATTGGTTRAAAGG | ||
| 2a | 6.3 | trnQ.UUG.50R | GGACGGAAGGATTCGAACC |
| atpH.17F | CTGCYGCTTCYGTTATTGCT | ||
| 2b | 4 | atpF.65R | CGGTATTAAACCCGAAACTCC |
| rpoC2.4805F | GYCGTATYGATTGGTTRAAAGG | ||
| 3 | 7 | atpI.705R | CRGCTAAAGTTGCAAAAATAAGAGCT |
| rpoC1.1670F | GRGATCAAATGGCTGTTCAT | ||
| 4 | 9 | rpoC2.520R | GTTCGTACAGCAGTATCYACAAC |
| petN.3R | GCCCAAGCRAGACTTACTATATCC | ||
| 5 | 10.5 | trnC.GCA.47F | CCCAGTTCAAATCCGGGT |
| psaB.2170F | GCRGCTTTCTTGATTGCYTC | ||
| 6 | 10 | trnfM.CAU.21R | GGTTATGAGCCTTGCGAGCTA |
| trnT.UGU.17F | GGTTAGAGCATCGCATTTGTAATG | ||
| 7 | 10.3 | rps4.380R | GGTTTGCARCGATAACTTGGKATATC |
| rbcL.178R | GTCCATGTACCAGTAGARGATTC | ||
| 8 | 9.2 | rbcL.2F | TGTCACCACAAACAGARACTAAAG |
| psbJ.3F | GGCYGATACTACTGGAAGRAT | ||
| 9 | 9.8 | petA.920F | CTTCAAGAYCCATTACGTGTHCAAG |
| psbB.160R | TRCCYTGTCTCCACATTGGAT | ||
| 10 | 10.9 | psbB.3F | GGGTTTRCCTTGGTATCGTGT |
| rps3.17F.new | ATCCACTTGGTTTYMGACTTGG | ||
| 11 | 8.7 | rpl16.3R | AACCAACGAGTCACACACTAAGC |
| ycf2.5100R | CAGATCATGAATGTTTGGAATCCAT | ||
| 12 | 10 | ycf2.2300F | TCGGGATCCTRATGCATATAGATAC |
| rps12.190F | GTTGCCAGAGTACGMTTAACCT | ||
| 13 | 11 | rps12.360R | CCCTTGTTGACGATCCTTTACTC |
| ycf1.59R | CCGACCACAACGACCGAAT | ||
| 14 | 11.2 | trnN.GUU.7R | CCGCTCTACCACTGAGCTAC |
| ndhA.535F | GCTGCTCAATCDATTAGTTATGAA | ||
| 14′ | 7 | trnR.ACG.15F | GAGGATTAGAGCACGTGG |
| ccsA.890R | TCCAAGTAATAAANGCCCAAGTTTC | ||
| 15 | 10.5 | ndhI.194R | CGAACRCATACTTCACAAGCAA |
| trnN.GUU.7R | CCGCTCTACCACTGAGCTAC | ||
| 16 | 8.2 | psbA.640F | GCTATGCATGGTTCYTTGGTAAC |
| ycf2.5100R | CAGATCATGAATGTTTGGAATCCAT |
Universal primers designed by M.J.M.; compiled and tested by D.C.T. and S.U.C.
Ta should be ∼5°C below Tm of primers; however, temperatures of 55°C have worked for all primer combinations.
The name of each primer consists of three parts: (1) the gene in which the primer is anchored in, (2) the approximate position of the primer within that gene (based on all-angiosperm alignment per Moore et al., 2007), and (3) either an “F” or an “R.” The F and R designations do not indicate that the primer should be used as a forward or reverse primer; rather, they indicate the 5′ to 3′ orientation of the primer with respect to the gene. In other words, a primer that is designated as an F primer has its 5′ to 3′ orientation in the same orientation as the gene (i.e., on the forward strand, or from start to stop), whereas an R primer is oriented in the direction opposite to the 5′ to 3′ orientation of the gene (i.e., on the reverse strand).
Regions 2a and 2b can be used to amplify region 2 in two pieces.
Regions 11, 12, and 13 represent a large portion of the inverted repeat (IR), thus, one amplification for both IRa and IRb.
Region 14′ amplifies ca. 2/3 of region 14.