| Literature DB >> 16336644 |
Abstract
BACKGROUND: Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA) is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate approximately 1 - 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16336644 PMCID: PMC1318494 DOI: 10.1186/1471-2164-6-176
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic representation of the ASAP approach. Three rounds of PCR allow for 100% coverage of a given region. F and R suffix to the numbers represent forward and reverse universal primers.
Figure 2ASAP profile from . A. Round 1 touchdown PCR. B. Round 2 touchdown PCR and C. Round 3 touchdown extension PCR.
ASAP PCR primers. Primer sequences, annealing site and the relative position in tobacco (Nt), Arabidopsis (At) and maize (Zm) are presented, along with the anticipated amplicon size. * represent the primers with low sequence similarity in maize.
| Upstream of JLB | ggatttttttttagtgaacgtgtcac | 86657–86682 | 804 | 84258–84283 | 812 | 82644–82669* | 1001 | |
| Intron rpl2 | aagtatcgacgtaatttcatagagtc | 87436–87461 | 85045–85070 | 83622–83647 | ||||
| Intron rpl2 | catctggcttatgttcttcatgtagc | 87397–87422 | 1048 | 85006–85031 | 1049 | 83583–83608 | 1043 | |
| rpl23 3'end | caactaggacagaaataaagcattgg | 88420–88445 | 86030–86055 | 84601–84626 | ||||
| rpl23 3'end | atacgtctgtaatgcattgtatgtcc | 88310–88335 | 1001 | 85920–85945 | 980 | 84491–84516 | ||
| YCF2/ORF2280 5' | gaagatacaggagcgaaacaatcaac | 89285–89311 | 86875–86900 | Deleted | ||||
| YCF2/ORF2280 | aagaaaaaatctctatttgatagggc | 89181–89206 | 1031 | 86770–86795 | 1037 | Deleted | ||
| YCF2/ORF2280 | tttcgttccgtttgaagaaaggaagg | 90212–90187 | 87782–87807 | Deleted | ||||
| YCF2/ORF2280 | ggattccattagtaatgaggattcgg | 90096–90121 | 1162 | 87685–87710 | 1171 | Deleted | ||
| YCF2/ORF2280 | gaggctcgaaccatttcttctgactc | 91233–91258 | 88831–88856 | Deleted | ||||
| YCF2/ORF2280 | cttcgaatatggaattcaaagggatc | 91131–91156 | 1042 | 88729–88754 | 1081 | Deleted | ||
| YCF2/ORF2280 | tgaatatgttagatacctgtgactcg | 92148–92173 | 89785–89810 | Deleted | ||||
| YCF2/ORF2280 | acaattcctcaatatcttgttcattc | 92049–92074 | 1091 | 89680–89705 | 1088 | Deleted | ||
| YCF2/ORF2280 | tcttctagagaatctcctaattgttc | 93115–93140 | 90743–90768 | Deleted | ||||
| YCF2/ORF2280 | gaaaaggtcaaatctttgatgattcc | 92995–93020 | 1041 | 90623–90648 | 1035 | Deleted | ||
| YCF2/ORF2280 | tttccggcatcatatccatagttagc | 94011–94036 | 91633–91658 | Deleted | ||||
| YCF2/ORF2280 | ctgaacaagttcctggataacaagcc | 93853–93878 | 1169 | 91493–91518 | 1151 | Deleted | ||
| YCF2/ORF2280 | aaatctctgatcaaggatagaacaag | 94997–95022 | 92619–92644 | Deleted | ||||
| YCF2/ORF2280 | gatctagttcatggcctattagaagt | 94849–94874 | 1025 | 92471–92496 | 1021 | 86077–86102 | 1138 | |
| ORF 87/YCF 15 | taacatattcttccatggagctaagg | 93849–95874 | 93475–93492 | 87190–87215* | ||||
| YCF2/ORF2280 3' | cggatgaaatgaaaattggattcatg | 95669–95724 | 1094 | 93330–93355 | 1319 | No similarity | ||
| ORF 79 | aatcggacctgctttttacatatctc | 96739–96763 | 94624–94649 | No similarity | ||||
| ORF 79 | ccaattgcttcgatttgaattatccg | 96626–96644 | 1061 | 94483–94508 | 1101 | 88794–88819 | 1079 | |
| ndhB 3' exon | tggaaatcctagctattcttagcatg | 97662–97687 | 95559–95584 | 89848–89873 | ||||
| ndhB 3' exon | attccaataattacatatccgatttg | 97567–97592 | 1043 | 95464–95489 | 1049 | 89753–89778 | 1068 | |
| ndhB 5' exon | cttatcaatacacaaatgtataactc | 98585–98610 | 96488–96513 | 90796–90821 | ||||
| ndhB 5' exon | tacgtcaggagtccattgatgagaag | 98494–98519 | 1171 | 96397–96422 | 1206 | 90705–90730 | 1192 | |
| rps 7 5'end | aatatggctttcaaattaagttccga | 99640–99665 | 97578–97603 | 91872–91897 | ||||
| rps 7 5'end | gtgcaaaagctctatttgcctctgcc | 99551–99576 | 1230 | 97489–97514 | 1231 | 91783–91808 | 1245 | |
| rps12 exon 2 | tcactgcttatatacccggtattggc | 100754–100781 | 98695–98720 | 93003–93028 | ||||
| rps12 exon 2 | tcctcgaacaatgtgatatctcacac | 100699–100694 | 968 | 98608–98633 | 1079 | 92916–92941 | 859 | |
| Spacer | caacataggtcatcgaaaggatctcg | 101642–101667 | 99662–99687 | 93750–93775 | ||||
| Spacer | gtgtgagcttatccatgcggttatgc | 101554–101581 | 1116 | 99576–99601 | 1345 | 93669–93694 | 1339 | |
| rrn16 start | gcttcatattcgcccggagttcgctc | 102645–102670 | 100896–100921 | 95043–95068* | ||||
| trnV 3' end | aagtcatcagttcgagcctgattatc | 102504–102529 | 1117 | 100751–100776 | 1122 | 94901–94926 | 1121 | |
| rrn16 start | tgagtttcattcttgcgaacgtactc | 103596–103621 | 101848–101873 | 95997–96022 | ||||
| rrn16 | cgacactgacactgagagacgaaagc | 103452–103477 | 1343 | 101704–101729 | 1341 | 95853–95878 | 1347 | |
| trnI Intron/OriA | atcgaaagttggatctacattggatc | 104770–104795 | 103020–103045 | 97175–97200 | ||||
| trnI start | gggctattagctcagtggtagagcgc | 104551–104576 | 877 | 102801–102826 | 898 | 96953–96978 | 1119 | |
| trnA start | caagagcggagctctaccaactgagc | 105403–105428 | 103674–103699 | 98047–98072 | ||||
| trnI end | gaggtctctggttcaagtccaggatg | 105229–105324 | 941 | 103571–103596 | 962 | 97943–97968 | 968 | |
| trnA end | ataagcggactcgaaccgctgacatc | 106145–106170 | 104508–104533 | 98886–98911 | ||||
| trnA intron | agattttgagaagagttgctctttgg | 106003–106028 | 1190 | 104367–104392 | 1188 | 98743–98768 | 1193 | |
| 23S | tagatgtccagtcaactgctgcgcct | 107168–107193 | 105530–105555 | 99911–99936 | ||||
| 23S | gaaactaagtggaggtccgaaccgac | 107053–107079 | 1225 | 105416–105441 | 1224 | 99797–99822 | 1291 | |
| 23S | cgctaccttaggaccgttatagttac | 108253–108278 | 106615–106640 | 101063–101088 | ||||
| 23S | ggtctccgcaaagtcgtaagaccatg | 108131–108156 | 1172 | 106493–106518 | 1169 | 100941–100966 | 1156 | |
| 4.5S | acatcactgcacttccacttgacacc | 109278–109303 | 107637–107662 | 102072–102097 | ||||
| 23S end | ctgctgaaagcatctaagtagtaagc | 109089–109115 | 963 | 107451–107476 | 929 | 101897–101922 | 920 | |
| trnR end | ggttgtgggcgaggagggattcgaac | 110027–110052 | 108355–108380 | 102792–102817 | ||||
| Spacer | aaatggctggggagaggaaaggttcc | 109905–109930 | 1182 | 108231–108256 | 1232 | No similarity | ||
| ORF350 | attatcttcatgcataaggatactag | 111062–111087 | 109438–109463 | No similarity | ||||
| Spacer | tggctctatttcattatattccatcc | 110844–110869 | 1036 | 109225–109250 | 986 | No similarity | ||
| ORF 350 | agtggatccctcttgttcctgtttag | 111855–111880 | 110186–110211 | No similarity |
ASAP PCR conditions. The thermalcycler parameters used to generate ASAP amplicons in successive rounds of PCR are presented.
| Temperature (Centigrade) | Duration (Minutes) | |
| Denaturation | 94 | 4 |
| 10 Cycles | ||
| Denaturation | 94 | 0:40 |
| Annealing | 55 – 0.5/cycle | 0:40 |
| Extension | 72 | 0:40 |
| 25 Cycles | ||
| Denaturation | 94 | 0:40 |
| Annealing | 50 | 0:40 |
| Extension | 72 | 0:40 |
| Final extension | 72 | 7 |
| Denaturation | 94 | 4 |
| 10 Cycles | ||
| Denaturation | 94 | 0:40 |
| Annealing | 52 – 0.5/cycle | 0:40 |
| Extension | 72 | 0:40 |
| 25 Cycles | ||
| Denaturation | 94 | 0:40 |
| Annealing | 47 | 0:40 |
| Extension | 72 | 0:40 |
| Final extension | 72 | 7 |
| Denaturation | 94 | 4 |
| 10 Cycles | ||
| Denaturation | 94 | 0:40 |
| Annealing | 52 – 0.5/cycle | 0:40 |
| Extension | 72 | 1:00 + .05/cycle |
| 25 Cycles | ||
| Denaturation | 94 | 0:40 |
| Annealing | 47 | 0:40 |
| Extension | 72 | 1:30 + 0.05/cycle |
| Final extension | 72 | 10 |
Figure 3Composite ASAP PCR profiles from 8 plant species. – Arabidopsis thaliana, – Nicotiana tabacum, – Citrus sinensis, – Prunus persica, – Lycopersicon esculentum, – Amaranthus hypochondriacus, – Coleus hybrida and – Lactuca sativa. Horizontal lines across each species indicate 1 kb size. Vertical columns indicate the amplicons generated from a given primer pair in the 8 plant species.
Figure 4. A. Round 1 and 2 touchdown PCR. B. Round 3 touchdown extension PCR and C. Composite profile from A and B.
ASAP coverage. Percentage coverage of the IR B region using ASAP method in 10 different genera and unique features of 4 diverse genera used in the study.
| Crucifereae | 100 | 100 | |||
| Solanaceae | 100 | 100 | |||
| Rutaceae | 100 | 100 | |||
| Solanaceae | 59 | 100 | 100 | ||
| Rosaceae | 7.4 | 100 | 100 | ||
| Amaranthaceae | 81 | 100 | 100 | ||
| Asteraceae | 63 | 100 | 100 | ||
| Rosaceae | 74 | 89 | 100 | 100 | |
| Lamiaceae | 85 | 92 | N/A | 92 | |
| Poaceae | 40 | 80 | 100 | 100 | |
| Fabaceae | No Inverted Repeat | ||||
| Ginkgoaceae | Small inverted repeat of 17 kb | ||||
| Pinaceae | Gymnosperm | ||||
| Equisetaceae | Pteridophyte | ||||
Figure 5A. PCR amplicons generated from primer pair 11 in 9 plant species. – Fragaria × ananassa/Rosaceae, – Prunus persica/Rosaceae, – Arabidopsis thaliana/Crucifereae, – Citrus sinensis/Rutaceae, – Coleus hybrida/Lamiaceae, – Nicotiana tabacum/Solanaceae, – Lycopersicon esculentum/Solanaceae, – Lactuca sativa/Asteraceae, – Amaranthus hypochondriacus/Caryophyllaceae. B. Schematic representation of the polymorphism between the amplicons generated from primer pair 11 in tobacco, tomato and amaranthus.
Figure 6Composite ASAP PCR profiles from 4 unique plant species. Ps – Pisum sativum, Gb – Gingko biloba, Pt – Pinus taeda, Eh – Equisetum hyemale.