| Literature DB >> 22509391 |
Ali Taheri1, Stephen J Robinson, Isobel Parkin, Margaret Y Gruber.
Abstract
A new method to improve the efficiency of flanking sequence identification by genome walking was developed based on an expanded, sequential list of criteria for selecting candidate enzymes, plus several other optimization steps. These criteria include: step (1) initially choosing the most appropriate restriction enzyme according to the average fragment size produced by each enzyme determined using in silico digestion of genomic DNA, step (2) evaluating the in silico frequency of fragment size distribution between individual chromosomes, step (3) selecting those enzymes that generate fragments with the majority between 100 bp and 3,000 bp, step (4) weighing the advantages and disadvantages of blunt-end sites vs. cohesive-end sites, step (5) elimination of methylation sensitive enzymes with methylation-insensitive isoschizomers, and step (6) elimination of enzymes with recognition sites within the binary vector sequence (T-DNA and plasmid backbone). Step (7) includes the selection of a second restriction enzyme with highest number of recognition sites within regions not covered by the first restriction enzyme. Step (8) considers primer and adapter sequence optimization, selecting the best adapter-primer pairs according to their hairpin/dimers and secondary structure. In step (9), the efficiency of genomic library development was improved by column-filtration of digested DNA to remove restriction enzyme and phosphatase enzyme, and most important, to remove small genomic fragments (<100 bp) lacking the T-DNA insertion, hence improving the chance of ligation between adapters and fragments harbouring a T-DNA. Two enzymes, NsiI and NdeI, fit these criteria for the Arabidopsis thaliana genome. Their efficiency was assessed using 54 T(3) lines from an Arabidopsis SK enhancer population. Over 70% success rate was achieved in amplifying the flanking sequences of these lines. This strategy was also tested with Brachypodium distachyon to demonstrate its applicability to other larger genomes.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22509391 PMCID: PMC3324424 DOI: 10.1371/journal.pone.0035117
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of oligonucleotides used for genome walking in Arabidopsis with restriction enzymes NsiI and NdeI.
| Oligo name | Oligo sequence (5′ = >3′) | Primer use |
| SWA-F- |
|
|
| SWA-F- |
|
|
| SWA-R- | Phos- |
|
| SWA-R- | Phos- |
|
| GW-F-out |
| 1° PCR |
| GW-F-in |
| 2° PCR |
| LB-R-out |
| 1° PCR |
| LB-R-in |
| 2° PCR |
| LB-R-seq |
| sequencing |
1° , denotes primary PCR reaction, 2°, denotes secondary nested PCR reaction.
Figure 1Flow chart outlining the steps used in optimized genome walking.
Fragment distribution frequency, methylation sensitivity and vector representation of 29 restriction enzymes with high numbers of fragments within a 100–3000 bp range in Arabidopsis.
| Restriction enzyme | Fragments (%) | Methylation sensitive | Presence within vector pSKI015 | Cohesive or blunt end | ||
| <100 bp | 0.1–3 kb | >3 kb | ||||
|
| 37.84 | 62.12 | 0.03 | - | Y | B |
|
| 15.84 | 71.51 | 12.66 | - | Y | C |
|
| 23.54 | 76.27 | 0.19 | - | Y | C |
|
| 5.84 | 61.79 | 32.37 | - | Y | C |
|
| 5.02 | 63.35 | 31.64 | Dam | Y | C |
|
| 12.37 | 63.88 | 23.74 | CG | Y | B |
|
| 34.83 | 65.13 | 0.03 | ? | Y | C |
|
| 34.83 | 65.13 | 0.03 | Dam | Y | B |
|
| 22.53 | 71.49 | 5.98 | - | Y | B |
|
| 33.69 | 66.30 | 0.01 | - | Y | C |
|
| 9.04 | 67.04 | 23.91 | - | Y | B |
|
| 17.07 | 76.75 | 6.17 | - | Y | B |
|
| 9.04 | 67.04 | 23.91 | CG | Y | C |
|
| 8.29 | 72.72 | 18.99 | - | Y | C |
|
| 9.04 | 67.04 | 23.91 | - | Y | C |
|
| 25.06 | 69.04 | 5.90 | CG | - | B |
|
| 34.83 | 65.14 | 0.03 | Dam, CG | Y | C |
|
| 61.55 | 38.45 | 0.00 | - | - | B |
|
| 4.99 | 59.09 | 35.93 | - | - | C |
|
| 33.69 | 66.30 | 0.01 | - | Y | C |
|
| 5.76 | 63.88 | 30.37 | - | - | C |
|
| 20.39 | 79.13 | 0.48 | ? | Y | C |
|
| 5.47 | 64.44 | 30.08 | - | Y | C |
|
| 13.35 | 74.35 | 11.69 | - | Y | B |
|
| 20.39 | 79.13 | 0.48 | CG | Y | B |
|
| 12.37 | 63.88 | 23.74 | CG | - | C |
|
| 17.21 | 75.12 | 7.67 | - | Y | B |
|
| 22.73 | 76.47 | 0.80 | CG | Y | C |
|
| 28.95 | 70.74 | 0.31 | Dam | Y | C |
GlaI is a methylation dependent endonuclease which only cleaves DNA when 5-methylcytosine or 5-hydroxymethylcytosine lies within its recognition sequence [34].
? information not available; Y, yes; B, blunt; C, cohesive.
Fragment distribution frequency for in silico NsiI-digested fragments ≥3000 bp after in silico digestion with second restriction enzyme.
| Secondary Restriction enzyme | Fragments (%) | ||
| <100 bp | 100–3000 bp | >3000 bp | |
| BfaI | 24.78 | 75.11 | 0.11 |
| ChaI | 36.32 | 63.66 | 0.02 |
| NdeI | 6.67 | 72.89 | 20.45 |
| PciI | 6.59 | 76.59 | 16.82 |
| SelI | 13.52 | 74.30 | 12.18 |
| TaiI | 23.79 | 75.72 | 0.49 |
Adapter and primer sets evaluated in this study. Hairpin and self-dimer structures for each oligo were measured by Oligoanalyzer and OligoCalc.
| Adapter | Sequence | Reference | ΔG hairpins kcal.mole | ΔG self-dimers kcal.mole |
| GW. Adp |
| Clontech | −4.28 | −22.17 |
| AP1 |
| Clontech | 0.65 | −6.59 |
| AP2 |
| Clontech | −0.36 | −16.95 |
| SWA-F |
|
| −0.73 | −5.09 |
| SAP1 |
|
| −0.5 | −3.61 |
| SAP2 |
|
| 0.58 | −3.61 |
| ADP2 |
|
| −2.29 | −16.5 |
| AP1 |
|
| −0.8 | −10.76 |
| AP2 |
|
| −0.56 | −16.24 |
Figure 2Identification of T-DNA flanking sequence in 54 A. thaliana lines by genome walking.
PCR amplification of sequences flanking the left border of T-DNA inserts (A) NsiI digested DNA. (B) NdeI digested DNA. The first lane for each row is 1 kb plus ladder (Invitrogen). (C) Example of T-DNA flanking sequence obtained from transgenic line p416 from the A. thaliana SK population. In this example, T-DNA was inserted into the TTG1 gene. The T-DNA footprint is highlighted in bold and the GW-adapter sequence is underlined.