Literature DB >> 31457060

A Simple Genome Walking Strategy to Isolate Unknown Genomic Regions Using Long Primer and RAPD Primer.

Fajun Li1, Chunpeng Fu1, Qunfeng Li1.   

Abstract

BACKGROUND: Genome walking is a DNA-cloning methodology that is used to isolate unknown genomic regions adjacent to known sequences. However, the existing genome-walking methods have their own limitations.
OBJECTIVES: Our aim was to provide a simple and efficient genome-walking technology.
MATERIAL AND METHODS: In this paper, we developed a novel PCR strategy (termed SLRA PCR) that uses a single long primer (SLP), a set of gene specific primers (GSP), and a random amplified polymorphic DNA (RAPD) primer for genome walking. SLRA PCR consists of two processes: the first amplification using SLP, and three successive rounds of nested PCR amplified by GSP and RAPD primer. The novelty of the approach lies in the use of long primers (SLP and GSP) and same annealing and extension temperature 68℃ in combination. This method offers higher amplification efficiency, superior versatility, and greater simplicity compared with conventional randomly primed PCR methods for genome walking.
RESULTS: The promoter regions and the first introns of the insulin-like androgenic gland hormone (IAG) gene and the hemocyanin gene of Macrobrachium nipponense were cloned using SLRA PCR, respectively.
CONCLUSIONS: This genome walking strategy can be applied to a wide range of genomes.

Entities:  

Keywords:  DNA Primers; Hemocyanins; Polymerase Chain Reaction

Year:  2019        PMID: 31457060      PMCID: PMC6697840          DOI: 10.21859/ijb.2183

Source DB:  PubMed          Journal:  Iran J Biotechnol        ISSN: 1728-3043            Impact factor:   1.671


1. Background

Genome walking is a DNA cloning methodology that is used to isolate unknown genomic regions adjacent to known sequences. It can be used for the identification of the integration sites of transposable elements, gene cloning for functional studies, identification of regulatory sequences outside of the coding regions, and for filling in the lingering gaps after a genome is sequenced (1). The polymerase chain reaction (PCR)-based genome walking techniques are popular as the methods are relatively fast and effortless. They can be divided into four types: inverse PCR (2–4), ligation-mediated PCR (5–8), randomly primed PCR (9–11), and overlapping primer-based PCR (12). The first two groups require a preliminary digestion of genomic DNA with a suitable restriction enzyme, which makes these methods relatively expensive and time-consuming. Since information on restriction sites is usually not available in advance, there is no guarantee that digestion with a particular restriction enzyme will be successful. This necessitates trials with several different enzymes. Therefore, the requirements of high-quality DNA for complete endonuclease digestion and efficiency of the ligation reactions have limited the utilization of these methods. The randomly-primed PCR methods overcome the flaws mentioned above and has gained popularity for its simplicity as it needs no prior DNA manipulation (such as enzyme digestion or ligation) (13). They are relatively low-cost and relatively direct, i.e. there is no requirement for intermediate steps involving restriction enzyme digestion of the DNA template or ligation reactions. However, the conventional randomly-primed PCR often produces nonspecific amplification products due to the short-arbitrary-degenerate primers and low annealing temperatures, and may also achieve non-efficient amplification of target sequences(14). Even in improved methodologies, there are also many flaws. For instance, the improved high-efficiency thermal asymmetric interlaced (hi) TAIL-PCR still remains time-consuming, and is expensive in terms of primer design and synthesis. Moreover, this technique poses a great challenge to the activity of DNA polymerase, and so is frequently associated with inefficient amplification results(15); The fusion primer and nested integrated PCR (FPNI-PCR) is still a time-consuming procedure that usually involves three rounds of amplification with each round consisting of multiple PCR cycles. Numerous primers (SP1, SP2, SP3, FSP1,FSP2 and degenerate primers) are applied in this process(15). Complex procedures make it difficult to be used widely. Therefore, it is highly desirable that further novel methods are developed for rapid and efficient genome walking or flanking sequence cloning. As for overlapping primer-based PCR, the walking primers needs a special design so as to they have 10 bp overlap at the 3’ ends (12).

2. Objective

In order to overcome the technical hurdles, in this paper, we present a new PCR method for genome walking; single-long primer (SLP) and randomly-amplified-polymorphic DNA (RAPD) primer PCR (termed SLRA PCR). The basic principle of SLRA PCR is illustrated in . The promoter regions and first introns of the insulin-like androgenic gland hormone (IAG) gene and the hemocyanin gene of Macrobrachium nipponense were cloned using this approach, respectively. The simple and efficient genome walking strategy can be applied to a wide range of genomes.

3. Materials and Methods

3.1. Materials

Adult Macrobrachium nipponense prawns used in this study were obtained from Tai lake in Wuxi, China (120°13’44”E, 31°28’22”N). DNA extraction kit, PrimeSTAR Max Premix, and LA polymerase were purchased from TaKaRa (Dalian, China). Genomic DNA was isolated using the DNA extraction kit according to the manufacturer’s protocol.

3.2. Genome Walking by SLRA PCR

Step A: single long primer PCR (SLP PCR). The system consisted of two processes. The first amplification used a SLP, which was designed against known DNA sequence. The SLP must meet the following requirements: 30–35 nucleotides in length, GC content of 40–60%, annealing, and extension temperature of 68℃. There should be no more than three G’s and C’s in the last six positions at the 3’ end of the primer. In this study, the SLPs of promoter regions (PIAG-SLP, PHe-SLP) and first introns (IAGI1-SLP, HeI1-SLP) of the IAG and hemocyanin gene of M. nipponense are listed in . Cycling conditions and PCR reagents are given in . In the SLP PCR, we employed a two-step protocol. This was possible because of the high annealing and extension temperature of 68℃, of the SLP, and the characteristics of the PrimeSTAR Max Premix, which contained a fast, high-fidelity amplification enzyme. All of these factors ensured that the SLP bound specifically to the known DNA sequence. Step B: Nested PCR. The second amplification was three-step nested PCR. Three gene-specific primers (GSP1, GSP2 and GSP3) were designed against the known sequence, from outside to inside (), and were located inside the SLP (not contain all 3 GSP primers.). Each GSP was 26–28 nucleotides in length to ensure a high annealing and extension temperature. The GC content of GSP is same as SLP. Other conditions were the same as that used for the SLP PCR. RAPD primers were randomly selected. The GSPs of promoter regions (PIAG-GSP1-3, PHe-GSP1-3) and first introns (IAGI1-GSP1-3, HeI1-GSP1-3) of the IAG and the hemocyanin gene of M. nipponense are shown in . Cycling conditions and PCR reagents are given in . In the nested amplification, we used 5 µlitr of the SLP PCR product (no dilution and purification) as the first nested PCR template. 1 µlitr of the first round of nested PCR product was diluted 50–100 times and used as the template for the second round of nested circulation, and so on for subsequent rounds. Given the characteristics of the RAPD primer (10 nucleotides), we applied a low annealing temperature of 40℃ to allow the RAPD primer to bind to the DNA. An annealing and extension temperature of 68℃ was used to allow the GSP to bind effectively to the corresponding sequence. The LA polymerase was used to ensure extension efficiency.

3.3. Cloning and Sequencing

5 µlitr of the PCR products were used in electrophoresis in a 1.5% agarose gel in a TBE buffer stained with ethidium bromide. The DNA was visualized under a UV light. Bands corresponding to the largest products were excised from the agarose gels, purified with Agarose Gel DNA Recovery Kit (TaKaRa, China), and cloned into a pMD18-T TA cloning vector (TaKaRa, China), and then sequenced with M13 forward or reverse primers.

3.4. Sequence Analysis

The sequences were analyzed by computational methods. Promoter was predicted by NNPP version (http://www.fruitfly.org/seq_tools/promoter.html), Transcription factors binding sites in the promoter sequence of gene were predicted by TFSEARCH (http://www.cbrc.jp/htbin/nph-tfsearch).

4. Results

DNA quality plays a critical role for ligation-mediated PCR genome walking. However, there is no special requirement for SLRA PCR strategy, DNA obtained by general DNA extraction kit can meet the requirements. Furthermore, minute traces of genomic DNA (20 ng) was used in SLRA PCR walking. Using the above protocol, we cloned the promoter sequences and the first introns of the IAG gene (GenBank: KF811212) and hemocyanin gene (GenBank: KF887993) from M.nipponense (). TATA box, CAAT box, and the transcriptional start site were found in the IAG promoter sequence (about 700bp) (Supplementary Fig. 1). TATA box, CAAT box, and the transcriptional start site were found in the hemcoyanin promoter sequence (about 600bp) (Supplementary Fig. 2). The first introns of IAG (1795bp) () and hemocyanin (700bp) were also isolated, the exon-intron splice junctions in IAG and the hemcoyanin gene followed the GT-AG pattern.

5. Discussion

5.1. Principle of the Current Technique

The principle of genome walking with SLRA PCR is illustrated in . The novelty of the approach lies in the use of long primers (SLP or GSP) and identical annealing and extension temperatures (68℃) in combination. Such high annealing and extension temperature is essential for this protocol. At 68℃, SLP or GSP can hybridize with the known DNA strictly, which ensures the PCR amplification stringency, in parallel, the amplification efficiency of DNA polymerase is also ensured. A series of key technical points of this strategy are as follow: (i) Compared with the general primers (20–22 nucleotides), SLP (30–35 nucleotides) and GSP (26–28 nucleotides) play a key role in the process, which decreases the binding rate of primers to nonspecific sites and allows to capture the target sequence reliably. This template-capture step can reduce the template complexity and increase the specificity of the following nested PCR(1). Under higher stringent conditions, the single specific primer binds to complementary sites within captured DNA templates, and the target DNA fragments containing known and adjacent unknown sequences will be amplified. (ii) To reduce false positive products, nested PCR can be employed. The successive PCR rounds using different GSP can significantly reduce the non-specific amplification because the spurious PCR products are unlikely to contain binding sites for the inner specific primers(16). (iii) In order to avoid the randomness of RAPD primer, we use the SLP PCR product as the first nested PCR template. In the nested PCR, the RAPD primers hybridize with the rigorous and precise template so as to improve the precision of the experiment. Notably, the Tm of the SLP and GSP should be designed to be higher than 68℃. Such a high annealing temperature could ensure the binding of specific primers to the correct locus at stringent conditions in the following amplifications(17).

5.2. Advantages and Potential Applications SLRA PCR

Compared with conventional randomly-primed PCR methods for genome walking, the SLRA PCR method has several advantages: (i) Higher efficiency: conventional randomly primed PCR methods tend to produce small amplification products or nonspecific amplification products (14, 18). In our preliminary experiments, owing to nonspecific amplification, we failed to clone the promoter sequence of IAG using (hi) TAIL-PCR and FPNI-PCR protocol, respectively. In SLRA PCR strategy, we use the SLP PCR product as the first nested PCR template, which overcomes the problem of nonspecific amplification produced by the randomness of RAPD primer. Furthermore, we use long primer (SLP and GSP) and same annealing and extension temperature of 68℃ together, which not only improves the accuracy of the experiment but also boosts its efficiency, an approximate 1.8 Kb sequence of the first intron of IAG was acquired using this protocol. (ii) Superior versatility: owing to the diversity of RAPD, we can always find one hybridize with the unknown sequence, this method can effectively avoid the experimental failure caused by false amplification. Thus, the SLRA PCR can be performed on a wide range of genomes. (iii) Greater simplicity: degenerate primers and complex PCR procedures are unused in SLRA PCR strategy. Simple PCR progresses avoid the problem of decreased activity of DNA polymerase produced by complex program in hi TAIL-PCR technique. This entire process can be completed in a single working day.

6. Conclusions

The advantages of the novel SLRA PCR method make it useful for obtaining a full gene sequence, including detecting promoters and regulatory elements in the genome. This method is also an efficient tool to identify transposon integration sites and sequence-tagged sites in genomic DNA for gene functional studies and to obtain operons from a known short DNA sequence. In summary, we have developed an efficient and versatile genome walking method, and this method can be applied to a wide range of genomes for various purposes.
Table 1.

Primer sequences of promoter regions and first introns of the insulin-like androgenic gland hormone (IAG) gene and hemocyanin gene of M. nipponense used in SLRA PCR

Primer NameNucleotide Sequence (5′→3′)
Promoter IAG
    PIAG-SLPCCTTGAGAGAAGAATTTGAAGTCTTGAGATGAAGG
    PIAG-GSP1CCAGAAAGCAGGAGAGCGAATTTGGAGG
    PIAG-GSP2GAAACACCGGTTAAACGGACAAGGGGGC
    PIAG-GSP3GACACAGCCGTTCAGGGTATGATCCCAG
    PIAG-RAPDGGCACGTAAG
First intron of IAG
    IAGI1-SLPAAGACTCTGGGATCATACCCTGAACGGCTGTGTCC
    IAGI1-GSP1GCCCCCTTGTCCGTTTAACCGGTGTTTC
    IAGI1-GSP2CCTCCAAATTCGCTCTCCTGCTTTCTGG
    IAGI1-GSP3TCTCAAGACTTCAAATTCTTCTCTCAAGG
    IAGI1-RAPDCAGCGCTACG
Promoter of Hemocyanin
    PHe-SLPATCGCTCTGGAAGCCGTCCAGAGAGTGACTCTCG
    PHe-GSP1GCTAAAGCTGGGCCAGGCTGCGGCAG
    PHe-GSP2CAGCAGAGCGCACAAGACAAACACCTT
    PHe-GSP3TGTGTGCTGTGCCGGACCAGGGGAATC
    PHe-RAPDCCACGGGAAG
First intron of Hemocyanin
    HeI1-SLPGCCAATAGGTATGTAGTGTCTGCTGATTTCACG
    HeI1-GSP1GGCGACGGACTTTGCCTCGGCCTTGC
    HeI1-GSP2CACAGCACACAATGAAGGTGTTTGTC
    HeI1-GSP3GCCCAGCTTTAGCTTCGAGAGTCACTC
    HeI1-RAPDTAGCCACTGG
Table 2.

Thermal cycling conditions and PCR reagents for the SLRA PCR

ProgramNo. of cyclesCycling conditionsPCR reagents
SLP PCR3098℃ for 10 s ; (98℃ for 10 s; annealing and extension at 68℃ for 1 min); termination extension at 68℃ for 3 minPrimeSTARMax Premix 25 μl, SLP 0.2–0.3 μM, DNA 20 ng, ddH2O to 50 μl
Nested PCR3094℃ for 90 s; (94℃ for 30 s; annealing at 40℃ for 30 s and extension at 68℃ for 3 min); termination extension at 68℃ for 5 minBuffer(Mg+) 2.5 μl, dNTP 200 μM, RAPD 0.5 μM, GSP 0.5 μM, LA polymerase 0.5 U, PCR product 1 μl, ddH2O to 25 μl
  18 in total

1.  High-throughput TAIL-PCR as a tool to identify DNA flanking insertions.

Authors:  Tatjana Singer; Ellen Burke
Journal:  Methods Mol Biol       Date:  2003

2.  Successive chromosome walking by compatible ends ligation inverse PCR.

Authors:  Maozhi Ren; Quanjia Chen; Li Li; Rui Zhang; Sandui Guo
Journal:  Mol Biotechnol       Date:  2005-06       Impact factor: 2.695

3.  A specific and versatile genome walking technique.

Authors:  Haitao Guo; Jin Xiong
Journal:  Gene       Date:  2006-06-21       Impact factor: 3.688

4.  Self-formed adaptor PCR: a simple and efficient method for chromosome walking.

Authors:  Shiming Wang; Jian He; Zhongli Cui; Shunpeng Li
Journal:  Appl Environ Microbiol       Date:  2007-05-04       Impact factor: 4.792

5.  High-efficiency thermal asymmetric interlaced PCR for amplification of unknown flanking sequences.

Authors:  Yao-Guang Liu; Yuanling Chen
Journal:  Biotechniques       Date:  2007-11       Impact factor: 1.993

6.  Straight walk: a modified method of ligation-mediated genome walking for plant species with large genomes.

Authors:  Tokuji Tsuchiya; Nanako Kameya; Ikuo Nakamura
Journal:  Anal Biochem       Date:  2009-02-07       Impact factor: 3.365

7.  Vectorette PCR: a novel approach to genomic walking.

Authors:  C Arnold; I J Hodgson
Journal:  PCR Methods Appl       Date:  1991-08

8.  A high-throughput genome-walking method and its use for cloning unknown flanking sequences.

Authors:  Palakolanu Sudhakar Reddy; Srikrishna Mahanty; Tanushri Kaul; Suresh Nair; Sudhir K Sopory; Malireddy K Reddy
Journal:  Anal Biochem       Date:  2008-07-22       Impact factor: 3.365

9.  T-linker-specific ligation PCR (T-linker PCR): an advanced PCR technique for chromosome walking or for isolation of tagged DNA ends.

Authors:  Yan Yuanxin; An Chengcai; Li Li; Gu Jiayu; Tan Guihong; Chen Zhangliang
Journal:  Nucleic Acids Res       Date:  2003-06-15       Impact factor: 16.971

10.  Rolling circle amplification of genomic templates for inverse PCR (RCA-GIP): a method for 5'- and 3'-genome walking without anchoring.

Authors:  Athanasios Tsaftaris; Konstantinos Pasentzis; Anagnostis Argiriou
Journal:  Biotechnol Lett       Date:  2009-09-17       Impact factor: 2.461

View more
  1 in total

1.  Wristwatch PCR: A Versatile and Efficient Genome Walking Strategy.

Authors:  Lingqin Wang; Mengya Jia; Zhaoqin Li; Xiaohua Liu; Tianyi Sun; Jinfeng Pei; Cheng Wei; Zhiyu Lin; Haixing Li
Journal:  Front Bioeng Biotechnol       Date:  2022-04-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.