| Literature DB >> 26758040 |
Nan Chen1, Wei-Min Wang1, Huan-Ling Wang1,2.
Abstract
A novel strategy for amplification full-length cDNA and promoter sequences has been developed using bioinformatics technology and multiplexed PCR methods in this study. The amplification of 3' ends of cDNA is performed according to the modified classic 3' RACE techniques, therein the more efficient and effective oligo(dT)-anchor primer with hairpin structure is specially designed. For the amplification of 5' ends of cDNA, two or three-round TAIL-PCR or touch-down PCR using arbitrary degenerate (AD) and sequence-specific reverse (SPR) primers is performed until the 5' sequence of multi-assembled fragment reaches the exon1 region identified by aligning this fragment to reference genome database. Then another TAIL-PCR or touch-down PCR using genomic DNA as template is conducted to obtain the remaining 5' and promoter sequences. The 5' end sites of cDNA are predicted by aligning finally assembled fragment to homologous reference genes of other species, and screening the relative locations of common characteristic cis-elements in silico on promoter. The putative 5' ends are further validated by primers corresponding to these predicted sites in cDNAs. This method is suitable for researchers to isolate limited full-length cDNA sequences due to its operability, inexpensiveness, efficiency and speediness.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26758040 PMCID: PMC4725349 DOI: 10.1038/srep19420
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic outline of the 3′ and 5′ RACE approaches.
First-strand cDNAs are synthesized using a special hairpin structure oligo(dT)-anchor primer, and the core sequences of target gene are amplified by up-down PCR (step 1). Sequence-specific forward primers are designed together with Outer-R and Inner-R primers to amplify the 3′ end sequences of cDNA through nest-PCR (step 2). Sequence-specific reverse primers (SPRs) referring to the assembled fragments of 3′ end and core sequences are designed and combined with arbitrary degenerate primers (ADs) to amplify 5′ sequences of cDNA by TAIL or touch-down PCR (step 3). After re-assembling, the fragments together with homologous reference gene sequences are aligned to reference genome database for exon distribution analyses (step 4). The PCRs in cDNA will not continue until the multi-assembled fragments reach the exon1 region, then other SPRs referring to the sequence located in exon1 are designed, and one more TAIL or touch-down PCR in genomic DNA are conducted to obtain the promoter sequences (step 5). The 5′ end sites of finally assembled sequence are in silico predicted and validated in cDNAs (step 6).
Figure 2Amplification 3′ end sequences of M. amblycephala heat shock transcription factor 2 (HSF2) and PHD3 in cDNAs reversed by oligo(dT) primers from modified RACE, classic RACE, SMART RACE, SMARTer RACE and life technologies (LT) Gene Race Kit, respectively.
The 3′ end sequence of HSF2 is amplified in four cDNA templates and only templates reversed by modified and classic RT primers achieve the correct bands (A). Then HSF2 3′ end is further amplified in different temperature conditions (51, 54, 57 and 60 °C), and cDNA templates reversed by hairpin structure RT primer performes better (B). These templates are also used to amplifiy 3′ end of PHD3 in one-round PCR, and cDNA templates from modified RACE and SMART can obtain weak target bands (C).
Figure 3The predicted secondary structures of RT-primer sequences.
RT-primers are coming from modified RACE (A), classic RACE (B), SMART RACE (C), SMARTer RACE (D) and life technologies Gene Race Kit (E), respectively. RT-primers from modified and SMART RACE are likely to form hairpin structure during the RT reactions, and only oligo(dT) sequence is exposed, as a result to enhance the RT reaction specificity.
Figure 4Amplification of the full-length cDNA and promoter sequences of M. amblycephala PHD1.
The core sequence of PHD1 is obtained from EST database and validated (A). Then sequence-specific forward primers are designed together with Outer-R and Inner-R primers to amplify the 3′ end sequence by nest-PCR (B). After 3′ end and core sequences assembling, SPRs are designed together with ADs to amplify the 5′ sequence by three-round TAIL-PCR (C). Since the re-assembled sequence can not reach the exon1 region after aligning, another step two-round TAIL-PCR is conducted to obtain extra 5′ unknown sequence (D). Then the third-assembling sequence is aligned again into zebrafish genome database to analyse the exon distributions, and another SPRs located in the exon1 region are designed together with ADs to amplify the promoter sequence by TAIL-PCR in genomic DNA (E). Through in silico analysis, the predicted 5′ end sites are identified and validated in cDNAs using ORF-F1, ORF-F2, ORF-F3 and ORF-R primers (F).
Figure 5Amplification of the full-length cDNA and promoter sequences of M. amblycephala PHD3.
The core sequence of PHD3 is obtained from EST database and validated (A). Then sequence-specific forward primers are designed together with Outer-R and Inner-R primers to amplify the 3′ end sequence by nest-PCR (B). After 3′′ end and core sequences assembling, SPRs are designed together with ADs to amplify the 5′ sequence by two-round TAIL-PCR (C). Then the re-assembled sequence is aligned to analyse the exon distributions, and another SPRs located in exon1 region are designed together with ADs to amplify promoter sequence by three-round TAIL-PCR in genomic DNA (D). Through in silico analysis, the predicted 5′ end site is identified and validated in cDNAs using ORF-F and ORF-R primers (E).
Figure 6Amplification of the full-length cDNA and promoter sequences of L. vannamei HSP70.
The core sequence of HSP70 is amplified by degenerate primers through touch-down PCR (A). Then sequence-specific forward primers are designed together with Outer-R and Inner-R primers to obtaine the 3′ end sequence by nest-PCR (B). In silico analysis indicats that HSP70 has only one exon, thus SPRs and ADs are used to amplify 5′ and promoter sequence through two-round touch-down PCR in genomic DNA (C). Through in silico analysis, the predicted 5′end site is identified and validated in cDNAs using ORF-F and ORF-R primers (D).
Figure 7Amplification of the full-length cDNA and promoter sequences of C. idella EDN1.
The core and 3′ end sequence of EDN1 is obtained by modified degenerate primers through touch-down PCR (A). Since introns in length of zebrafish EDN1 are short, the 5′ end and promoter sequences of C. idella EDN1 are all amplified in genomic DNA (B). In silico analysis is used to identify the EDN1 exons and 5′ end sites, and then validated by ORF-F1, ORF-F2 and ORF-R primers in cDNAs (C).
Primers used in this study.
| Primer names | Primer sequences (5′-3′) | Applications |
|---|---|---|
| PHD1-core | F: TTACCCTGGCAACGGAACAG | Amplification core sequence of PHD1, 301 bp |
| R: CTTCTCCTTAGCTTCGGCCC | ||
| PHD3-core | F: TCTTTCTGTGAGGTGGCTGTC | Amplification core sequence of PHD3, 465 bp |
| R: ATGGGTTAACGGGACCGAGA | ||
| HSP70-core | F: GTGTTCCAGCATGGCAAGGTGGARATCMTCGCC | Amplification core sequence of HSP70, 1320 bp |
| R: ACGTTCAGGATGCCGTTGGCGTCRATGTCGA | ||
| EDN1-core | F1: CCTGGATAAAGAGTGCGTCTACTTYTGYCA | Amplification core and 3′ end sequences of EDN1, 1139 bp |
| F2: GCGAACGGTGTCATATGGAYTTGGAAAYGC | ||
| Hairpin structure RT primer | CGTATCCAGTGCAGGGTCCGAGGTATTCGCACTGGATACGT20 | Reverse mRNA into cDNA |
| Outer-R | CAGTGCAGGGTCCGAGGTAT | Amplification 3′ ends of cDNA |
| Inner-R | CCGAGGTATTCGCACTGGATACGTTT | Amplification 3′ ends of cDNA |
| Classic RT primer | CCAGTGAGCAGAGTGACGAGGACTCGAGCTCAAGCT17 | Reverse mRNA into cDNA |
| Classic-Outer | CCAGTGAGCAGAGTGACG | Amplification 3′ ends of cDNA |
| Classic-Inner | GAGGACTCGAGCTCAAGC | Amplification 3′ ends of cDNA |
| SMART RT primer | ATTCTAGAGGCCGAGGCGGCCGACATGT30VN | Reverse mRNA into cDNA |
| SMART-R | ATTCTAGAGGCCGAGGCGGCCGACATG | Amplification 3′ ends of cDNA |
| SMARTer RT primer | AAGCAGTGGTATCAACGCAGAGTACT30VN | Reverse mRNA into cDNA |
| SMARTer-R | AAGCAGTGGTATCAACGCAGAGT | Amplification 3′ ends of cDNA |
| LT RT primer | GCTGTCAACGATACGCTACGTAACGGCATGACAGTGT24 | Reverse mRNA into cDNA |
| LT-R | GCTGTCAACGATACGCTACGTAACG | Amplification 3′ ends of cDNA |
| HSF2-3 | F: TAGAGTTGATGGACTATCTGGACAGTATTGA | Amplification 3′ end of HSF2, 1060 bp |
| PHD1-3 | F: TAACCTGCATCTACTATCTGAACAAGGA | Amplification 3′ end of PHD1, 1291 bp |
| PHD3-3 | F: TTTCTTATTAACACTCATAGACAAACTCATTTC | Amplification 3′ end of PHD3, 770 bp |
| HSP70-3 | F1: TCTCATCAAGCGTAACACCACCAT | Amplification 3′ end of HSP70, 881 bp |
| F2: GACTCAGACCTTCACCACCTACTC | ||
| AD | -1: NTCGASTWTSGWGTT | Amplification promoter and 5′ sequences |
| -2: NGTCGASWGANAWGAA | ||
| -3: WGTGNAGWANCANAGA | ||
| PHD1-5 | SPR1: GAAAATGAGCAGACGGTCAAAGAGAG | Amplification promoter and 5′ sequence of PHD1 |
| SPR2: GTGGATCTGCAATAAACCTCCATGAAC | ||
| SPR3: CATATCCTGTTCCGTTGCCAGGGTAA | ||
| SPR4: CGGATGTTCTTAGAGGGGATGCTCTTTTGGATG | ||
| SPR5: CCCGCTGCGATTTAGGGTCTCCACTTCCTCCAG | ||
| SPR6: CCATAGAACTTCATACAGGGAACTATGTACTG | ||
| SPR7: GTGTGCTTTTACTAGAGTCTGTTTTAGG | ||
| PHD3-5 | SPR1: TCTTTCTGTGAGGTGGCTGTCAGATCTCTA | Amplification promoter and 5′ sequence of PHD3 |
| SPR2: TGACCTTGCATGAATGCTTTTGCCCAGTTGACC | ||
| SPR3: TGAGTGTTAATAAGAAATTGATAGCCTCCGTGC | ||
| SPR4: ATGTTATTTTATCCCCTCTGATGTTTGTCCTGC | ||
| HSP70-5 | SPR1: CTTCATTTTGATGAGCACCATCGAG | Amplification promoter and 5′ sequence of HSP70 |
| SPR2: GTAGAAGGTCTTCTTGTCTCCCTTG | ||
| EDN1-5 | SPR1: GTTTTGTCTTTGCTATCTGCACATTT | Amplification promoter and 5′ sequence of EDN1 |
| SPR2: ATTTCCAAATCCATATGACACCGTTC | ||
| PHD1-ORF | F1: TGCTTCAAGACCATCGTAAATTAATATTAATGAG | Identification 5′ end of PHD1 and validation its ORF, 1824 bp |
| F2: CAGTGTTATATCAGATACGAAGCCTACT | ||
| F3: CCTTAATCTGGAAATAAACAGCCATGGAAAATAG | ||
| R: AAGCAATTCAGACTTGTTGGATGTTTGAGG | ||
| PHD3-ORF | F: TCACTTTAAAAAAACTTTAGCTGATTTGA | Identification 5′ end of PHD3 and validation its ORF, 832 bp |
| R: GCTAAGAAATAAGATGATAGACGCA | ||
| HSP70-ORF | F: GTGGTTGGGTCTTCATGATAATGCTT | Identification 5′ end of HSP70 and validation its ORF, 2131 bp |
| R: CCACCAACATCATAAATAAAATGCGG | ||
| EDN1-ORF | F1: GTTTGTGTATTTGATGGTCTATCACCGTCTG | Identification 5′ end of EDN1 and validation its ORF, 1216 bp |
| F2: GTCTGAGTCACATCCGCGTCTTGCATAAG | ||
| R: TTTTACAGTGTATGAAGTCCAATAACAAGA |
The parameters of TAIL-PCR.
| Reactions | Program NO. | Number of cycles | Cycle Parameters |
|---|---|---|---|
| First-round PCR | 1 | 1 | 95 °C 2 min, 97 °C 1 min |
| 2 | 10 | 95 °C 20 s, 63 °C 25s, 72 °C 1 min | |
| 3 | 1 | 95 °C 20 s, 25 °C 1 min, ramping to 72 °C over 1 min, 72 °C 1 min | |
| 4 | 10 | 95 °C 20 s, 30 °C 25 s, 72 °C 1 min | |
| 5 | 12 | 95 °C 20 s, 63 °C 25 s, 72 °C 45 s | |
| 95 °C 20 s, 63 °C 25 s, 72 °C 45 s | |||
| 95 °C 20 s, 30 °C 25 s, 72 °C 45 s | |||
| 6 | 1 | 72 °C 2 min, 30 °C ∞ | |
| Second-round PCR | 1 | 1 | 95 °C 3 min |
| 2 | 12 | 95 °C 20 s, 60 °C 25 s, 72 °C 45 s | |
| 95 °C 20 s, 60 °C 25 s, 72 °C 45 s | |||
| 95 °C 20 s, 48 °C 25 s, 72 °C 45 s | |||
| 3 | 1 | 72 °C 2 min, 30 °C ∞ | |
| Third-round PCR | 1 | 1 | 95 °C 3 min |
| 2 | 12 | 95 °C 20 s, 60 °C 25 s, 72 °C 45 s | |
| 95 °C 20 s, 60 °C 25 s, 72 °C 45 s | |||
| 95 °C 20 s, 48 °C 25 s, 72 °C 45 s | |||
| 3 | 1 | 72 °C 2 min, 30 °C ∞ |
The parameters of touch-down PCR.
| Reaction | Program NO. | Number of cycles | Cycle Parameters |
|---|---|---|---|
| Touch-down PCR | 1 | 1 | 95 °C 3 min, |
| 2 | 21 | 95 °C 20 s, 65 °C to 55 °C 45 s (decrease 0.5 °C each cycle), 72 °C 45 s | |
| 3 | 20 | 95 °C 20 s, 55 °C 20 s, 72 °C 45 s | |
| 4 | 1 | 72 °C 2 min, 30 °C ∞ |