| Literature DB >> 35215059 |
Md-Mafizur Rahman1,2, Sang-Jin Lim3, Yung-Chul Park1.
Abstract
Single-nucleotide polymorphisms (SNPs) are one of the most common forms of genetic variation and as such are powerful tools for the identification of bacterial strains, their genetic diversity, phylogenetic analysis, and outbreak surveillance. In this study, we used 15 sets of SNP-containing primers to amplify and sequence the target Escherichia coli. Based on the combination of the 15-sequence primer sets, each SNP site encompassing forward and reverse primer sequences (620-919 bp) were aligned and an SNP-based marker was designed. Each SNP marker exists in at least two SNP sites at the 3' end of each primer; one natural and the other artificially created by transition or transversion mutation. Thus, 12 sets of SNP primers (225-488 bp) were developed for validation by amplifying the target E. coli. Finally, a temperature gradient triplex PCR kit was designed to detect target E. coli strains. The selected primers were amplified in three genes (ileS, thrB, and polB), with fragment sizes of 401, 337, and 232 bp for E. coli O157:H7, E. coli, and E. coli O145:H28, respectively. This allele-specific SNP-based triplex primer assay provides serotype-specific detection of E. coli strains in one reaction tube. The developed marker would be used to diagnose, investigate, and control food-borne E. coli outbreaks.Entities:
Keywords: Escherichia coli; allele; genes; single nucleotide polymorphisms (SNPs); surveillance; triplex primer
Year: 2022 PMID: 35215059 PMCID: PMC8874422 DOI: 10.3390/pathogens11020115
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Information on natural SNP-containing primer sets based on whole-genome sequences (WGS) of Escherichia coli strains (E. coli O157 and non-O157).
| P. Code C | Forward Primer (5’–3′) | Reverse Primer (5’–3′) | Amplicon Size | Flanking Sequence with Ambiguous Code and Position of SNP of a Reference | Alter Amino Acid-SNP Position in a Respective Gene-Amino Acid of Reference | Gene |
|---|---|---|---|---|---|---|
| 01 | GACGTTACAGCTGCCGGT | ACCCAACCAGTCGGCAAC | 919 | (S/G)-396-R | Homoserine kinase ( | |
| 02 | TCGGCGGTCGCTTTATGG | CCACGGCTGCATAACCCA | 669 | (S/T)-630-A | Threonine synthase ( | |
| 07 | GCGAGCTGGGAGAACTGG | GATTCGCTGTACCGCCGG | 674 | (V/F)-579-L | Transcriptional activator protein ( | |
| 08 | TGACTCGCCGTATGTGCC | TGCCCGGCAGATAAGTGC | 839 | (N/D)-485-H; (V/F)-714-I; (Y/D)-747-N | Isoleucine--tRNA ligase ( | |
| 09 | CGGCCTGGAAACCGCTAA | TCGGTTGATGCCACCCAC | 852 | V-1590-L | Isoleucine--tRNA ligase ( | |
| 12 | GGCATTAGAGGGCGTGCA | TGTCATACGGCTGGACGC | 649 | (T/P)-378-A | 4-hydroxy-tetrahydrodipicolinate ( | |
| 14 | TTGCTAAAGTGGCGGCGA | AGACGGATTCGCTTCGCA | 713 | L-1326-F | Carbamoyl-phosphate synthetase subunit beta ( | |
| 15 | TATGCAGCCAGCCATCGG | AGATGTGGCACTTCCCGC | 785 | (G/A)-721-V | Carnitine-CoA ligase ( | |
| 16 | GGCGGTATACGGGAAGGC | CGTCTCCGGGGGATCTGA | 702 | V-1108-G | Carnitine-CoA transferase ( | |
| 20 | CCCAAGTTGCCCGGTCAT | GCACAGGGCTACGACGTT | 738 | (G/A)-355-V | DNA polymerase II ( | |
| R-397-L | DNA polymerase II ( | |||||
| 21 | CAAATGCCGCCACCGAAC | CGTCCGCCAGAACGCTAT | 796 | (G/D)-1714-A | DNA polymerase II ( | |
| 22 | CCGATTGGCCTGCTTCCA | GCAGCTGTGGTCGGGATT | 742 | V-979-D | araB—Ribulokinase ( | |
| 23 | TTGGCAGCGGCGAGTTAA | AAGCGTCGCATCAGGCAA | 620 | W-474-G | Inner membrane protein ( | |
| 24 | CCATCTGGCGGGCGATAG | GGTGCTGGAGATGAGCGG | 804 | V-205-A | Thiamine transport system permease protein ( | |
| 26 | CAGTGGCGGCAGGAGTAC | CCCTGGGCATTGACCGAC | 671 | L-16-P | 3-isopropylmalate dehydratase subunit ( |
a Reference (non-O157) genome of Escherichia coli str. K-12 substr. MG1655 (NC_000913) and ambiguous code indicate, B = C/G/T; D = A/G/T; H = A/C/T; V = A/C/G; W = A/T, S = C/G; M = A/C; K = G/T; R = A/G; Y = C/T; b the position of amino acid codes of respective genes of a reference E. coli str. K-12 substr. with changed amino acids due to SNP-based changes. The amino acid codes, S = Serine, G = Glycine, R = Arginine, T = Threonine, A = Alanine, V = Valine, F = Phenylalanine, D = Aspartate, N = Asparagine, Y = Tyrosine, Z = Glutamine, W = Tryptophan, P = Proline, M = Methionine, K = Lysine, L = Leucine, I = Isoleucine. C refers to primer code numbers (i.e., 01 ecoli, 02 ecoli, and 07 ecoli, and so on), which were originated from the designed multiple primers based on the encompassing SNPs of the reference E. coli genome.
Figure 1PCR amplification of three different serotypes including four target Escherichia coli strains with 15 primer sets of first PCR amplification {(01-ecoli-F/R (919 bp); 02-(669 bp); 07-(674 bp); 08-(839 bp); 09-(852 bp); 12-(649 bp); 14-(713 bp); 15-(745 bp); 16-(702 bp); 20-(738 bp); 21-(796 bp); 22-(742 bp); 23-(620 bp); 24-(804 bp) and 26-(671 bp)}. PCR band ‘M’ indicates DNA 100 bp marker. The gel lane numbers are shown in each section (1–4): lane No. 1 = E. coli O157:H7 (ATTC-95150); No. 2 = E. coli O157:H7 (NCCP-15739); No. 3 = E. coli (KVCC-BA1800069); No. 4 = E. coli O145:H28 (KVCC-BA1800090). The four primers (12-, 21-, 23-, and 24-ecoli primer sets) were not produced with any desired bands with the target genes of E. coli, whereas the three primers (14-, 15- and 26-ecoli primer sets) were not produced with the target genes of E. coli. The detailed information of all primers was provided in Table 1, Tables S2 and S3.
List of developed single-nucleotide polymorphism (SNP)-based primers based on five gene sequences.
| Primer | Sequence (5′–3′) a | Length (bp) | Amplicon Size (bp) | Gene Description |
|---|---|---|---|---|
| O.nhaR-1-F | TTGTTTGACGTTGGCGTG | 21 | 397 | Transcriptional activator protein ( |
| O.nhaR-1-R | CGGCATCATCAAACTC | 20 | ||
| O.nhaR-3-F | GCGT | 20 | 225 | |
| O.nhaR-3-R | CCGGGAACGGTTTTTC | 20 | ||
| O.thrB-3-F | TGTTCGGTGGTCGCG | 18 | 337 | Homoserine kinase ( |
| O.thrB-3-R | CGTGAATGAAGCCAGC | 20 | ||
| O.thrC-4-F | CCATTCTGACCGCGACC | 20 | 344 | |
| O.thrC-4-R | AACCAGCTGGTTGCGC | 19 | ||
| O.ileS1-4-F | TCTGGGCGTGCTGGGC | 19 | 488 | Isoleucine–tRNA ligase ( |
| O.ileS1-4-R | AGCGCAGCAGCTCAAG | 20 | ||
| O.ileS1-3-F | ACAAAGGCGCGAAGCC | 20 | 391 | |
| O.ileS1-3-R | GGATGGG | 20 | ||
| O.ileS2-1F | GATCATCTTCCGCGC | 19 | 401 | |
| O.ileS2-1R | CAACAACAGAAGAGTGAGTA | 23 | ||
| O.ileS2-3-F | CGATCATCTTCCGCGCG | 20 | 376 | |
| O.ileS2-3-R | GAGTCAAACCATACATCCAAT | 24 | ||
| O.caiB-3-F | GCAAATTGCGGCGGGAG | 20 | 318 | Carbamoyl-phosphate synthase large chain ( |
| O.caiB-3-R | CTGCCTGATGCGACG | 20 | ||
| O.caiB-4-F | ATAACCAGTTTCGGGTTG | 21 | 296 | |
| O.caiB-4-R | ATCGAAATCGCCGGACCG | 21 | ||
| O.polB-3-F | CAAGGGGCGACCGCGCTT | 21 | 262 | DNA polymerase II ( |
| O.polB-3-R | GCTGGAAACCGTGCG | 19 | ||
| O.polB-4-F | TAATGGTGCCGCGGTTCTGG | 20 | 232 | |
| O.polB-4-R | CTTTACCTGCGTATCTTC | 21 |
a Red color indicates natural SNP and blue color indicates artificial SNP with transition or transversion mutated.
Selection of SNP primers with the target band pattern of serotype-specific E. coli strains.
| Primer | ||||
|---|---|---|---|---|
| O.nhaR1 | √ | √ | √ | |
| O.ileS2-1 | √ | √ | ||
| O.thrB-3 | √ | |||
| O.nhaR-3 | √ | |||
| O.ileS1-3 | √ | |||
| O.ileS2-3 | + | √ | √ | |
| O.caiB-3 | ++ | √ | ||
| O.polB-3 | + | √ | ||
| O.thrC-4 | √ | |||
| O.ileS1-4 | √ | |||
| O.caiB-4 | √ | √ | ||
| O.polB-4 | √ |
“√” indicates a positive target band, “+” indicates an off-target single band, and “++” indicates the off-target double band.
Figure 2PCR amplification of SNP-based markers with four target E. coli strains including two E. coli O157, E. coli, and E. coli O145. PCR band ‘M’ indicates DNA of 100 bp. The gel lane numbers are provided in each section (1–4): lane No. 1 = E. coli O157:H7 (ATTC-95150); No. 2 = E. coli O157:H7 (NCCP-15739); No. 3 = E. coli (KVCC-BA1800069); No. 4 = E. coli O145:H28 (KVCC-BA1800090). The nine primers (O.nhaR1, O.ileS2-1, O.thrB-3, O.nhaR-3, O.ileS1-3, O.thrC-4, O.ileS1-4, O.caiB-4, and O.polB-4) produced a single target band, whereas three primers (ileS2-3, O.caiB-3, O.polB-3) produced dimers or multiple bands. The two primers (O.nhaR1 and O.ileS2-1) produced the target band of serotype specific E. coli O157:H7 but O.nhaR1 also produced the target band of E. coli. Moreover, the primers (ileS1-3, O.caiB-3, O.polB-3, and O.polB-4) produced a light band during the 1st PCR and these three primers produced a strong band with the 2nd PCR amplification (images are not shown in here but 1st and 2nd PCR amplification sequences are provided in Tables S2 and S4). Moreover, the detailed primer information is provided in Table 2.
Figure 3PCR amplification of serotype-specific triplex E. coli primer set (O1). The gel lane numbers are shown in each section (1–4): No. 1 = E. coli O157:H7 (ATTC-95150); No. 2 = E. coli O157:H7 (NCCP-15739); No. 3 = E. coli (KVCC-BA1800069); No. 4 = E. coli O145:H28 (KVCC-BA1800090), and No. 5 = Genomic DNA without PCR of E. coli. (A) A strong band was produced by 35 PCR cycles with the serotype-specific E. coli; (B) a weak band was produced by 30 PCR cycles with the serotype-specific E. coli; (C) band with isolated genomic DNA of three serotype-specific E. coli without PCR amplification. In each lane with a band, the image was produced by using the concentration of 5 μL PCR products.
Information on triplex SNP primers (O1) and their target strains.
| Primer Name | Sequence (5′–3′) | Length (bp) | Amplicon Size (bp) | Target Strains |
|---|---|---|---|---|
| O.ileS2-1F | GATCATCTTCCGCGC | 19 | 401 | |
| O.ileS2-1R | CAACAACAGAAGAGTGAGTA | 23 | ||
| O.thrB-3-F | TGTTCGGTGGTCGCG | 18 | 337 | |
| O.thrB-3-R | CGTGAATGAAGCCAGC | 20 | ||
| O.polB-4-F | TAATGGTGCCGCGGTTCTGG | 20 | 232 | |
| O.polB-4-R | CTTTACCTGCGTATCTTC | 21 |
Figure 4A schematic flow diagram of the development of the SNP-based marker for serotype-specific E. coli detection.