| Literature DB >> 25947176 |
Andrew L Webb1,2, Peter Kruczkiewicz3, L Brent Selinger4, G Douglas Inglis5, Eduardo N Taboada6.
Abstract
BACKGROUND: Molecular typing methods are critical for epidemiological investigations, facilitating disease outbreak detection and source identification. Study of the epidemiology of the emerging human pathogen Arcobacter butzleri is currently hampered by the lack of a subtyping method that is easily deployable in the context of routine epidemiological surveillance. In this study we describe a comparative genomic fingerprinting (CGF) method for high-resolution and high-throughput subtyping of A. butzleri. Comparative analysis of the genome sequences of eleven A. butzleri strains, including eight strains newly sequenced as part of this project, was employed to identify accessory genes suitable for generating unique genetic fingerprints for high-resolution subtyping based on gene presence or absence within a strain.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25947176 PMCID: PMC4424573 DOI: 10.1186/s12866-015-0426-4
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Figure 1Reference genealogy of A. butzleri isolates (n=156). Clusters were calculated by simple matching comparison of 72 accessory genes using pairwise coefficients and UPGMA analysis. The scale represents fingerprint similarity based on the total number of shared loci between isolate profiles and the total number of loci in the assay. Dashed grey line represents a 90% similarity threshold used for clade definition. Isolates sequenced as part of this study are highlighted in yellow; ID 17 (strain L353, PRJNA233527), ID 40 (strain L355, PRJNA233527), ID 64 (strain L348, PRJNA233527), ID 69 (strain L352, PRJNA233527), ID 82 (strain L354, PRJNA233527), ID 100 (strain L349, PRJNA233527), ID 149 (strain L351, PRJNA233527), ID 151 (strain L350, PRJNA233527). Published reference A. butzleri strains are designated with arrows and include ID 68 (strain 7h1h, PRJNA200766), ID 109 (strain JV22, PRJNA61483), ID 138 (strain RM4018, PRJNA58557), ID 154 (strain ED-1, PRJNA158699).
Simpson’s Index of Diversity for z isolates ( =152) genotyped by CGF
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Binary Pairwise Similarity (UPGMA) | Reference | 87 | 0.984 | 0.978-0.991 | 0.977-0.992 |
| CGF40 | 86 | 0.987 | 0.983-0.992 | 0.982-0.992 |
Simpson’s Index of Diversity (ID), confidence intervals (CI), and non-approximated confidence intervals (CINA) were calculated using the online tool of the Comparing Partitions Website (http://darwin.phyloviz.net/ComparingPartitions/index.php?link=Tool).
Partitions were denoted at the 95% similarity level, which was calculated using the simple matching coefficient in BioNumerics (version 6.6, Applied Maths, Austin, TX).
Adjusted Wallace Coefficient values of CGF compared to the reference phylogeny for z isolates ( =152)
|
|
|
|
|---|---|---|
|
| 0.88 | 0.62 |
|
| (0.83-0.93) | (0.53-0.71) |
|
| 0.92 | 0.87 |
|
| (0.89-0.95) | (0.83-0.91) |
Adjusted Wallace Coefficient values were calculated using the online tool of the Comparing Partitions Website (http://darwin.phyloviz.net/ComparingPartitions/index.php?link=Tool).
Partitions were denoted by 90% and 95% accessory gene pairwise similarity, which were calculated using the binary simple matching algorithm in BioNumerics (version 6.6, Applied Maths).
Figure 2Tanglegram of CGF72 (reference) and CGF40 genealogies for A. butzleri isolates (n = 156). Coloured lines represent isolates within clusters in the reference phylogeny that are ≥90% similar to one or more other isolates. Scales represent fingerprint similarity based on the total number of shared loci between isolate profiles and the total number of loci in the assay. Coloured lines also indicate the location of the same isolate in the CGF72 and the CGF40 cladograms. Scales represent fingerprint similarity based on the total number of shared loci between isolate profiles and the total number of loci in the assay. Isolates sequenced as part of this study are highlighted in yellow; ID 17 (strain L353, PRJNA233527), ID 40 (strain L355, PRJNA233527), ID 64 (strain L348, PRJNA233527), ID 69 (strain L352, PRJNA233527), ID 82 (strain L354, PRJNA233527), ID 100 (strain L349, PRJNA233527), ID 149 (strain L351, PRJNA233527), ID 151 (strain L350, PRJNA233527). Published reference A. butzleri strains are designated with arrows and include ID 68 (strain 7h1h, PRJNA200766), ID 109 (strain JV22, PRJNA61483), ID 138 (strain RM4018, PRJNA58557), ID 154 (strain ED-1, PRJNA158699).
Isolates of z from diverse sources for whole genome sequence analysis
|
|
|
|
|
|---|---|---|---|
| L348 | Sewage outfall | Lethbridge, Alberta, CA | 07/May/2008 |
| L349 | Diarrheic human stool | Blairmore, Alberta, CA | 30/Jul/2008 |
| L350 | Non-diarrheic human stool | Lethbridge, Alberta, CA | 30/Sep/2008 |
| L351 | Non-diarrheic human stool | Lethbridge, Alberta, CA | 01/Apr/2009 |
| L352 | Diarrheic human stool | Lethbridge, Alberta, CA | 27/Apr/2009 |
| L353 | Horse feces | Diamond City, Alberta, CA | 02/Jul/2009 |
| L354 | Pig feces | Lethbridge, Alberta, CA | 12/Aug/2009 |
| L355 | Raw sewage | Lethbridge, Alberta, CA | 08/Mar/2009 |
Primers for PCR amplification of CGF markers
|
|
|
|
| |
|---|---|---|---|---|
| Multiplex 1 | 150 | GCATCCTCTTCCTCCATCAT | TCGAATAAATCCCCTACCCTT | 12 |
| 250 | ATACACCACCAGATGAGCTG | TAACGTACCGCATCCATTGA | 10 | |
| 400 | AGTGCCCGTTCTATTGGTAT | GCATAAAGAGCTTCTCCTCC | 8 | |
| 500 | ACTCTTCCCGAATCTGCAAT | TCTCCAATTCCTTGTCCTATTGT | 10 | |
| 600 | AGTCATGCAATCCTAACGAGA | AGGAGCCTACTATGTACCTCT | 10 | |
| Multiplex 2 | 150 | TTTTCATTGGGAAGAAGAATTTAGT | TCCAATTCATAAATATCTCTTGGTGA | 12 |
| 250 | TCTTTTAAAGAAGACAGCTGTAGT | TTTTGCAACACCTAATCTTGC | 18 | |
| 350 | TGATACAGGAATTATAAGAAGTGTTCC | GCATGAACTTCAACTCCAGG | 5 | |
| 450 | TGGAAATGACAGAGGATGGT | AGTAACGGATGAGCTTTTAAATTT | 8 | |
| 600 | TTGGGCTATTATGTCCCCAG | TCGTACAACTGGCATAGCTT | 7 | |
| Multiplex 3 | 200 | CCTCAACTTCTAACAGCAGG | CTCACATCACCCAATCCACT | 8 |
| 300 | TGGAATATCATAAACCAAAAATTGTTT | TTCATTGCAAATCCGCCTTT | 10 | |
| 450 | ACAGCATCCTTGATTCTAGCA | GTGTAATCATAGCCCAAATCCA | 12 | |
| 550 | TGAAATAATGAATGAACACAATAGCA | GTGCACAACCTAAAACCTCA | 10 | |
| 700 | GACAGGAACAGAGGGAAGTC | AGCATCTTTATTTGTCGCACT | 10 | |
| Multiplex 4 | 200 | TGATGAAACACTAGAAAATAAGGCT | CCAGTAAAACCTCTGTCAGC | 11 |
| 350 | TCACTTTTAGGTACTCACGACT | GCTATAAAACTTGCACCTTTATCG | 9 | |
| 450 | CAAAGATTTCTACGGGAAATTTGT | ACATCCTTTGCCTCTTTAAAAGA | 9 | |
| 550 | TCGAGGACAAGCAGATTCAA | GCCATTTCTACTTCCATTGTGT | 7 | |
| 700 | ACAGCAGTAACATTACAGGG | TCAAAAGCAATTCCACCACT | 11 | |
| Multiplex 5 | 150 | TCTATAGGTGCTGACCCACT | GCCGCAATACTTCCAAAACT | 9 |
| 250 | TTTACAGGAGCTTGGACATCA | TTTTACCATCATCTTCAACCCA | 9 | |
| 400 | CATCGTCCTTCAGTCGAATAT | GGAAACCATTTTCTTTTGCCA | 9 | |
| 550 | GTCATTTTTACACCACCTGCA | TCAAAACGCTTAGCCAAATCT | 12 | |
| 700 | ACTTTTTGCTTCTCAAAGTAGAAC | CCTCTGAAAAATTGAAATAATATACCC | 10 | |
| Multiplex 6 | 150 | GGTTGGGGAAAACTGCTTTT | TCTCTTGATTTTTAGTTTCAATCTCT | 10 |
| 250 | TGCTATGGGTGCAATGGTTA | AAGATTCTAGCAACACCCGA | 8 | |
| 400 | TGGGGACATGAAAACTGGAA | TTCACATACTTTCTCAGGCATT | 10 | |
| 550 | ACTATGGCTATATATGCGAAGAAA | TCCATAAATGTTTCAACTCAGGA | 10 | |
| 650 | GGAATTGCCGAGTTTACACG | TGAGCTCCATGTTGTATTGGA | 10 | |
| Multiplex 7 | 200 | ACTCCATTTGTGCTTATTGGA | TCTTGAACTAGCCAAAAGTGC | 10 |
| 350 | TCGAAATATCTTTTAGCTTCAAGAA | AAAACATCATTTTCTTTTGCCCA | 10 | |
| 450 | AGAGTTTGGATGGAAAACTGT | TGCAACTATTCCATCAAAACCA | 10 | |
| 550 | GGTTCAACACCAGGAACAAA | TGCAACACCTATCATCTCATTT | 10 | |
| 700 | GGAAAAGGCAAAGAATCCTCA | ACCATCGCCAGACTTCATTA | 10 | |
| Multiplex 8 | 150 | TGCAAGAAATGGTGGAACAA | CCTGTTGCAATAGTTGGTGT | 10 |
| 250 | TGGTAGAAGAAACAATAAAAAGATTTG | AGTCTTGATTTATCGACAGTTCT | 10 | |
| 350 | TTTTGTTTGAAGCTTATTCGTGA | AGTCCATATCCTTTCTCTCTCA | 8 | |
| 450 | AGGAGCTGTTGAGATTTTCAA | GTCGTTGCTCATCTGCTTTT | 7 | |
| 550 | GATGCTGGATTTTGTATGGCT | AGCCAAGAAACTTTCAATATCTCT | 10 |
Primer pairs were selected and grouped into multiplexes using Primer3 [56], multiPLX [57], and CGF Multiplexer [25].
Multiplex primer pair concentrations were optimised for Ta = 60°C.