| Literature DB >> 24465554 |
Edvin Karlsson1, Anna Macellaro1, Mona Byström1, Mats Forsman1, Dimitrios Frangoulidis2, Ingmar Janse3, Pär Larsson1, Petter Lindgren1, Caroline Ohrman1, Bart van Rotterdam3, Andreas Sjödin1, Kerstin Myrtennäs1.
Abstract
The case rate of Q fever in Europe has increased dramatically in recent years, mainly because of an epidemic in the Netherlands in 2009. Consequently, there is a need for more extensive genetic characterization of the disease agent Coxiella burnetii in order to better understand the epidemiology and spread of this disease. Genome reference data are essential for this purpose, but only thirteen genome sequences are currently available. Current methods for typing C. burnetii are criticized for having problems in comparing results across laboratories, require the use of genomic control DNA, and/or rely on markers in highly variable regions. We developed in this work a method for single nucleotide polymorphism (SNP) typing of C. burnetii isolates and tissue samples based on new assays targeting ten phylogenetically stable synonymous canonical SNPs (canSNPs). These canSNPs represent previously known phylogenetic branches and were here identified from sequence comparisons of twenty-one C. burnetii genomes, eight of which were sequenced in this work. Importantly, synthetic control templates were developed, to make the method useful to laboratories lacking genomic control DNA. An analysis of twenty-one C. burnetii genomes confirmed that the species exhibits high sequence identity. Most of its SNPs (7,493/7,559 shared by >1 genome) follow a clonal inheritance pattern and are therefore stable phylogenetic typing markers. The assays were validated using twenty-six genetically diverse C. burnetii isolates and three tissue samples from small ruminants infected during the epidemic in the Netherlands. Each sample was assigned to a clade. Synthetic controls (vector and PCR amplified) gave identical results compared to the corresponding genomic controls and are viable alternatives to genomic DNA. The results from the described method indicate that it could be useful for cheap and rapid disease source tracking at non-specialized laboratories, which requires accurate genotyping, assay accessibility and inter-laboratory comparisons.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24465554 PMCID: PMC3897454 DOI: 10.1371/journal.pone.0085417
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The 21 genomes, 26 isolates and three tissue samples analyzed in this study.
| Strain/Sample | Alt. name | Plasmid | Origin | Year | Accession no. | DNA source | DNA amplification |
| Ohio | ATCC VR 542 | QpH1 | Milk, USA | 1955 | PRJNA197125 | ATCC | no |
| Nine Mile | RSA493 | QpH1 | Tick, USA | 1935 | AE016828.2 | WSU | no |
| S4 | QpH1 | Sheep, Sweden | 1990 | PRJNA197126 | JLU/FOI | no | |
| Australian QD | RSA425 | QpH1 | Human blood, Australia | 1939 | SRS213928 | N/A | |
| C2 | QpH1 | Hay, Sweden | 1997 | PRJNA197120 | FOI | no | |
| Cb175 | QpH1 | Guyana | PRJNA199747 | N/A | |||
| Henzerling | RSA331 | QpH1 | Human, Italy | 1945 | CP000890.1 | N/A | |
| Innsbruck | Z2534 | QpH1 | Goat, Austria | PRJNA197122 | JLU | no | |
| 2338 | QpH1 | Cow, Germany | 2003 | PRJNA197123 | IMB | MDA | |
| Z349-36/94 | QpH1 | Sheep,Germany | 1994 | PRJNA197128 | IMB | no | |
| M44 | RSA459, Grita | QpH1 | Human blood, Italy | 1945 | SRS291555 | N/A | |
| Cb109 | QpH1 | Human, Germany | AKYP00000000 | N/A | |||
| Z3055 | QpH1 | the Netherlands | PRJNA199725 | N/A | |||
| Dugway | 5J108-111 | QpDG | Rodent, USA | CP000733.1 | N/A | ||
| Q321 | QpDV | Cow's milk, Russia | AAYJ00000000 | N/A | |||
| CbuK | Q154, K | QpRS | Human, USA | 1976 | CP001020.1 | N/A | |
| Priscilla | Q177 | QpRS | Goat, USA | 1980 | AAUP00000000 | IMB | MDA |
| Namibia | QpRS | Goat, Namibia | 1991 | PRJNA197124 | IMB | MDA | |
| CbuG | Q212 | Integrated | Human, Canada | CP001019.1 | N/A | ||
| McMaster | Q172 | Integrated | Human placenta | SRS213929 | N/A | ||
| Scurry | Q217 | Integrated | Human, USA | 1981 | PRJNA197127 | WSU | no |
| Brasov | QpH1 | Human, Romania | N/A | IMB | no/MDA | ||
| CS-KL 8 | QpH1 | Tick, Slovak Republic | 1989 | N/A | IMB | MDA | |
| J-3 | QpH1 | Cow, Japan | N/A | IMB | MDA | ||
| Utvinis | QpH1 | Human, Romania | N/A | JLU | no/MDA | ||
| Balaceanu | QpH1 | Human, Romania | N/A | JLU | no | ||
| Geier | QpH1 | Human, Romania | N/A | IMB | MDA | ||
| CS-Florian | QpH1 | Human blood, Slovak Republic | 1956 | N/A | IMB | MDA | |
| Soyta | QpH1 | Cow, Switzerland | 1965 | N/A | IMB | MDA | |
| Herzberg | QpH1 | Human, Greece | 1946 | N/A | FOI/IMB | no/MDA | |
| Andelfingen | QpH1 | Cow, Switzerland | 1965 | N/A | IMB | MDA | |
| München | QpH1 | Sheep, Germany | 1969 | N/A | FOI | no/MDA | |
| CS-Ixodes | QpH1 | Tick, Russia | 1962 | N/A | IMB | MDA | |
| RT-I | QpH1 | Mice, Russia | 1958 | N/A | IMB | MDA | |
| S1 | QpH1 | Sheep, Sweden | 1990 | N/A | IMB | MDA | |
| F-4 | QpDV | Human, France | 1978 | N/A | IMB | MDA | |
| F-1 | QpRS | Human, France | 1978 | N/A | IMB | MDA | |
| 03784 | QpH1 | Sheep placenta, Noord-Brabant, the Netherlands | 2009 | N/A | RIVM | no | |
| 01040 | QpH1 | Goat placenta, Noord-Brabant, the Netherlands | 2007 | N/A | RIVM | no | |
| 01050 | QpH1 | Goat placenta,Limburg, the Netherlands | 2007 | N/A | RIVM | no |
a Received from WSU = Washington State University, USA; FOI = Swedish Defence Research Agency, Umeå, Sweden; JLU = Justus-Liebig University, Giessen, Germany; IMB = Bundeswehr Institute of Microbiology, Munich, Germany; ATCC = American Type Culture Collection, USA; RIVM = National Institute for Public Health and the Environment, Utrecht, the Netherlands.
b no = no amplification; MDA = multiple displacement amplification; no/MDA = no amplification used for SNP assay, MDA used for genome sequencing.
c 36 bp single-end reads (Illumina). Mean insert size not applicable.
d Sampled by FOI. Isolated by JLU.
e 100 bp paired-end reads (Illumina). Mean insert size (standard deviation): S4; 322(145), Z349-36/94; 269(97), 2338; 478(31).
f Isolated by JLU. Cultured by FOI.
g Isolated by FOI [54].
h 76 bp single-end reads (Illumina). Mean insert size not applicable.
i 150 bp paired-end reads (Illumina). Mean insert size (standard deviation): Namibia; 297(92).
j Genomic DNA was prepared on Herzberg isolate at FOI, MDA DNA was prepared from the Herzberg isolate at IMB.
k ID at RIVM.
Genome sequence data statistics of the eight new Coxiella burnetti strains sequenced in this study.
| Genome | Mean sequence depth | Number of positions with more than 10× coverage | Reference genome length | Percentage of reference genome | Percentage of filtered reads aligning to all published | Total number of reads |
| C2 | 438 | 1,968,972 | 1,995,281 | 98,7% | 95,6% | 24,693,231 |
| Z2534, Innsbruck | 86 | 1,974,783 | 1,995,281 | 99,0% | 96,9% | 2,593,982 |
| ATCC VR 542, Ohio | 275 | 1,995,169 | 1,995,281 | 100,0% | 90,2% | 16,274,215 |
| Q217, Scurry | 157 | 1,942,040 | 1,995,281 | 97,3% | 90,4% | 9,518,013 |
| Namibia | 87 | 1,956,716 | 1,995,281 | 98,1% | 24,0% | 5,462,726 |
| 2338 | 113 | 1,969,807 | 1,995,281 | 98,7% | 20,9% | 11,434,898 |
| S4 | 1063 | 1,995,281 | 1,995,281 | 100,0% | 96,7% | 22,369,452 |
| Z349-36/94 | 924 | 1,975,322 | 1,995,281 | 99,0% | 72,6% | 24,952,742 |
a Nine Mile RSA493 (GenBank AE016828.2).
b Filtered by aligning reads to Cholocebus aethiops draft genome (NCBI BioProject PRJNA168621).
c Paired end sequences.
CanSNPs and Melt-MAMA primers.
| CanSNP | SNP position in Nine Mile RSA493 (GenBank AE016828.2) | Gene | Base | Primer | Primer sequence | Conc. (nM) | Product Tm (°C) |
| C.1 | 1,811,671 |
| C | Anc |
| 200 | 75,4 |
| G | Der |
| 200 | 80,3 | |||
| Com |
| 200 | |||||
| C.3 | 559,844 |
| T | Anc |
| 200 | 75,1 |
| C | Der |
| 200 | 80,6 | |||
| Com |
| 200 | |||||
| C.4 | 604,402 |
| G | Anc |
| 1000 | 78,0 |
| A | Der |
| 200 | 82,4 | |||
| Com |
| 200 | |||||
| C.5 | 847,839 |
| T | Anc |
| 200 | 74,5 |
| C | Der |
| 200 | 79,6 | |||
| Com |
| 200 | |||||
| C.6 | 648,943 |
| G | Anc |
| 800 | 76,1 |
| A | Der |
| 200 | 79,8 | |||
| Com |
| 200 | |||||
| C.7 | 1,177,930 |
| G | Anc |
| 200 | 80,8 |
| A | Der |
| 800 | 75,5 | |||
| Com |
| 200 | |||||
| C.8 | 1,356,018 |
| C | Anc |
| 600 | 75,5 |
| T | Der |
| 200 | 80,2 | |||
| Com |
| 200 | |||||
| C.9 | 33,004 |
| G | Anc |
| 800 | 74,0 |
| A | Der |
| 200 | 78,6 | |||
| Com |
| 200 | |||||
| C.10 | 1,436,254 |
| C | Anc |
| 800 | 76,2 |
| T | Der |
| 200 | 80,9 | |||
| Com |
| 200 | |||||
| C.11 | 1,792,411 |
| G | Anc |
| 400 | 71,8 |
| A | Der |
| 200 | 76,7 | |||
| Com |
| 200 | |||||
| C.6 Cont | A | Der Cont |
| 200 | |||
| Com Cont |
| 200 |
a SNP state could not be determined whether ancestral or derived.
b Antepenultimate mismatches and primer-tails are presented in lower case letters.
c Primers designed on the reverse complement strand to Nine Mile RSA493.
Figure 1Phylogenetic tree.
Branch lengths are proportional to the total number of SNPs, which are indicated above or below the branches. Genomes sequenced in this study are in bold face. The genomic groups (I–VII) and plasmid types are denoted as described [21], [26], [28]. CanSNP markers are depicted in red (C.1–C.15) and the corresponding MST markers are depicted in grey. The genomes Cb109, Cb175 and Z3055 became available after the time of assay design resulting in four new branches (C.12 to C.15). Although not included in this work, it demonstrates that the systematics of canSNPs easily can be extended when new genomes become available. The ancestral and derived state for C.1 and C.2 could not be determined due to the lack of sequence from a near neighbor (a root). Therefore, we could only design one assay C.1 that could be on either the C.1 or C.2 branches.
Typing results for genomes and isolates, including MST data for comparison.
| Strain (genome sequenced) | Plasmid | Genomic group | C.1 | C.3 | C.4 | C.5 | C.6 | C.7 | C.8 | C.9 | C.10 | C.11 | Derived for canSNP | Derived for MST SNP | MST branch | ST | CanSNP typing of non-sequenced isolates |
| Q321 | QpDV | VII | G |
| G | T | G | G | C | G | C | G | C.3 | Cox18bp34 | Br.IV.001 | 1–7,30 |
|
| Q154, CbuK, K | QpRS | IV | G | T |
| T | G | G | C | G | C | G | C.4 | Cox51bp67 | Br.IV.015 | 8 | F-1 |
| Q177, Priscilla | QpRS | IV | G | T |
| T | G | G | C | G | C | G | C.4 | Cox51bp67 | Br.IV.015 | 8 | F-4 |
| Namibia | QpRS | IV | G | T | G |
| G | G | C | G | C | G | C.5 | Cox18bp34 | Br.IV.001 | 1–7,30 |
|
| 5J108-111, Dugway | QpDG | VI | C | T | G | T |
| G | C | G | C | G | C.6 | Cox20bp155 | Br.VI.001 | NA | Balaceanu |
| Q212, CbuG | Integrated | V | C | T | G | T | G |
|
| G | C | G | C.8 | Cox5bp81 | Br.V.001 | 21 | Brasov |
| Q172, McMaster | Integrated | V | C | T | G | T | G |
|
| G | C | G | C.8 | Cox5bp81 | Br.V.001 | 21 | CS-KL8 |
| Q217, Scurry | Integrated | V | C | T | G | T | G |
|
| G | C | G | C.8 | Cox5bp81 | Br.V.001 | 21 | J-3 |
| ATCC VR 542, Ohio | QpH1 | I | C | T | G | T | G |
| C |
|
| G | C.10 | Cox51bp356 | Br.I.003 | 16,26 | Utvinis |
| RSA493, Nine Mile | QpH1 | I | C | T | G | T | G |
| C |
|
| G | C.10 | Cox51bp356 | Br.I.003 | 16,26 |
|
| S4 | QpH1 | I | C | T | G | T | G |
| C |
|
| G | C.10 | Cox51bp356 | Br.I.003 | 16,26 | CS-Florian |
| RSA425, Australian QD | QpH1 | I | C | T | G | T | G |
| C |
|
| G | C.10 | Cox51bp356 | Br.I.003 | 16,26 | München |
| C2 | QpH1 | I | C | T | G | T | G |
| C |
|
| G | C.10 | Cox51bp356 | Br.I.003 | 16,26 | Soyta |
| Cb175 | QpH1 | I | C | T | G | T | G |
| C |
| C | G | C.9 | Cox18bp376 | Br.I.001 | 16,17, 26 | Herzberg |
| RSA331, Henzerling | QpH1 | II | C | T | G | T | G |
| C |
| C |
| C.11 | Cox18bp166 | Br.II.001 | 18,22,23,25,29 | Andelfingen |
| Z2534, Innsbruck | QpH1 | II | C | T | G | T | G |
| C |
| C |
| C.11 | Cox18bp166 | Br.II.001 | 18,22,23,25,29 | S1 |
| 2338 | QpH1 | II | C | T | G | T | G |
| C |
| C |
| C.11 | Cox18bp166 | Br.II.001 | 18,22,23,25,29 | CS-Ixodes |
| Z349-36/94 | QpH1 | II | C | T | G | T | G |
| C |
| C |
| C.11 | Cox18bp166 | Br.II.001 | 18,22,23,25,29 | Geier, RT-1 |
| RSA459, M44 | QpH1 | II | C | T | G | T | G |
| C |
| C |
| C.11 | Cox18bp166 | Br.II.001 | 18,22,23,25,29 | SZ2009-03784 |
| Cb109 | QpH1 | II | C | T | G | T | G |
| C |
| C |
| C.11 | Cox51bp492 | Br.II.007 | 11–15,24,32–34 | SZ2007-01040 |
| Z3055 | QpH1 | II | C | T | G | T | G |
| C |
| C |
| C.11 | Cox51bp492 | Br.II.007 | 11–15,24,32–34 | SZ2007-01050 |
a Sequenced in this study.
b The assays for C.3 and C.5 have no corresponding assay in Hornstra et al. [28]. C.3 and C.5 targets the clades starting with branches Br.IV.003 (ST1–4), and Br.IV.010 (ST30), respectively.
c Genotypes based on the 14 MST assays in Hornstra et al. [28].
d Cb175 is on branch Br.I.002 (ST17) in Hornstra et al. [28], but no assay is developed for this branch.
e The genome became available after assay design.
Figure 2Synthetic vector controls.
Two control sequences with ancestral or derived SNP alleles for ten synonymous SNP markers (C.1 to C.11) were synthesized and cloned into the standard vector pEX-A (Fig. S2). Primer binding sites are marked in red, with the SNP states for the markers shown above them.