| Literature DB >> 25388670 |
Mette T Christiansen1, Amanda C Brown2,3, Samit Kundu4,5, Helena J Tutill6, Rachel Williams7, Julianne R Brown8, Jolyon Holdstock9, Martin J Holland10, Simon Stevenson11, Jayshree Dave12, C Y William Tong13, Katja Einer-Jensen14, Daniel P Depledge15, Judith Breuer16.
Abstract
BACKGROUND: Chlamydia trachomatis is a pathogen of worldwide importance, causing more than 100 million cases of sexually transmitted infections annually. Whole-genome sequencing is a powerful high resolution tool that can be used to generate accurate data on bacterial population structure, phylogeography and mutations associated with antimicrobial resistance. The objective of this study was to perform whole-genome enrichment and sequencing of C. trachomatis directly from clinical samples.Entities:
Mesh:
Year: 2014 PMID: 25388670 PMCID: PMC4233057 DOI: 10.1186/s12879-014-0591-3
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
Figure 1The effect of whole-genome enrichment on the proportion of reads mapping to The plot shows the proportion of reads mapping to a C. trachomatis reference (F/SW4) genome (% on-target reads) from a C. trachomatis cultured sample that was processed with and without whole-genome enrichment.
Figure 2SNP profiles in a sample processed with and without whole-genome enrichment. The plot illustrates SNP profiles in a cultured C. trachomatis sample processed with and without whole-genome enrichment compared to the GenBank reference strain isolate F/SW4 (Accession no. NC_017951.1). The two outer tracks shown in green illustrate the open reading frames (ORFs) annotated in the GenBank reference strain isolate F/SW4 with forward and reverse orientation. The red track shows the SNP differences found within coding regions between the cultured C. trachomatis sample processed without whole-genome enrichment and the reference strain. The blue track shows the SNP differences found within coding regions between the same cultured C. trachomatis sample processed with whole-genome enrichment and the reference strain. The SNPs are called at the consensus level.
Figure 3Proportion of reads mapping to reference genome in relation to total genome copies input. Plot showing the relationship between the numbers of input C. trachomatis genomes calculated and the proportion of reads mapping to the C. trachomatis reference (F/SW4) genome (% on-target reads).
Figure 4Coverage of reference genome and mean read depth in relation to total genome copies input. The plot shows the amount of target DNA (C. trachomatis DNA) used in the library preparations and the percentage coverage of the reference genome obtained with the various target input. The bars (black, grey, and white) illustrate the minimum mean read depth obtained from each sample. The columns to the far left represent a sample which did not undergo whole-genome enrichment during library preparation (indicated with text - No enrichment).
Overview of raw sequence data obtained from the cultures samples
| ID | Sample type | Genome copies in 1-2 ml DNA extract | Total input | Total reads | Mapped reads | Reads mapping to | Serovar/Genotype ( |
|---|---|---|---|---|---|---|---|
| CT_10 | Culture | 2.3x109 | 362,346,600 | 3450960 | 3058284 | 88.62% | B |
| CT_11 | Culture | 3.5x108 | 2,098,188,000 | 3847486 | 3455294 | 89.81% | C |
| CT_12 | Culture | 8.1x109 | 683,544,000 | 5055724 | 4632017 | 91.62% | G |
| CT_13 | Culture | 1.1x109 | 1,070,106,000 | 3923244 | 3512706 | 89.54% | H |
| CT_14 | Culture | 1.8x109 | 8,421,480,000 | 4814146 | 4524936 | 93.99% | I |
| CT_15 | Culture | 1.4x1010 | 5,587,859,200 | 4572350 | 4242802 | 92.79% | J |
| CT_16 | Culture | 1.4x1010 | 1,563,216,000 | 7734134 | 7240982 | 93.62% | L1 |
| CT_17 | Culture | 2.6x109 | 1224,048,000 | 6316494 | 5681915 | 89.95% | L2 |
All 8 samples were sequenced in a MiSeq run. The total input refers to the number of C. trachomatis genome copies used as input in the library preparation. The genotyping (ompA) was performed in the clinical laboratory.
Analysis of sequence data from DNA extracted directly from clinical samples
| ID | Original sample type | Ct-value ( | Genome copies in 100 μl DNA extract | Total input | Total reads | Mapped reads | Reads mapping to | Mean read depth | Coverage of ref. genome | Serovar/Genotype ( |
|---|---|---|---|---|---|---|---|---|---|---|
| CT-33 | Vaginal swab | 19.3 | 1.2x108 | 68,864,400 | 5,493,094 | 2,722,903 | 49.57% | 410 | 99.9% | D |
| CT-34 | Vaginal swab | 26.5 | 1.0x106 | 868,530 | 8,016,738 | 105,030 | 1.31% | 15.8 | 96.3% | J |
| CT-35 | Vaginal swab | 29 | 2.0x105 | 168,980 | 3,740,600 | 14,129 | 0.38% | 2 | 59.1% | - |
| CT-36 | Vaginal swab | 26.5 | 1.0x106 | 864,705 | 3,731,308 | 195,331 | 5.23% | 29.4 | 100.0% | E |
| CT-37 | Vaginal swab | 26.2 | 1.3x106 | 1,092,760 | 3,993,426 | 73,356 | 1.84% | 11 | 98.4% | C/K |
| CT-38 | Urine | 29.4 | 1.5x105 | 129,710 | 4,425,438 | 194,923 | 4.40% | 29.4 | 99.9% | Ia |
| CT-39 | Vaginal swab | 30.6 | 7.0x104 | 59,755 | 4,454,502 | 2,975 | 0.07% | 0.4 | 24.3% | - |
| CT-40 | Vaginal swab | 27.2 | 6.5x105 | 551,820 | 7,083,232 | 110,006 | 1.55% | 16.5 | 98.8% | Ia |
| CT-41 | Urine | 31.5 | 3.9x104 | 33,320 | 4,159,384 | 41,937 | 1.01% | 6.3 | 95.2% | E |
| CT-42 | Urine | 30.1 | 1.0x105 | 86,360 | 4,590,004 | 111,826 | 2.44% | 16.9 | 99.9% | G |
All 10 samples were multiplexed and sequenced twice on a MiSeq in two separate runs, after which the data-sets were combined. Ct-value refers to a cycle threshold obtained by qPCR and were defined in the clinical laboratory. Genome copies in 100 μl DNA extract from 200 μl original sample. The total input refers to the number of C. trachomatis genome copies used as input in the library preparation. In silico genotyping was performed comparing the ompA gene sequence to other known serovars found in GenBank.
Figure 5Coverage of reference genome - consensus sequences obtained directly from clinical samples. The plot shows the coverage of the reference genome (%) found when aligning the consensus sequences generated from the eight clinical samples that produced good genomic data.
Figure 6Neighbour-joining reconstruction of the phylogeny of . The figure shows an un-rooted phylogenetic reconstruction of the alignment data including 8 cultured and 8 clinical C. trachomatis samples sequenced in this study and 23 C. trachomatis genomes obtained from GenBank. The Neighbour-joining tree was constructed with 500 bootstrap replicates using a Jukes Cantor model of evolution and gamma correction for among-site rate variation with four rate categories. The gap/missing data was treated with a site coverage cut-off of 90%. The blue clusters represent the ocular/urogenital biovar (dark blue T2 clade and light blue T1 clade) and the red cluster represents the LGV biovar. The ocular serovars A-C cluster within the dark blue T2 clade. The samples sequenced in this study are highlighted in colours (blue and red) and the GenBank strains are represented in black.