| Literature DB >> 22558202 |
Mary G Krauland1, Julie C Dunning Hotopp, David R Riley, Sean C Daugherty, Jane W Marsh, Nancy E Messonnier, Leonard W Mayer, Hervé Tettelin, Lee H Harrison.
Abstract
In the United States, serogroup Y, ST-23 clonal complex Neisseria meningitidis was responsible for an increase in meningococcal disease incidence during the 1990s. This increase was accompanied by antigenic shift of three outer membrane proteins, with a decrease in the population that predominated in the early 1990s as a different population emerged later in that decade. To understand factors that may have been responsible for the emergence of serogroup Y disease, we used whole genome pyrosequencing to investigate genetic differences between isolates from early and late N. meningitidis populations, obtained from meningococcal disease cases in Maryland in the 1990s. The genomes of isolates from the early and late populations were highly similar, with 1231 of 1776 shared genes exhibiting 100% amino acid identity and an average π(N) = 0.0033 and average π(S) = 0.0216. However, differences were found in predicted proteins that affect pilin structure and antigen profile and in predicted proteins involved in iron acquisition and uptake. The observed changes are consistent with acquisition of new alleles through horizontal gene transfer. Changes in antigen profile due to the genetic differences found in this study likely allowed the late population to emerge due to escape from population immunity. These findings may predict which antigenic factors are important in the cyclic epidemiology of meningococcal disease.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22558202 PMCID: PMC3338715 DOI: 10.1371/journal.pone.0035699
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
General Features of Sequenced Genomes.
| Features | NM220 (early strain type) | NM233 (late strain type) |
|
| 21.60 | 27.25 |
|
| 242.869 | 246.036 |
|
| 380 | 343 |
|
| 2011600 | 1992570 |
|
| 51.7 | 51.8 |
|
| 288 | 261 |
|
| 1992450 | 1977986 |
|
| 10891 | 12240 |
|
| 1896 | 1879 |
|
| 898 | 900 |
|
| 99.71% | 99.86% |
Large Contig, contig at least 500 bases in length.
Contig N50, length such that 50% of the assembled genome lies in contigs of N50 size or longer.
ORF, open reading frame.
Q40+ Bases, all the bases in the assembled contigs that carry a phred-equivalent quality score of 40 and above. A score of 40 is equivalent to an accuracy of 99.99%.
Genes found in a majority of sequenced genomes but absent from NM220 and NM233.
| Protein Name | Representative Gene MC58 Genome |
| NadA | NMB_0394 |
| TspB | NMB_1548 |
| TspB | NMB_1628 |
| TspB | NMB_1747 |
| zonula occludens toxin family protein | NMB_1551 |
| zonula occludens toxin family protein | NMB_1626 |
| zonula occludens toxin family protein | NMB_1749 |
| tellurite resistance protein TehA | NMB_1603 |
| oxidoreductase, zinc-binding dehydrogenase family | NMB_1395 |
| peptidase, S24 family | NMB_0910 |
| PIN domain protein | NMB_1665 |
| death-on-curing family protein | NMB_0917 |
| Fic family protein | NMB_0255 |
| DNA-cytosine methyltransferase | NMB_0725 |
| putative plasmid toxin protein PemI | NMB_0914 |
| putative plasmid toxin protein PemK | NMB_0913 |
| hemagglutinin/hemolysin family protein | NMB_0493 |
| hemagglutinin/hemolysin family protein | NMB_1214 |
| hemolysin secretion/activation protein, ShlB family | NMB_1762 |
| hemolysin-activating acyltransferase, HlyC family protein | NMB_1763 |
| hemolysin-activating lysine-acyltransferase hlyC | NMB_1210 |
| HmbR | NMB_1668 |
Genes found in NM220 and NM233 but not in other sequenced genomes.
| Protein Name | NMY220 Gene |
| RelA/SpoT domain protein | NMY220_01221 |
| Exl2 | NMY220_01514 |
| hypothetical protein | NMY220_01650 |
| ATPase RavA | NMY220_01800 |
| putative uncharacterized protein | NMY220_01801 |
Figure 1Circular representation of similarity of late strain type genome to early strain type genome.
Ordered from outermost to innermost, the rims show: rim 1 (plus strand) and rim 2 (minus strand), predicted coding regions, colored by role category; rim 3, SNP density; rim 4, πN, plotted from 0 (base of rim) to 0.1; rim 5, πS, plotted 0 (base of rim) to 0.5. Role category colors are: amino acid biosynthesis, violet; biosynthesis of cofactors, prosthetic groups and carriers, light blue; cell envelope, light green; cellular processes, red; central intermediary metabolism, brown; disrupted reading frame, black; DNA metabolism, gold; energy metabolism, light gray; fatty acid and phospholipid metabolism, magenta; hypothetical proteins, black; mobile and extrachromosomal element functions, cyan; protein fate, pink; protein synthesis, pink; purines, pyrimidines, nucleosides and nucleotides, orange; regulatory functions, olive; signal transduction, olive; transcription, dark green; transport and binding proteins, blue-green; unknown function, gray; viral functions, gray. SNP density: pink, 0 SNPs per kbp; yellow, 1–25 SNPs per kbp; red, 26–50 SNPs per kbp; green, 51–75 SNPs per kbp; blue, 76–155 SNPs per kbp.
Figure 2NeighborNet tree of genetic relatedness of early strain type, late strain type and 20 available sequenced meningococcal genomes using aligned sequence of regions common to all genomes.
Tree produced using SplitsTree. Other genomes include: GB013, 053442, 961–5945, NZ05/33, CU385, M01-240355, M01-240149, M13399, M04-240196, H44/76, N1568, M6190, Z2491, G2136, M0579, FAM18, MC58, OX99-30304, alpha14, and ES14902.
Figure 3Histograms showing πN (number of non-synonymous substitutions per non-synonymous site) and πS (number of synonymous substitutions per synonymous site) in early strain type and late strain type genomes.
πN and πS calculated using codeml from the PAML package.
Genes with highest πS and πN in the early strain type and late strain type genome comparison. Maximum likelihood πS and πN calculated using the codeml program from PAML.
| Highest πS | Highest ΠN | ||||||||
| πN | πS | Early | Late | Predicted Protein | πN | πS | Early | Late | Predicted Protein |
| 0.2789 | 1.2413 | NMY220_1828 | NMY233_1807 | PorB | 0.2789 | 1.2413 | NMY220_1828 | NMY233_1807 | PorB |
| 0.0372 | 0.5443 | NMY220_1280 | NMY233_1266 | glucokinase | 0.1793 | 0.2512 | NMY220_1916 | NMY233_1893 | conserved hypothetical protein |
| 0.0786 | 0.5293 | NMY220_1431 | NMY233_1409 | putative permease, YjgP/YjgQ family | 0.1559 | 0.2387 | NMY220_1413 | NMY233_1391 | lactoferrin-binding protein |
| 0.0406 | 0.4260 | NMY220_1858 | NMY233_1837 | thiamine biosynthesis protein ThiS | 0.0980 | 0.0821 | NMY220_0860 | NMY233_0849 | prepilin-type N-terminal cleavage/methylation domain protein |
| 0.0173 | 0.3737 | NMY220_0538 | NMY233_0511 | magnesium and cobalt efflux protein CorC | 0.0964 | 0.1445 | NMY220_0962 | NMY233_0948 | conserved hypothetical protein |
| 0.0375 | 0.3675 | NMY220_1299 | NMY233_1283 | conserved hypothetical protein | 0.0959 | 0.2951 | NMY220_1241 | NMY233_1228 | conserved hypothetical protein |
| 0.0936 | 0.3545 | NMY220_0390 | NMY233_0372 | shikimate 5-dehydrogenase (AroE) | 0.0951 | 0.2212 | NMY220_1068 | NMY233_1071 | putative lipoprotein |
| 0.0299 | 0.3518 | NMY220_1433 | NMY233_1411 | ornithine carbamoyltransferase | 0.0936 | 0.3545 | NMY220_0390 | NMY233_0372 | shikimate 5-dehydrogenase (AroE) |
| 0.0266 | 0.3293 | NMY220_0388 | NMY233_0370 | lipopolysaccharide ABC transporter,ATP-binding protein | 0.0921 | 0.3260 | NMY220_0548 | NMY233_0521 | PilV |
| 0.0921 | 0.3260 | NMY220_0548 | NMY233_0521 | PilV | 0.0786 | 0.5293 | NMY220_1431 | NMY233_1409 | putative permease, YjgP/YjgQ family |
| 0.0256 | 0.3244 | NMY220_0961 | NMY233_0947 | phosphoribosyl aminoimidazolecarboxylase, ATPase subunit | 0.0754 | 0.2449 | NMY220_0822 | NMY233_0800 | conserved hypothetical protein |
| 0.0334 | 0.3023 | NMY220_1434 | NMY233_1412 | ketol-acid reductoisomerase | 0.0628 | 0.1951 | NMY220_1465 | NMY233_1442 | two component sensor kinase |
| 0.0239 | 0.3011 | NMY220_1869 | NMY233_1848 | cysteinyl-tRNA synthetase | 0.0613 | 0.2013 | NMY220_0445 | NMY233_0426 | sel1 repeat protein |
| 0.0959 | 0.2951 | NMY220_1241 | NMY233_1228 | conserved hypothetical protein | 0.061 | 0.2852 | NMY220_0858 | NMY233_0847 | prepilin-type N-terminal cleavage/methylation domain protein |
| 0.0107 | 0.2890 | NMY220_0977 | NMY233_0981 | conserved hypothetical protein | 0.0544 | 0.2142 | NMY220_0600 | NMY233_0574 | septum formation protein Maf |
| 0.0610 | 0.2852 | NMY220_0858 | NMY233_0847 | prepilin-type N-terminal cleavage/methylation domain protein | 0.0544 | 0.2142 | NMY220_0977 | NMY233_0981 | conserved hypothetical protein |
Early and late strain type population isolates, year of isolation, and results of PCR and Sanger sequencing of target genes.
| ID | Year | Strain Type | Gene Targets | ||||||||||
|
|
|
|
|
|
|
|
|
|
|
| |||
| NM 109 | 1996 | Early |
| NP | NP | E | L | V15 | E | L | L | L | E |
| NM 115 | 1996 | Early |
| NP | NP | E | L | V1 | E | L | L | L | E |
| NM 131 | 1996 | Early |
| NP | NP | E | E | V1 | E | L | L | L | E |
| NM 187 | 1998 | Early |
| NP | NP | E | E | F | E | E | E | E | E |
| NM 206 | 1999 | Early |
| NP | NP | E | L | V1 | E | L | L | L | E |
| NM 235 | 1999 | Early |
| NP | NP | E | E | F | E | L | L | L | E |
| NM 271 | 2001 | Early |
| NP | NP | E | E | V27 | E | L | L | L | R |
| NM 284 | 2001 | Early |
| NP | NP | E | E | M | E | L | L | L | E |
| NM 289 | 2001 | Early |
| NP | NP | E | E | V2 | E | E | E | E | E |
| NM51 | 1993 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM101 | 1995 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM119 | 1996 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM145 | 1997 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM165 | 1997 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM203 | 1999 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM249 | 2000 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM261 | 2000 | Late |
| P | P | L | L | L | L | L | L | L | L |
| NM264 | 2000 | Late |
| P | P | L | L | L | L | L | L | L | L |
Based upon OMP and PFGE profile [2];
NP, not present, by PCR;
E, early strain type allele;
L, late strain type allele;
V1, tbpB variant allele 1;
F, failed PCR;
V2, tbpB variant allele 2;
R, repeat difference;
M, mosaic allele;
P, present, by PCR.