| Literature DB >> 21541312 |
Biju Joseph1, Roland F Schwarz, Burkhard Linke, Jochen Blom, Anke Becker, Heike Claus, Alexander Goesmann, Matthias Frosch, Tobias Müller, Ulrich Vogel, Christoph Schoen.
Abstract
BACKGROUND: Neisseria meningitidis is a naturally transformable, facultative pathogen colonizing the human nasopharynx. Here, we analyze on a genome-wide level the impact of recombination on gene-complement diversity and virulence evolution in N. meningitidis. We combined comparative genome hybridization using microarrays (mCGH) and multilocus sequence typing (MLST) of 29 meningococcal isolates with computational comparison of a subset of seven meningococcal genome sequences. PRINCIPALEntities:
Mesh:
Year: 2011 PMID: 21541312 PMCID: PMC3082526 DOI: 10.1371/journal.pone.0018441
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Strains used in the study.
| Strain | Serogroup | ST | CC | Lineage | Source | Country | Year | Reference | Genome |
| α14 |
| 53 | 53 | carriage | carrier | Germany | 1999 |
| AM889136* |
| α704 |
| 198 | 198 | carriage | carrier | Germany | 2000 |
| NA |
| WUE2594 | A | 5 | 5 | invasive | IMD | Germany | 1991 |
| NA |
| Z2491 | A | 4 | 4 | invasive | IMD | Gambia | 1983 |
| AL157959* |
| DE6894 | B | 32 | 32 | invasive | IMD | Germany | 2000 |
| NA |
| α4 | B | 19 | 18 | carriage | carrier | Germany | 1999 |
| NA |
| α31 | B | 797 | 269 | invasive | carrier | Germany | 1999 |
| NA |
| α78 | B | 44 | 41/44 | invasive | carrier | Germany | 1999 |
| NA |
| α490 | B | 364 | 364 | carriage | carrier | Germany | 2000 |
| NA |
| α522 | B | 35 | 35 | carriage | carrier | Germany | 2000 |
| NA |
| α547 | B | 41 | 41/44 | invasive | carrier | Germany | 2000 |
| NA |
| α710 | B | 136 | 41/44 | invasive | carrier | Germany | 2000 |
| CP001561 |
| DE7127 | B | 41 | 41/44 | invasive | IMD | Germany | 2000 |
| NA |
| DE7865 | B | 269 | 269 | invasive | IMD | Germany | 2001 |
| NA |
| DE7901 | B | 18 | 18 | carriage | IMD | Germany | 2001 |
| NA |
| DE8638 | B | 162 | 162 | carriage | IMD | Germany | 2002 |
| NA |
| DE9155 | B | 44 | 41/44 | invasive | IMD | Germany | 2002 |
| NA |
| MC58 | B | 74 | 32 | invasive | IMD | UK | 1983 |
| AE002098* |
| DE6904 | C | 8 | 8 | invasive | IMD | Germany | 2002 |
| NA |
| DE7017 | C | 11 | 11 | invasive | IMD | Germany | 2000 |
| NA |
| FAM18 | C | 11 | 11 | invasive | IMD | USA | 1980 |
| AM421808* |
| WUE2121 | C | 11 | 11 | invasive | IMD | Germany | 1997 |
| NA |
| α95 | 29E | 106 | 106 | carriage | carrier | Germany | 1999 |
| NA |
| α153 | 29E | 60 | 60 | carriage | carrier | Germany | 1999 |
| AM889137 |
| α707 | 29E | 254 | 254 | carriage | carrier | Germany | 2000 |
| NA |
| α3 | W-135 | 174 | 174 | carriage | carrier | Germany | 1999 |
| NA |
| α275 | W-135 | 22 | 22 | carriage | carrier | Germany | 2000 |
| AM889138 |
| α267 | Y | 23 | 23 | carriage | carrier | Germany | 2000 |
| NA |
| α533 | Y | 92 | 92 | carriage | carrier | Germany | 2000 |
| NA |
Assignments of clonal complexes (CCs) into invasive and carriage lineages is based on refs. [15].
Asterisks behind the GenBank accession numbers indicate that the respective genome was represented on the microarray used for mCGH analyses.
Invasive meningococcal disease.
Figure 1Characterization of the meningococcal gene complement.
A) Boxplots comparing the GC content and CAI between core, distributed and strain-specific genes. Core genes have a significantly higher GC content than the distributed genes and also a significantly higher CAI than genes from the accessory genome comprising distributed and strain-specific genes (p<0.001, Wilcoxon test). B) Distribution of genes from the different genomic compartments also including pairwise differences between strains from the same CC among the major COG functional classes. There are significant differences in the functional composition between all four genomic compartments (p<0.01, χ2 test). Compared to the other genomic compartments the core genome is enriched for metabolic genes as well as genes involved in cellular processes, information storage and processing, respectively (p<0.01, Fisher's exact test). C) Histogram of the predicted subcellular localization of genes from the different genomic compartments. Again, there are significant differences between the different genomic compartments (p<0.01, χ2 test). In line with their housekeeping functions, core genes are localized primarily within the cytoplasm, the inner membrane and the periplasmic space whereas the distributed genes and genes that differ between strains from the same CC are in turn enriched for genes coding for outer membrane proteins (p<0.05, Fisher's exact test).
Figure 2Clustering of strains based on the accessory genome.
A maximum parsimony (MP) tree is shown with bootstrap values at nodes used for grouping of strains into eight GGs from their gene content comprising 470 parsimony informative genes as revealed by mCGH. Strains with an asterisk next to their name have further been used for the in silico screening for intragenomic recombination in 1092 genes from the core genome as estimated via mCGH of the entire sample population. Next to the MP tree, the CCs and serogroups (Sg) of the respective strains are given with hypervirulent CCs in black boxes, and right to the Sgs a virtual array image displaying the presence and absence of 1679 genes is shown. Strains from the same serogroup have in general highly similar gene content, and strains from the same CC always belong to the same GG. In turn, a GG can comprise strains from different CCs, and with the exception of the two serogroup W-135 strains split between GG-II and GG-III and the serogroup B strain DE8638, GGs always comprise strains from the same serogroup. However, no two strains have exactly the same gene content (see also Figure 1 for the functional profile of genes differently present among otherwise identical pairs of strains). Right below the virtual array, the spotted genes are color coded according to the source genome (representing the genomes of strain Z2491, MC58, FAM18 and α14), and the presence of putatively mobile DNA is depicted below with IHT-B, IHT-C, IHT-E as well as the λ prophage denoted as B, C, E and λ, respectively, in the respective lanes (Abbreviations: IHT, island of horizontal transfer; Φ: prophage; MME, minimal mobile element). At the lower margin, the FDR for the association with hyperinvasive lineages is color coded for each gene with genes having a FDR<0.05 depicted in blue.
Figure 3Comparison of strain clustering based on mCGH and MLST.
A) Neighbor-net representation of gene distances based on the distribution of genes from the accessory genome. Strain names as well as the defining splits having a bootstrap support of greater than 85% are printed in the same colors as in Figure 2. Clustering based on gene content distance and maximum parsimony results in the same eight genome groups. Serogroup B and C strains as well as the two cnl strains are separated from the other four serogroups based on gene content, and the cycles within the serogroup B/C subgraph indicate a high rate of gene transfer among these strains. Strains from the same CC always reside on common splits. B) Neighbor-net representation based on the concatenated alignments of the seven housekeeping gene fragments form the meningococcal core genome used for MLST comprising 383 parsimony informative sites. A MLSA-based clustering of strains results in the separation of strains from the same serogroup or belonging to the same GGs. The large number of cycles in the graph indicates a high rate of homologous recombination among the genes used for MLST and that a MLSA-based reconstruction of the deep phylogeny of these strains is not possible.
Genes significantly associated with hyperinvasive lineages.
| Gene | Function | OR | 95%-CI | p |
|
| ||||
| NMB0369 | Conserved protein with hedgehog/intein (Hint) domain | n. d. | 3.82 - ∞ | 0.008 |
| NMB0502 | TpsA C-terminal cassette TpsS3 | n. d. | 3.82 - ∞ | 0.009 |
| NMB0506 | TpsA C-terminal cassette TpsS4 | 14.33 | 1.96 - 189.05 | 0.041 |
| NMB0508 | Conserved hypothetical protein | 14.33 | 1.96 - 189.05 | 0.042 |
|
| ||||
| NMB0899 | Phage associated conserved protein | 20.32 | 2.62 - 287.23 | 0.014 |
| NMB0900 | Putative KilA-N domain-containing protein | 31.6 | 3.64 - 523.63 | 0.009 |
| NMB0901 | Conserved hypothetical protein | 20.32 | 2.62 - 287.23 | 0.014 |
| NMB0903 | Conserved hypothetical cytoplasmic protein | 20.32 | 2.62 - 287.23 | 0.015 |
| NMB0904 | Hypothetical periplasmic protein | 20.32 | 2.62 - 287.23 | 0.016 |
| NMB0906 | Phage associated conserved protein | 20.32 | 2.62 - 287.23 | 0.017 |
| NMB0907 | Conserved hypothetical protein | 20.32 | 2.62 - 287.23 | 0.018 |
| NMB0910 | Putative phage HTH-type transcriptional regulator | 20.32 | 2.62 - 287.23 | 0.019 |
| NMB0916 | Putative membrane protein (fragment) | 20.32 | 2.62 - 287.23 | 0.020 |
|
| ||||
| NMB1543 | Putative phage replication initiation factor | 20.32 | 2.62 - 287.23 | 0.021 |
| NMB1544 | Conserved hypothetical protein | 41.48 | 3.87 - 2316.04 | 0.009 |
| NMB1545 | Hypothetical cytoplasmic protein | 41.48 | 3.87 - 2316.04 | 0.011 |
| NMB1546 | Hypothetical integral membrane protein | 41.48 | 3.87 - 2316.04 | 0.013 |
| NMB1547 | Hypothetical integral membrane protein | 41.48 | 3.87 - 2316.04 | 0.018 |
| NMB1550 | Hypothetical integral membrane protein | 29.64 | 2.88 - 1603.74 | 0.018 |
| NMB1630 | Hypothetical integral membrane protein | 41.48 | 3.87 - 2316.04 | 0.026 |
|
| ||||
| NMB1749 | Putative zonular occludens toxin-like protein | n. d. | 3.10 - ∞ | 0.013 |
| NMB1750 | Putative pilin gene-inverting protein (PIVML) | n. d. | 4.00 - ∞ | 0.010 |
|
| ||||
| NMO0346 | CRISPR-associated protein Cas2 | 0.07 | 0.01 - 0.50 | 0.044 |
| NMO0347 | CRISPR-associated protein Cas1 | 0.05 | 0.00 - 0.38 | 0.023 |
The functional annotation was taken from the NeMeSys database [69].
Odds ratio of a Fisher's exact test.
95%-confidence interval of the respective odds ratio.
P-values were computed upon the simultaneous comparison of 1679 genes using Fisher's exact test with the Benjamini-Hochberg multiple testing correction.
n. d., not defined.
Selected examples of recombinant housekeeping genes from the core genome.
| Gene | Function | Pathway/Category |
|
| ||
|
| Adenosylmethionine-8-amino-7-oxononanoate aminotransferase | Biotin |
|
| Aethiobiotin | Biotin |
|
| Bifunctional tetrahydrofolate and dihydrofolate synthase | Folic acid |
|
| Glutamyl-tRNA reductase | Heme, porphyrin |
|
| Delta-aminolevulinic acid dehydratase | Heme, porphyrin |
|
| Enterobactin uptake system binding lipoprotein FetB | Iron acquisition |
|
| Putative ferric enterobactin uptake system ATP-binding protein FetE | Iron acquisition |
|
| Quinolinate synthetase B protein | Nicotinamide adenine dinucleotide |
|
| Pyridoxal phosphate biosynthetic protein PdxJ | Pyridoxine |
|
| Riboflavin biosynthesis protein RibD | Riboflavin, FAD/FMN |
|
| Riboflavin biosynthesis protein | Riboflavin, FAD/FMN |
|
| S-adenosylmethionine synthetase | S-Adenosyl methionine biosynthesis |
|
| Sulfate/thiosulfate import ATP-binding protein CysA | Sulphur acquisition |
|
| Sulfate transport system permease protein CysT | Sulphur acquisition |
|
| Sulfate transport system permease protein CysW | Sulphur acquisition |
|
| ||
|
| Enolase (2-phosphoglycerate dehydratase) | Glycolysis |
|
| Fructose-bisphosphate aldolase | Glycolysis |
|
| Fructose-1,6-bisphosphatase | Glycolysis |
|
| Glyceraldehyde 3-phosphate dehydrogenase A | Glycolysis |
|
| Glucose-6-phosphate isomerase 1/2 | Glycolysis |
|
| Phosphoglycerate kinase | Glycolysis |
|
| Phosphoglucomutase | Glycolysis |
|
| Pyruvate kinase II | Glycolysis |
|
| NADH-quinone oxidoreductase chain C | Oxidative phosphorylation |
|
| NADH-quinone oxidoreductase chain D | Oxidative phosphorylation |
|
| NADH-quinone oxidoreductase chain E | Oxidative phosphorylation |
|
| NADH-quinone oxidoreductase chain F | Oxidative phosphorylation |
|
| NADH-quinone oxidoreductase chain H | Oxidative phosphorylation |
|
| NADH-quinone oxidoreductase chain J | Oxidative phosphorylation |
|
| NADH-quinone oxidoreductase chain N | Oxidative phosphorylation |
|
| Transketolase (TK) | Pentose phosphate pathway |
|
| Fumarate hydratase class I | TCA cycle |
|
| Fumarate hydratase class II | TCA cycle |
|
| Dihydrolipoyl dehydrogenase | TCA cycle |
|
| ||
|
| A/G-specific adenine glycosylase | Base excision repair |
|
| DNA-3-methyladenine glycosylase I | Base excision repair |
|
| Uracil-DNA glycosylase | Base excision repair |
|
| Replicative DNA helicase | DNA replication |
|
| Ribonuclease HII (RNase HII) | DNA replication |
|
| DNA ligase | DNA replication and repair |
|
| DNA polymerase III epsilon subunit | DNA replication, mismatch repair |
|
| DNA polymerase III tau/gamma subunits | DNA replication, mismatch repair |
|
| RecA protein (recombinase A) | Homologous recombination |
|
| Exodeoxyribonuclease V alpha chain | Homologous recombination |
|
| ATP-dependent DNA helicase RecG | Homologous recombination |
|
| Holliday junction DNA helicase RuvA | Homologous recombination |
|
| Holliday junction DNA helicase RuvB | Homologous recombination |
|
| Crossover junction endodeoxyribonuclease RuvC | Homologous recombination |
|
| DNA helicase II | Mismatch and base excision repair |
|
| Exodeoxyribonuclease VII small subunit | Mismatch repair |
Gene abbreviations are based on the respective KEGG entries.
The functional annotation was taken from the NeMeSys database [69].