| Literature DB >> 23176117 |
Peter M Power1, Stephen D Bentley, Julian Parkhill, E Richard Moxon, Derek W Hood.
Abstract
BACKGROUND: Haemophilus influenzae is an important human commensal pathogen associated with significant levels of disease. High-throughput DNA sequencing was used to investigate differences in genome content within this species.Entities:
Mesh:
Year: 2012 PMID: 23176117 PMCID: PMC3539920 DOI: 10.1186/1471-2180-12-273
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
strains selected for study
| RM7190 | a | Malaysia | 1973 | 1.5 | meningitis |
| RM6062 | a | England | 1965 | 1.5 | nasopharynx |
| RM6064 | a | England | 1966 | 1.5 | pleural fluid |
| RM6073 | a | England | 1966 | 1.6 | bronchitis |
| RM7017 | b | Ghana | 1983 | 1.6 | CSF |
| RM7060 | b | New York, USA | 1971 | 1.5 | nasopharynx |
| RM7414 | b | Kenya | 1980’s | 1.5 | |
| RM7419 | b | Kenya | 1980’s | 1.5 | |
| RM7651 | b | Norway | 1976 | 1.7 | |
| DC11238 | b | UK | 2003 | 1.8 | meningitis |
| DC800 | b | UK | 1989 | 1.9 | meningitis |
| DC8708 | b | UK | 2000 | 1.8 | |
| DCG1574 | b | Gambia | 1993 | 1.8 | nasopharynx |
| Eagan | b | | | 1.5 | |
| RM7578 | b | Switzerland | 1983 | 1.8 | |
| RM7582 | b | RSA | 1980’s | 1.8 | |
| RM7598 | b | USA | 1985 | 1.8 | |
| RM7018 | b* | Ghana | 1983 | 1.4 | CSF |
| RM7122 | b* | Australia | <1984 | 1.5 | meningitis |
| RM7459 | b* | Iceland | 1984 | 1.4 | CSF |
| RM7465 | b* | Iceland | 1985 | 1.6 | CSF |
| RM7617 | b* | Malaysia | 1970’s | 1.5 | CSF |
| RM6132 | c | England | 1964 | 1.6 | chronic sinusitis |
| RM6134 | c | England | 1975 | 1.4 | abscess |
| RM7422 | c | Kenya | 1986 | 1.4 | |
| RM6158 | e | England | 1962 | 1.7 | cystic fibrosis |
| RM6237 | f | England | 1963 | 1.4 | nasal discharge |
| RM7283 | f | Malaysia | 1972 | 1.5 | trachea |
| RM7290 | f | Malaysia | 1974 | 1.5 | trachea(malnutrition) |
| PLMIOG2822H-L | | | 1.6 | | |
| PLh.hlnctc10659T | | | 1.6 | | |
| PLHparaphorH-L | | | 1.7 | | |
| PLMIOG2838H-L | | | 1.4 | | |
| DCMO-099-5-LST-8 | UK | 1997 | 1.7 | nasopharynx (commensal) | |
| DCMO-099-8-MST-8 | UK | 1997 | 1.6 | nasopharynx (commensal) | |
| DCO-CFE24-1-T2ST-27 | UK | 2001 | 1.8 | nasopharynx (commensal) | |
| DCO-OM30-1-A1 | UK | 2001 | 1.6 | nasopharynx (commensal) | |
| DCT2T1ST-34 | Gambia | 2001 | 1.9 | nasopharynx (commensal) | |
| DCT5A1ST-41 | Gambia | 2001 | 1.9 | nasopharynx (commensal) | |
| DCT7B2ST-47 | Gambia | 2001 | 1.8 | nasopharynx (commensal) | |
| DCT8A1ST-52 | Gambia | 2001 | 1.9 | nasopharynx (commensal) | |
| RY15 | | | 1.7 | nasopharynx (commensal) | |
| RY20 | | | 1.7 | nasopharynx (commensal) | |
| RY22 | | | 1.9 | nasopharynx (commensal) | |
| RY8 | | | 1.7 | nasopharynx (commensal) | |
| DCT2B3ST-33 | hybrid | Gambia | 2001 | 1.4 | nasopharynx (commensal) |
| DCG-T53T1 | hybrid | Gambia | 2001 | 1.5 | nasopharynx (commensal) |
| DCT8B3ST-51 | hybrid | Gambia | 2001 | 1.5 | nasopharynx (commensal) |
| DH1500spain | NTHi | Spain | 2000 | 1.4 | COPD |
| DH1559spain | NTHi | Spain | 2000 | 1.5 | COPD |
| DH1630spain | NTHi | Spain | 2000 | 1.3 | COPD |
| DH398spain | NTHi | Spain | 2000 | 1.5 | COPD |
| Fi176 | NTHi | Finland | 1995 | 1.5 | otitis media |
| Fi723 | NTHi | Finland | 1995 | 1.6 | otitis media |
| Fi981 | NTHi | Finland | 1995 | 1.7 | otitis media |
| RM6011 | NTHi | UK | 1984 | 1.3 | meningitis |
| RM6019 | NTHi | UK | 1984 | 1.3 | meningitis |
| RM6033 | NTHi | UK | 1984 | 1.5 | pus hydrosalpinx |
| RM6051 | NTHi | UK | 1985 | 1.5 | CSF |
| RM7028 | NTHi | PNG | 1980’s | 1.5 | blood |
| RM7308 | NTHi | South Korea | 1984 | 1.5 | nasopharynx |
| RM7309 | NTHi | South Korea | 1984 | 1.5 | nasopharynx |
| RM7347 | NTHi | USA | 1985 | 1.4 | sputum |
| RM7448 | NTHi | Iceland | 1978 | 1.4 | blood |
| RM7477 | NTHi | Iceland | 1986 | 1.6 | |
| RM7490 | NTHi | RSA | 1980’s | 1.6 | CSF |
| DH1513spain | NTHi | Spain | 2000 | 1.5 | COPD |
| Fi1180 | NTHi | Finland | 1995 | 1.6 | otitis media |
| Fi162 | NTHi | Finland | 1995 | 1.7 | otitis media |
| Fi667 | NTHi | Finland | 1995 | 1.7 | otitis media |
| RM7029 | NTHi | PNG | 1980’s | 1.6 | blood |
| RM7637 | NTHi | China | 1971 | 1.4 | sputum |
| DC7331 | NTHi | UK | 1997 | 1.8 | meningitis |
| DC7654 | NTHi | UK | 1997 | 1.8 | blood |
| DC7695 | NTHi | UK | 1997 | 1.9 | CSF |
| DCg2120 | NTHi | Gambia | | 1.8 | nasopharynx |
| DCH3151 | NTHi | Gambia | 1993 | 1.8 | pneumonia |
| DCO-OM33-2B3ST-21 | NTHi | UK | 2001 | 1.5 | nasopharynx |
| PLMIOG2819 | | | | 1.5 | |
| PLMIOG2820 | | | | 1.5 | |
| RM6006 | | | | 1.4 | |
| PLMIOG2836 | | | | 1.7 | |
| DCMO-009-14-S-TR-ST-12 | | UK | 1998 | 1.6 | nasopharynx |
| PL10839T | | | | 1.6 | |
| PLMIOG2837 | | | | 1.6 | |
| RM7054 | NTHi | USA | 1984 | | blood (sepsis) |
| Fi1247 | NTHi | Finland | 1995 | | otitis media |
| Fi1124 | NTHi | Finland | 1995 | | otitis media |
| Fi486 | NTHi | Finland | 1995 | | otitis media |
| Fi432 | NTHi | Finland | 1995 | | otitis media |
| RM7068 | NTHi | PNG | | | pneumonia |
| Fi285 | NTHi | Finland | 1995 | | otitis media |
| PP H.parasuis | NTHi | | | | |
| RM7876 | | | | | |
| Fi1200 | NTHi | Finland | 1995 | | otitis media |
| RM7066 |
The 96 strains selected for genome sequencing are listed along with respective information on serotype, isolation, and associated disease. The total length of the genome sequence following assembly is listed (to the nearest 0.1 Mbp) for each strain. The 11 strains below the horizontal line are those for which the quality of the assembled genome sequence was insufficient for the sequence data to be included in subsequent analyses.
* Strains were originally designated as NT.
Figure 1Whole genome heat map, constructed by Mauve, to achieve pairwise percentage of genome sequence alignment. Pair-wise Mauve alignments were conducted with 60 H. influenzae and H. haemolyticus genome sequences from strains included on a single sequencing flow cell. For each pair-wise comparison the length of the alignment achieved, expressed as the percentage of the total sequence length, was calculated and a distance matrix created. The heat map was created using the R statistical package and shows the clustered genomes determined by the default R heatmap function clustering methods ( http://www.r-project.org/). At the top of the figure, an indication of the relatedness between genomes is given. Mauve achieved pairwise genome sequence alignments of between 69.8 and 94.4% across our range of genomes. Strains are listed in the same order on the x and y axes; groupings discussed in the text are indicated along the top axis and the relevant strains are indicated by brackets on the right hand side axis, labelled with a Greek letter.
Figure 2SNPs of type b strain sequences when compared with Hib strain 10810. The complete genome sequence of the Hib strain 10810 was used as a reference against which the sequence reads of each strain were mapped using MAQ. Each vertical black line represents the location of a SNP. The equivalent groupings to those identified in Figure 1 are labelled on the right hand side. Regions marked at the bottom of the figure represent genome segments which are present in the reference strain 10810 but that may not be found in all other strains. The brackets on the left hand side of the figure indicate the number of MLST alleles shared between the pairs of genomes indicated; the sequence type (ST) of each strain is indicated to the right of its name.
Figure 3Size of SNP blocks found in the β1 group of Hib strains. This histogram represents the frequency of different sizes of SNP blocks (as defined in the text) in the genomes of β1 H. influenzae type b strains.
Regions in the strain RM7060 genome not found in strain 10810
| ZP_01791522 | NikR predicted transcriptional regulator | |
| AAL79955 | Hia/YadA-like similar to neisserial GNA992 | |
| AAM74927 | Hap peptidase S6 | |
| ZP_05977792 | putative carboxylate/amino acid/amine transporter | |
| P46495 | Putative integrase/recombinase HI_1572 | |
| ZP_00134779 | Phage-related protein, tail component | |
| YP_001968298 | Phage-related protein, tail component | |
| ZP_01791539 | Mu-like prophage protein | |
| YP_003007008 | Phage-related minor tail protein | |
| ZP_01791533 | putative phage tail component | |
| YP_001290203.1 | tRNA-dihydrouridine synthase C | |
| YP_001053216.1 | predicted bacteriophage tail assembly protein | |
| ZP_05990265 | hypothetical protein COK_2151 | |
| ZP_04753126 | possible prophage antirepressor | |
| ZP_04464399 | Phage Mu protein F like protein | |
| YP_003007004 | phage protein |
Figure 4Frequency of Eaganstrand Eagan SNPs in the Rd+Eaganstrand Rd+Eagan transformants. Panel A; Location and frequency of EaganstrR specific SNPs plotted as estimated number of strains (y-axis) against location in RdKW20 genome sequence (x-axis) using SNPSeeker. MAQ was used to identify SNPs in the pooled sequences from 200 transformants. The location of the strR point mutation is indicated. Panel B; A magnified view of one region marked on Panel A showing a putative secondary transformation event. The extent of the chromosomal region involved with this predicted transformation event (13 kbp) is marked. Panel C; A magnified view of the primary transformation event from Panel A with the location of the strR point mutation marked. Panel D; The location and frequency of Eagan-specific SNPs in the genome of pooled Rd+Eagan transformants (200); Eagan unmarked (wild-type) genomic DNA was used as the donor.