| Literature DB >> 31860436 |
Tegan M Harris1, Erin P Price2,1, Derek S Sarovich2,1, Niels Nørskov-Lauritsen3, Jemima Beissbarth1, Anne B Chang4,1, Heidi C Smith-Vaughan5,1.
Abstract
The heterogeneous and highly recombinogenic genus Haemophilus comprises several species, some of which are pathogenic to humans. All share an absolute requirement for blood-derived factors during growth. Certain species, such as the pathogen Haemophilus influenzae and the commensal Haemophilus haemolyticus, are thought to require both haemin (X-factor) and nicotinamide adenine dinucleotide (NAD, V-factor), whereas others, such as the informally classified 'Haemophilus intermedius subsp. intermedius', and Haemophilus parainfluenzae, only require V-factor. These differing growth requirements are commonly used for species differentiation, although a number of studies are now revealing issues with this approach. Here, we perform large-scale phylogenomics of 240 Haemophilus spp. genomes, including five 'H. intermedius' genomes generated in the current study, to reveal that strains of the 'H. intermedius' group are in fact haemin-independent H. haemolyticus (hiHh). Closer examination of these hiHh strains revealed that they encode an intact haemin biosynthesis pathway, unlike haemin-dependent H. haemolyticus and H. influenzae, which lack most haemin biosynthesis genes. Our results suggest that the common ancestor of modern-day H. haemolyticus and H. influenzae lost key haemin biosynthesis loci, likely as a consequence of specialized adaptation to otorhinolaryngeal and respiratory niches during their divergence from H. parainfluenzae. Genetic similarity analysis demonstrated that the haemin biosynthesis loci acquired in the hiHh lineage were likely laterally transferred from a H. parainfluenzae ancestor, and that this event probably occurred only once in hiHh. This study further challenges the validity of phenotypic methods for differentiating among Haemophilus species, and highlights the need for whole-genome sequencing for accurate characterization of species within this taxonomically challenging genus.Entities:
Keywords: 'Haemophilus intermedius'; Haemophilus haemolyticus; comparative genomics; haemin biosynthesis; haemin-independent Haemophilus haemolyticus
Mesh:
Substances:
Year: 2020 PMID: 31860436 PMCID: PMC7067038 DOI: 10.1099/mgen.0.000303
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
hiHh isolates used in this study
|
Isolate |
Anatomical site |
Country of origin |
Year of isolation |
Haemin (X-factor) dependence* |
NAD (V-factor) dependence* |
Haemin biosynthesis pathway† |
Genome reference |
|---|---|---|---|---|---|---|---|
|
60819_B_Hi1 |
BAL |
Australia |
2010 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
60824_B_Hi4 |
BAL |
Australia |
2010 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
60971_B_Hi3 |
BAL |
Australia |
2012 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
60982_B_Hi1 |
BAL |
Australia |
2012 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
65117_B_Hi3 |
BAL |
Australia |
2011 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
65151_B_Hi4 |
BAL |
Australia |
2011 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
839_HINF |
BAL |
USA |
2013 |
Unknown |
Unknown |
C5, PP-dependent, O2-independent |
[ |
|
CCUG 11096 |
Pleural fluid |
Sweden |
1981 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
CCUG 15949 |
Eye |
Sweden |
1984 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
CCUG 30218 |
Cerebrospinal fluid |
Sweden |
1992 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
CCUG 31732 |
Ascitic fluid |
Sweden |
1993 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
|
CCUG 66565 |
Sputum |
Sweden |
2014 |
Unknown |
Unknown |
C5, PP-dependent, O2-independent |
This study |
|
F0629 |
Oral cavity |
USA |
2015 |
Unknown |
Unknown |
C5, PP-dependent, O2-independent |
This study |
|
PN24 |
Urine |
Denmark |
2004 |
− |
+ |
C5, PP-dependent, O2-independent |
[ |
BAL, Bronchoalveolar lavage.
*, Recorded phenotype.
†, Aminolevulinic acid biosynthesis occurs using the C5 pathway [63]; coproporphyrinogen III conversion to protohaem is protoporphyrin-dependent, and occurs in an oxygen-independent manner [63].
Fig. 1.Phylogenomic analysis of 240 spp. to identify placement of hiHh and ‘H. intermedius’. A midpoint-rooted maximum parsimony tree was constructed using 30 345 orthologous, biallelic SNPs found among 152 . (purple), including 16 clade I (green) [36] and three fucP-negative clade (blue) [17, 18] genomes, 64 . (black) including 14 hiHh and ‘H. intermedius’ genomes (pink), and 24 . genomes (red). Consistency index=0.1482. Bootstrap values were inferred from 1000 replicates. Clades with >80 % support are annotated with a filled orange circle. Type strains are denoted with an asterisk. Bar, 900 bp.
Fig. 2.The haemin biosynthesis pathway in hiHh. (a) The hiHh strains synthesize haemin by utilizing the C5 pathway for 5-aminolevulinic acid synthesis. Conversion of coproporphyrinogen III to protohaem occurs in a protoporphyrin-dependent, oxygen-independent manner. (b) Heatmap showing the percentage nucleotide identity of haemin biosynthesis genes when compared to reference gene sequences extracted from the assembled genome of hiHh 60819_BAL_Hi1. The heatmap is plotted against a midpoint-rooted, maximum parsimony tree, constructed using 153 468 orthologous, biallelic SNPs found amongst the 64 . genomes. hiHh are shown in pink. Consistency index=0.2380. Bootstrap values were inferred from 1000 replicates. Nodes with >80 % support are annotated with a filled green circle. Bar, 1000 bp.
Fig. 3.Maximum-likelihood phylogeny of haemin biosynthesis pathway genes in 14 hiHh (pink) and 24 . (black), constructed using 73 orthologous, biallelic SNPs, with reference to a concatenated nucleotide sequence of haemin biosynthesis genes from T3T1 (GenBank accession no. NC_015964.1). Bar, nucleotide substitutions per site.
Fig. 4.Evolution of the haemin biosynthesis pathway compared to whole-genome evolution in hiHh. Midpoint-rooted maximum parsimony trees of the 14 hiHh were constructed with reference to 60819_BAL_Hi1. Bootstrap values were inferred from 1000 replicates. The whole-genome phylogeny (left) was derived from 114 346 orthologous, biallelic, SNPs, using a merged, multi-contig 60819_BAL_Hi1 assembled genome as the reference. The haemin biosynthesis pathway phylogeny (right) was derived from 548 orthologous, biallelic, SNPs, with reference to a concatenated nucleotide sequence of haemin biosynthesis genes from 60819_BAL_Hi1. In the tanglegram plot, lines are used to connect the same taxa in both trees. The absence of entanglement does not reflect topological differences in the trees.