| Literature DB >> 31379777 |
Ammar Aziz1, Derek S Sarovich1,2, Elizabeth Nosworthy1, Jemima Beissbarth1, Anne B Chang1,3, Heidi Smith-Vaughan1, Erin P Price1,2, Tegan M Harris1.
Abstract
Non-typeable Haemophilus influenzae (NTHi), an opportunistic pathogen of the upper airways of healthy children, can infect the lower airways, driving chronic lung disease. However, the molecular basis underpinning NTHi transition from a commensal to a pathogen is not clearly understood. Here, we performed comparative genomic and transcriptomic analyses of 12 paired, isogenic NTHi strains, isolated from the nasopharynx (NP) and bronchoalveolar lavage (BAL) of 11 children with chronic lung disease, to identify convergent molecular signatures associated with lung adaptation. Comparative genomic analyses of the 12 NP-BAL pairs demonstrated that five were genetically identical, with the remaining seven differing by only 1 to 3 mutations. Within-patient transcriptomic analyses identified between 2 and 58 differentially expressed genes in 8 of the 12 NP-BAL pairs, including pairs with no observable genomic changes. Whilst no convergence was observed at the gene level, functional enrichment analysis revealed significant under-representation of differentially expressed genes belonging to Coenzyme metabolism, Function unknown, Translation, ribosomal structure, and biogenesis Cluster of Orthologous Groups categories. In contrast, Carbohydrate transport and metabolism, Cell motility and secretion, Intracellular trafficking and secretion, and Energy production categories were over-represented. This observed trend amongst genetically unrelated NTHi strains provides evidence of convergent transcriptional adaptation of NTHi to pediatric airways that deserves further exploration. Understanding the pathoadaptative mechanisms that NTHi employs to infect and persist in the lower pediatric airways is essential for devising targeted diagnostics and treatments aimed at minimizing disease severity, and ultimately, preventing NTHi lung infections and subsequent chronic lung disease in children.Entities:
Keywords: NTHi; RNA-seq; adaptation; bacterial evolution; comparative genomics; convergence; non-typeable Haemophilus influenzae; transcriptomics
Year: 2019 PMID: 31379777 PMCID: PMC6646836 DOI: 10.3389/fmicb.2019.01622
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary of genomic and transcriptomic differences between paired, isogenic NTHi isolates obtained from the nasopharynx (NP) and lower airways (bronchoalveolar lavage; BAL) of 11 Australian children with chronic lung disease.
| 60051 | NP | 1 | 0 | 1 | 58 (4/54) | |
| BAL | 1 | |||||
| 60068 | NP | 3 | 0 | 1 | 54 (1/53) | – |
| BAL | 1 | |||||
| 60069 | NP | 2 | 0 | 1 | 0 | – |
| BAL | 1 | |||||
| 60294 | NP | 1 | 0 | 0 | 0 | – |
| BAL | 1 | |||||
| 60295 | NP | 1 | 1 | 0 | 46 (38/8) | – |
| BAL | 1 | |||||
| 60316 | NP | 1 | 0 | 0 | 0 | – |
| BAL | 1 | |||||
| 60362 | NP | 1 | 0 | 0 | 15 (14/1) | – |
| BAL | 2 | |||||
| 60370 | NP | 2 | 0 | 0 | 2 (2/0) | – |
| BAL | 2 | |||||
| 60373_P1 | NP | 1 | 0 | 1* | 5 (3/2) | |
| BAL | 1 | |||||
| 60373_P2 | NP | 3 | 0 | 0 | 4 (4/0) | – |
| BAL | 4 | |||||
| 65001 | NP | 2 | 0 | 3 | 0 | – |
| BAL | 1 | |||||
| 65290 | NP | 3 | 1 | 0 | 4 (4/0) | – |
| BAL | 4 |
Summary of genetic variants identified between 12 paired, isogenic non-typeable Haemophilus influenzae isolates retrieved from the nasopharynx vs. lower airways (bronchoalveolar lavage; BAL).
| 60051 | AAGACG | A | Glu9fs (556) | NP | Loss-of-function frameshift; premature stop yields 13aa product | Type I restriction enzyme HindVIIP M protein | DNA binding; site-specific DNA-methyltransferase (adenine-specific) activity | Defense mechanisms | ||
| 60068 | GCCAACCAA | G | Thr35fs (992) | NP | Loss-of-function frameshift; premature stop yields 12aa product | Hemoglobin-haptoglobin binding protein B | Receptor and transporter activity | Inorganic ion transport and metabolism | ||
| 60069 | Intergenic | – | GATTATT | G | – | NP | Intergenic (62 bp upstream of NTHI0369 [ | – | – | – |
| 60295 | C | T | Arg407Lys (992) | NP | Non-synonymous | Hemoglobin-haptoglobin binding protein B | Receptor and transporter activity | Inorganic ion transport and metabolism | ||
| 60373_P1† | CGAGA (60%); CGAGACGAGA (40%) | A | Glu12fs (556) | NP | Loss-of-function frameshift; premature stop yields 16aa or 24aa product | Type I restriction enzyme HindVIIP M protein | DNA binding; site-specific DNA-methyltransferase (adenine-specific) activity | Defense mechanisms | ||
| 65001 | TTTGA | T | Ser18fs (320) | NP | Loss-of-function frameshift; premature stop yields 36aa product | CMP-Neu5Ac–lipooligosaccharide alpha 2–3 sialyltransferase | Transferase activity; transfer of glycosyl groups | Cell envelope biogenesis | ||
| C | CAGCAGTAGG | Ala269_Gly27 0insValGlyAla (365) | NP | In frame insertion of ValGlyAla; 368aa product | Outer membrane protein P2 | Porin activity | Cell envelope biogenesis | |||
| ACTTACCAGCGAG | A | Thr173_Leu1 76del (390) | NP | In-frame deletion of 4aa | Type I restriction-modification system specificity protein | DNA binding and modification | Defense mechanisms | |||
| 65290 | T | C | Glu508Gly (800) | NP | Non-synonymous | Protective surface antigen D15 | Cell outer membrane assembly; protein insertion into membrane | Cell envelope biogenesis | ||
FIGURE 1Read mapping analysis of the highly mutable hsdM3 pentanucleotide simple sequence repeat (SSR) tract in 60051 and 60373_P1 NP-BAL pairs. (A) Graphical representation of 60051 BAL Hi1 Illumina reads aligned against the hdsM3 region of the 60051 NP Hi1 reference assembly. One allele consisting of a 5 bp deletion (AGACG) was observed. (B) Graphical representation of 60373_P1 BAL Hi1 Illumina reads aligned against the 60373_P1 NP Hi1 reference assembly. Two different alleles consisting of 5 bp (AGACG) and 10 bp (AGACGx2) deletions were observed in the BAL isolate. Blue and red bars represent forward and reverse reads (respectively). (C) Amino acid alignment of HsdM3 against reference strain 86-028NP (wild-type). NP isolates from 60051 and 60373_P1 encode the full-length protein, with minor differences resulting from the variable SSR tract. Indels in the same SSR tract of both BAL isolates, including the two alleles observed in 60373_P1, result in a loss-of-function frameshift that is predicted to yield a truncated protein of 13-24aa.
FIGURE 2Heat map of differentially expressed (DE) genes identified between the NP-BAL pairs from 11 pediatric chronic lung disease patients. In total 134 non-redundant DE genes were detected across eight NP-BAL pairs. DE was observed in eight NP-BAL pairs; the remaining four pairs contained no DE. Genes were predominantly upregulated in the 60295 NP-BAL pair but predominately downregulated in the 60051 and 60068 NP-BAL pairs. The direction of log2 fold change of DE genes in BAL is color-coded as follows: upregulation (blue), downregulation (green), no DE (tan).
FIGURE 3Clusters of Orthologous Group (COG) analysis comparing the frequency of categories in the 86-028NP genome with all DE genes in the 12 NP-BAL pairs. Seven COG categories were significantly enriched in DE genes from BAL-derived isolates, comprising four over-represented categories (Carbohydrate transport and metabolism, Cell motility and secretion, Energy production and conversion and Intracellular trafficking, and secretion) that were majority downregulated and three under-represented categories (Coenzyme metabolism, Function unknown, and Translation, ribosomal structure and biogenesis). ∗∗∗p < 0.001, ∗∗p < 0.01, *p < 0.05.