Literature DB >> 30533737

Draft Genome Sequences for a Diverse Set of Seven Haemophilus and Aggregatibacter Species.

Megan Nichols1, Nadav Topaz2, Xiong Wang1,3, Xin Wang2, Dave Boxrud1.   

Abstract

Haemophilus is a complex genus that includes commensal and pathogenic species that pose a public health threat to humans. While the pathogenic species have been studied extensively, many commensals have limited genomic information available. Here, we present 24 draft genomes for a diverse set of 7 Haemophilus and Aggregatibacter species.

Entities:  

Year:  2018        PMID: 30533737      PMCID: PMC6256573          DOI: 10.1128/MRA.00880-18

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

The genus Haemophilus consists of pleomorphic Gram-negative coccobacilli that all share similar growth requirements for the presence of hemin and/or NAD (1). Haemophilus species are part of the commensal flora in humans and can most commonly be found colonizing the upper respiratory tract, oral cavity, and mucosal membranes (1). The human pathogen belonging to the genus is Haemophilus influenzae, which can cause a variety of conditions, including meningitis, bacteremia, otitis media, sinusitis, and conjunctivitis (2). While the pathogenic species have been studied extensively, other members of the genus have been studied far less extensively, and many have only limited available genomic information. To supplement the existing Haemophilus genomic collection, we present genomic data for 7 Haemophilus and Aggregatibacter species, Aggregatibacter aphrophilus (formerly H. aphrophilus), Aggregatibacter segnis (formerly H. segnis), H. haemolyticus, H. parahaemolyticus, H. parainfluenzae, H. paraphrohaemolyticus, and H. sputorum. Bacteria were isolated from clinical specimens collected in Minnesota from 2000 to 2015, and single colonies were cultivated on chocolate agar for 24 to 48 h at 33 to 37°C and 4 to 6% CO2. Bacterial DNA was extracted using the QIAmp DNA blood minikit on the Qiagen QIAcube following the manufacturer’s guidelines, and DNA concentrations were quantitated using the Qubit double-stranded-DNA (dsDNA) high-sensitivity (HS) assay kit (Thermo Fisher Scientific). Samples were prepared for whole-genome sequencing following the Nextera XT DNA Library preparation protocol and the manufacturer’s (Illumina) guidelines. Bar-coded libraries were then pooled and loaded onto the Illumina MiSeq system using 500-cycle V2 chemistries for multiplexed 250-bp paired-end sequencing. The Illumina reads were then trimmed using Cutadapt 1.8 (3) with default parameters and assembled using SPAdes 3.7.0 (4) with default parameters. Genomes were annotated by NCBI using the Prokaryotic Genome Annotation Pipeline (PGAP) (5). The genus Haemophilus has undergone many revisions over the years; with the addition of new species to the genus (H. pittmaniae and H. sputorum) and reclassification of six former members, including two species covered in this study (A. aphrophilus and A. segnis), the taxonomy of the genus is an ongoing topic of discussion (6). Despite the importance of rapid and accurate species identification in clinical and research settings, correct identification of Haemophilus has been a continuous challenge due to the lack of proper detection methods. Not all species are clearly distinguishable by their biochemical and phenotypic properties alone due to the shared characteristics among members, while the use of molecular methods for identification has also been problematic due to the high rate of recombination and horizontal gene transfer that occurs between the commensals and pathogens (7). Misidentification of the commensals as the pathogenic species is not uncommon and has been reported at a rate as high as 40% in some clinical labs (8). In recent years, whole-genome sequencing (WGS) has alternatively been used to identify unique genomic targets to discriminate between species in other assays and to provide extensive genomic data that can be used for comparative genomic analysis of Haemophilus species (8). The genomic sequences for 7 Haemophilus and Aggregatibacter species in this study will provide data for future studies examining species delineation and unique genomic targets among Haemophilus species.

Data availability.

The draft genome sequences have been deposited in GenBank under the accession numbers listed in Table 1.
TABLE 1

Metadata and GenBank accession numbers for draft genome assemblies reported in this study

Isolate nameGenBank accession no.SpeciesCollection yrIsolation sourceN50 (bp)No. of contigsAvg coverage (×)
C2015005679QEQH00000000H. sputorum2015Blood201,0272643
C2015005473QEQG00000000H. sputorum2015Blood201,0272634
C2014016342QEQF00000000H. paraphrohaemolyticus2014Bronchus246,4692727
C2011020591QEQE00000000H. parainfluenzae2011Sputum254,7032637
C2010039593QEQD00000000H. parahaemolyticus2010Sputum171,4344136
C2010020251QEQC00000000A. aphrophilus2010Hip402,8641838
C2009038101QEQB00000000H. parainfluenzae2009Sputum230,9912545
C2009017515QEQA00000000A. aphrophilus2009Cheek399,9893532
C2008003258QEPZ00000000H. parainfluenzae2008Sputum189,2562143
C2008003249QEPY00000000A. aphrophilus2008Sinus451,1741825
C2008001782QEPX00000000A. aphrophilus2008Bronchial wash347,0441633
C2008001710QEPW00000000H. parainfluenzae2008Sputum112,6255161
C2008001229QEPV00000000A. aphrophilus2008Brain abscess456,6463351
C2008000870QEPU00000000A. aphrophilus2008Blood162,2533334
C2006002596QEPT00000000H. parainfluenzae2006Blood246,2202847
C2006000788QEPS00000000H. parahaemolyticus2006Bronchial wash1,108,1791572
C2005004058QEPR00000000H. parainfluenzae2005Wound517,2561642
C2004002729QEPQ00000000H. parainfluenzae2004Sputum466,1071553
C2004002727QEPP00000000H. parainfluenzae2004Blood526,7372343
C2004000280QEPO00000000H. parainfluenzae2004Toe200,7273264
C2002001239QEPN00000000H. sputorum2002Throat470,9102936
C2001002503QEPM00000000A. segnis2001Sputum341,4481331
C2001002324QEPL00000000H. haemolyticus2001Sputum392,6012027
C2000002669QEPK00000000A. segnis2000Penile lesion153,6662230
Metadata and GenBank accession numbers for draft genome assemblies reported in this study
  5 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Comparative Genomic Analysis of Haemophilus haemolyticus and Nontypeable Haemophilus influenzae and a New Testing Scheme for Their Discrimination.

Authors:  Fang Hu; Lavanya Rishishwar; Ambily Sivadas; Gabriel J Mitchell; I King Jordan; Timothy F Murphy; Janet R Gilsdorf; Leonard W Mayer; Xin Wang
Journal:  J Clin Microbiol       Date:  2016-10-05       Impact factor: 5.948

Review 3.  Classification, identification, and clinical significance of Haemophilus and Aggregatibacter species with host specificity for humans.

Authors:  Niels Nørskov-Lauritsen
Journal:  Clin Microbiol Rev       Date:  2014-04       Impact factor: 26.132

4.  Haemophilus influenzae: using comparative genomics to accurately identify a highly recombinogenic human pathogen.

Authors:  Erin P Price; Derek S Sarovich; Elizabeth Nosworthy; Jemima Beissbarth; Robyn L Marsh; Janessa Pickering; Lea-Ann S Kirkham; Anthony D Keil; Anne B Chang; Heidi C Smith-Vaughan
Journal:  BMC Genomics       Date:  2015-08-27       Impact factor: 3.969

5.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.