Literature DB >> 35799858

Genome sequencing data of extended-spectrum beta-lactamase-producing Escherichia coli INF191/17/A isolates of nosocomial infection.

Nik Siti Hanifah Nik Ahmad1, Khor Bee Yin2, Nik Yusnoraini Yusof3.   

Abstract

The infection with extended-spectrum beta-lactamase-producing Escherichia coli is associated with higher mortality, longer length of hospital-stay and increased costs compared to infection with antibiotic-susceptible E. coli. Here, the draft genome of ESBL-producing E. coli circulating at local hospital is reported. The strain was detected as containing the genes of antibiotic resistance TEM, CTX-M-1, and CTX-M-9. The 5,136,548-bp genome, with a GC content of 50.59%, comprised 4987 protein-coding genes, four ribosomal RNA, and 66 transfer RNA. The ResFinder was successfully predicted fourteen antimicrobial genes in the E. coli INF191/17/A genome. Sequence data has been deposited in the GenBank database under the accession number JAIEXV000000000. The BioProject ID in the GenBank database is PRJNA752944. The raw data was sequenced using Ilumina MiSeq and submitted to the NCBI SRA database (SRX11797310), which is publicly available.
© 2022 The Author(s). Published by Elsevier Inc.

Entities:  

Keywords:  Antimicrobial resistant gene; Escherichia coli; Extended-spectrum beta-lactamase; Genome sequencing

Year:  2022        PMID: 35799858      PMCID: PMC9253457          DOI: 10.1016/j.dib.2022.108407

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

The whole genome sequencing data provides insight into genomic determinants of the ESBL-producing E. coli strains INF191/17/A and antimicrobial resistance (AMR) genes. This data should be used by researchers and public health officers to keep up surveillance and control of ESBL-producing gram negative organisms in order to prevent the emergence of highly resistant strain, which is one of serious problem in the world. The genome data of E. coli strain INF191/17/A accelerates knowledge for pathogenic microbial research in the context of comparative studies, pan-genome, and evolution of non-ESBL and ESBL strains within different epidemiology. Furthermore, prior to biomarker discovery, drug or vaccine development, the comprehensive understanding of the whole genome of this pathogen is critically important.

Data Description

The Escherichia coli INF191/17/A was discovered as an extended-spectrum beta-lactamase (ESBL) strain carrying the antibiotic resistance genes TEM, CTX-M-1, and CTX-M-9 via polymerase chain reaction using ESBL specific primers [1]. The 251 base-pair paired-end (2 × 251 bp) sequencing raw reads of the E. coli strain INF191/17/A genome were obtained from the Illumina MiSeq system (Illumina, CA, USA) [2]. The raw reads were pre-processed before the genome assembly and annotation. Antimicrobial resistant genes were predicted using curated public database. Genomic DNA was extracted from E. coli strain INF191/17/A and sequenced to generate a total of 1,368,224 reads in a 500-cycle run. The total reads from a paired-end dataset (191-17-A_R1.fastq and 191-17-A_R2.fastq) have resulted in 329,238,355 total bases (Table 1). The pre-processed of raw reads including trimming adapter sequences, low-quality and short reads, resulting 46.9% of clean readings. De novo assembly of the clean reads was performed and generated 314 contigs with a total size of 5.12 Mbp. Scaffolding resulted in 74 scaffolds with the longest scaffold is 2,520,446 and N50 scaffold length of 1,733,129 bases (Table 2). The average coverage of assembled sequence is 66x with 50.59% of G+C content. Using PGAP, a total of 4987 coding sequences (CDS), four ribosomal RNA, and 66 transfer RNA (Table 3) were predicted. Furthermore, ResFinder predicted that E. coli INF191/17/A will develop fourteen antibiotic resistance genes (Table 4).
Table 1

Statistics of the raw and clean reads data including forward (191-17-A_R1.fastq) and reverse (191-17-A_R2.fastq) reads.

191-17-AR1R2Total
Total Raw Reads684,112684,1121,368,224
Total Raw Reads Bases164,465,730164,772,625329,238,355
Total Clean Reads320,871320,871641,742
Total Clean Reads Bases54,470,38340,781,24895,251,631
Clean Reads (%)46.9046.9046.90
Table 2

The statistics of the assembled draft genome of E. coli strain INF191/17/A.

AttributesValue
Number of scaffolds74
Total size of scaffolds5,136,548
Longest scaffold2,520,446
Shortest scaffold204
Number of scaffolds > 1 K nt51 (68.9%)
Number of scaffolds > 10 K nt21 (28.4%)
Number of scaffolds > 100 K nt3 (4.1%)
Number of scaffolds > 1 M nt2 (2.7%)
Number of scaffolds > 10 M nt0 (0.0%)
Mean scaffold size69,413
Median scaffold size2736
N50 scaffold length1,733,129
L50 scaffold count2
Table 3

The annotation of draft genome of E. coli INF191/17/A.

AttributesValue
Total number of genes5062
Number of coding sequences4987
Number of genes (coding)4736
Total number of RNAs75
Number of rRNAs4
Number of tRNAs66
Number of ncRNAs5
Number of pseudogenes251
Table 4

Antimicrobial resistance genes and their corresponding antibiotics detected in the E. coli INF191/17/A.

AMR geneDescriptionResistance
mdf(A)Multidrug transporter MdfAFluoroquinolone, Aminoglycoside, Tetracycline, Macrolide, Rifamycin, Phenicol
aph(3′')-IbAminoglycoside resistance protein BStreptomycin
aac(3)-IIdAminoglycoside-(3)-N-acetyl-transferase (aacC2) geneApramycin, Gentamicin, Tobramycin, Dibekacin, Netilmicin, Sisomicin
aph(6)-IdInosamine-phosphate amidinotransferaseStreptomycin
aadA5Streptomycin and spectinomycin resistance aminoglycoside adenyltransferaseSpectinomycin, Streptomycin
tet(A)Trimethoprim resistant dihydrofolate reductaseDoxycycline, Tetracycline
mph(A)Macrolide 2′-phosphotransferase IErythromycin, Azithromycin, Spiramycin, Telithromycin
sitABCDPeriplasmic binding protein (sitA), ATP-binding component (sitB), inner membrane component (sitC), inner membrane component (sitD)Hydrogen peroxide
blaTEM-1BBet-lactamase TEM-1Amoxicillin, Ampicillin, Cephalothin, Piperacillin, Ticarcillin
blaCTX-M-27Beta-lactamase CTX-M-27Amoxicillin, Ampicillin, Aztreonam, Cefepime, Cefotaxime, Ceftazidime, Ceftriaxone, Piperacillin, Ticarcillin
sul2Dihydropteroate synthase type-2Sulfamethoxazole
sul1Dihydropteroate synthase type-1Sulfamethoxazole
dfrA17Dihydrofolate reductaseTrimethoprim
qacEQuaternary ammonium compound-resistance protein QacEBenzylkonium chloride, Ethidium bromide, Chlorhexidine, Cetylpyridinium chloride
Statistics of the raw and clean reads data including forward (191-17-A_R1.fastq) and reverse (191-17-A_R2.fastq) reads. The statistics of the assembled draft genome of E. coli strain INF191/17/A. The annotation of draft genome of E. coli INF191/17/A. Antimicrobial resistance genes and their corresponding antibiotics detected in the E. coli INF191/17/A.

Experimental Design, Materials and Methods

Sample Collection and Isolation of ESBL E. coli Strain INF191/17/A

E. coli strain INF191/17/A was isolated from a 45-year-old male patient who was suffering from a high fever at a local hospital. In brief, the sample was cultured in the Bactec 9240 blood culture system (Becton, Dickinson, USA) before proceeding with the biochemical testing and gram staining [3]. The ESBL screening and disk confirmation tests were measured according to Clinical and Laboratory Standards Institute (CLSI) [4]. The 16S rRNA sequences for this strain were validated using specific primers of E. coli [5]. Then, the PCR was conducted using ESBL-primers for the confirmation of ESBL-type [1].

DNA Isolation, Genome Sequencing, Assembly, and Annotation

Genomic DNA was isolated using NucleoSpin tissue DNA, RNA, and protein purification kit according to manufacturer's instructions (Macherey-Nagel). The purified DNA was processed using Nextera XT DNA library preparation kit following the manufacturer's instructions (Illumina, USA). A whole-genome sequence was performed using the Miseq platform (Illumina, USA) (2 × 251 bp). The adapter trimming, quality trimming, contaminant filtering and read length filtering were performed using BBDuk (BBTools version 36) (http://jgi.doe.gov/data-and-tools/bbtools/). The low-quality bases (6] to obtain contigs. These assembled contigs were subjected to scaffolding against the closest reference genomes [3] to produce a draft genome using Medusa (Multi-Draft based Scaffolder) software [7]. The genome was annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v4.10 [8].

Antimicrobial Resistant Genes Analysis

ResFinder (v4.1) [9] was used to screen for antimicrobial resistance genes. The assembled genome was searched against the curated Escherichia coli database using the default parameters. The prediction of the genes was confirmed if the assembled sequence had at least 95% nucleotide matching identity and 80% coverage with candidate genes in the database.

Ethics Statement

The study protocol was approved by the ethics committee of the Universiti Sains Malaysia (USM/JEPeM/20030152).

CRediT authorship contribution statement

Nik Siti Hanifah Nik Ahmad: Software, Formal analysis, Writing – review & editing, Funding acquisition. Khor Bee Yin: Conceptualization, Software, Formal analysis, Data curation, Writing – original draft. Nik Yusnoraini Yusof: Conceptualization, Software, Methodology, Resources, Writing – review & editing, Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.
SubjectHealth and medical sciences
Specific subject areaMicrobiology and genomics.Genome sequencing of pathogenic bacteria by using next generation sequencing approach.
Type of dataTableSequencing raw reads in FASTQ format text fileAssembled draft genome of E. coli strain INF191/17/A in FASTA format text fileGenome sequence data in FASTA and FASTQ format
How data were acquiredThe Illumina MiSeq platform was used to generate paired-end reads of extended spectrum beta lactamase (ESBL)-producing E. coli strain INF191/17/A genome.
Data formatRaw data in FASTQ formatAssembled data in FASTA format: GenBank assembly accession: GCA_019599325.1 (https://www.ncbi.nlm.nih.gov/assembly/ GCA_019599325.1).
Parameters for data collectionBacterial genomic DNA was extracted from a pure culture of ESBL-producing E. coli INF191/17/A . Nextera XT DNA library preparation kit was used for the whole-genome sequencing library preparation to generate 2 × 251 paired end reads data.
Description of data collectionWhole genome sequencing was performed using Illumina MiSeq system (Illumina®,USA). BBDuk (BBTools v36) was used to trim raw reads, and SPAdes v3.9.0 was used to assemble clean reads. Genome scaffolding was performed with Medusa v1.6. ResFinder software predicted the putative antimicrobial resistant genes.
Data source locationInstitution: Institute for Research in Molecular Medicine (INFORMM)City/Town/Region: Kubang Kerian, KelantanCountry: MalaysiaLatitude and longitude for collected samples/data: 6.10 N 102.28 E
Data accessibilityThe data is hosted on a public repository.Bioproject: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA752944Biosample: https://www.ncbi.nlm.nih.gov/biosample/SAMN20668118NCBI GenBank Accession Number: JAIEXV000000000https://www.ncbi.nlm.nih.gov/nuccore/JAIEXV000000000Repository name: NCBI SRA databaseData identification number: SRR15497613Direct URL to data: https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR15497613
  8 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  MeDuSa: a multi-draft based scaffolder.

Authors:  Emanuele Bosi; Beatrice Donati; Marco Galardini; Sara Brunetti; Marie-France Sagot; Pietro Lió; Pierluigi Crescenzi; Renato Fani; Marco Fondi
Journal:  Bioinformatics       Date:  2015-03-25       Impact factor: 6.937

3.  RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation.

Authors:  Wenjun Li; Kathleen R O'Neill; Daniel H Haft; Michael DiCuccio; Vyacheslav Chetvernin; Azat Badretdin; George Coulouris; Farideh Chitsaz; Myra K Derbyshire; A Scott Durkin; Noreen R Gonzales; Marc Gwadz; Christopher J Lanczycki; James S Song; Narmada Thanki; Jiyao Wang; Roxanne A Yamashita; Mingzhang Yang; Chanjuan Zheng; Aron Marchler-Bauer; Françoise Thibaud-Nissen
Journal:  Nucleic Acids Res       Date:  2020-12-03       Impact factor: 16.971

4.  Extended-spectrum β-lactamase producing E. coli in urinary tract infections: A two-center, cross-sectional study of prevalence, genotypes and risk factors in Amman, Jordan.

Authors:  Sayida A Al-Jamei; Abla Y Albsoul; Faris G Bakri; Amal G Al-Bakri
Journal:  J Infect Public Health       Date:  2018-08-23       Impact factor: 3.718

5.  The occurrence of ESBL-producing Escherichia coli carrying aminoglycoside resistance genes in urinary tract infections in Saudi Arabia.

Authors:  Essam J Alyamani; Anamil M Khiyami; Rayan Y Booq; Majed A Majrashi; Fayez S Bahwerth; Elena Rechkina
Journal:  Ann Clin Microbiol Antimicrob       Date:  2017-01-06       Impact factor: 3.944

6.  Comparative analysis of 7 short-read sequencing platforms using the Korean Reference Genome: MGI and Illumina sequencing benchmark for whole-genome sequencing.

Authors:  Hak-Min Kim; Sungwon Jeon; Oksung Chung; Je Hoon Jun; Hui-Su Kim; Asta Blazyte; Hwang-Yeol Lee; Youngseok Yu; Yun Sung Cho; Dan M Bolser; Jong Bhak
Journal:  Gigascience       Date:  2021-03-12       Impact factor: 6.524

7.  ResFinder 4.0 for predictions of phenotypes from genotypes.

Authors:  Valeria Bortolaia; Rolf S Kaas; Etienne Ruppe; Marilyn C Roberts; Stefan Schwarz; Vincent Cattoir; Alain Philippon; Rosa L Allesoe; Ana Rita Rebelo; Alfred Ferrer Florensa; Linda Fagelhauer; Trinad Chakraborty; Bernd Neumann; Guido Werner; Jennifer K Bender; Kerstin Stingl; Minh Nguyen; Jasmine Coppens; Basil Britto Xavier; Surbhi Malhotra-Kumar; Henrik Westh; Mette Pinholt; Muna F Anjum; Nicholas A Duggett; Isabelle Kempf; Suvi Nykäsenoja; Satu Olkkola; Kinga Wieczorek; Ana Amaro; Lurdes Clemente; Joël Mossong; Serge Losch; Catherine Ragimbeau; Ole Lund; Frank M Aarestrup
Journal:  J Antimicrob Chemother       Date:  2020-12-01       Impact factor: 5.790

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.