| Literature DB >> 35510268 |
Peter Kotsoana Montso1,2, Victor Mlambo3, Collins Njie Ateba1,2.
Abstract
Atypical enteropathogenic E. coli belonging to the serotype O177 is a rare strain found in ruminants, especially cattle. When compared to shiga toxin producing E. coli (STEC) O157 and non-O157 STEC (O26, O45, O103, O104, O111, O121, and O145) serotypes, the antimicrobial resistance, virulence factors, and genomic structure of E. coli O177 are poorly understood. Therefore, in this article, we present the whole genome sequence data of two aEPEC E. coli O177 isolates (E. coli O177_CF-154-A and E. coli O177_CF-335-B) generated using Illumina MiSeq platform. The raw data were generated, cleaned, and assembled using Trimmomatic and SPAdes. Genome data analysis yielded 5,112,402 and 5,460,435 bp, comprising contigs 101 and 191 with GC contents of 50.7% and 50.5% for E. coli O177_CF-154-A and E. coli O177_CF-335-B, respectively. Prokaryotic Genome Annotation Pipeline (PGAP) and Rapid Annotation using Subsystem Technology (RAST) showed that the complete genome of E. coli O177_CF-154-A contained 5040 coding sequences (CDS), 5146 genes, 4896 proteins, 90 RNAs, and 78 tRNA while that of E. coli O177_CF-335-B contained 5463 CDS, 5570 genes, 5230 proteins, 92 RNAs, and 80 tRNA for. A total of 426 and 425 subsystem features with 5190 and 5662 CDS were obtained for E. coli O177_CF-154-A and E. coli O177_CF-335-B, respectively. Several genes encoding virulence and antimicrobial resistance were identified in both genomes. Complete genome sequence data of both isolates have been deposited in the National Center for Biotechnology Information (NCBI), GenBank: accession numbers, VMKH00000000 (E. coli O177_CF-154-A) and VMKG00000000 (E. coli O177_CF-335-B). This data can be used as a reference for determining the virulence and antimicrobial resistance in E. coli O177 isolates from different sample sources.Entities:
Keywords: Escherichia coli O177; Genome annotation; Genomic data; Virulence and Antimicrobial resistance genes; Whole genome sequence
Year: 2022 PMID: 35510268 PMCID: PMC9058948 DOI: 10.1016/j.dib.2022.108167
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig 1An illustration of the North West province map. https://municipalities.co.za/provinces/view/8/north-west.
Features of draft genomes of two E. coli O177 isolates obtained from cattle faeces.
| Sample ID | ||
|---|---|---|
| Features | ||
| Genome size | 5,112,402 bp | 5,460,435 bp |
| Genome coverage depth | 124.7x | 162.128x |
| Total length | 5111092 bp | 5459908 bp |
| GC content (%) | 50.7 | 50.5 |
| Number of contigs | 101 | 191 |
| Contigs N50 | 127249 | 113919 |
| Contigs L50 | 14 | 15 |
| Number of Scaffold | 101 | - |
| Scaffold N50 | 130301 | - |
| Scaffold L50 | 13 | - |
| Coding genes | 4896 | 5230 |
| Total genes | 5146 | 5570 |
| Total CDSs | 5040 | 5463 |
| Total proteins | 4896 | 5230 |
| rRNA | 8, 4, 6 (5S, 16S, 23S) | 7, 4, 6 (5S, 16S, 23S) |
| tRNA | 78 | 80 |
| ncRNA | 10 | 10 |
| CRISPR Arrays | 2 | 2 |
Fig. 2Frequency distribution of gene categories in genomes of two E. coli O177isolates obtained from cattle faeces.
Fig. 3The circular genome map of E. coli O177 isolates (CF-154-A and CF-335-B) obtained from cattle faeces. Circle displays from inside to outside: GC Skew (light orange), GC content (light purple), Drug Tagets (black), Transporters (blue), Virulence factor genes (yellow), Antimicrobial resistance genes (red), Non CDS features (turquoise blue), CDS reverse strand (light purple) and CDS forward strand (green).
Fig. 7Resistance mechanisms in two E. coli O177 isolates from cattle faeces
| Subject | Microbiology |
| Specific subject area | Molecular Microbiology and Bioinformatics |
| Type of data | TableFiguresExcel Sheets |
| How the data were acquired | Whole genome sequence was performed using Illumina MiSeq platform. The FASTQ files were obtained and imported into Kbase platform ( |
| Data format | Raw, filtered and analysed. |
| Description of data collection | Genomic DNA was extracted from two aEPEC O177 isolates (CF-154-A and CF-334-B) obtained from the Department of Microbiology, at NWU. The gDNA was sequenced using Illumina MiSeq platform. After sequencing FASTQ files were obtained. Raw reads were cleaned and assembled into contigs using FASTQC (v.0.11.5) SPAdes (v3.13.0), respectively. The genome annotation was carried out using PGAP, v.2.0 and RAST (v.2.0). The genome maps were drafted using PATRIC (v.3.6.2). |
| Data source location | • Institution: North-West University• City/Town/Region: North-West Province• Country |
| Data accessibility | Repository name: National Center for Biotechnology Information (NCBI), GenBank, and figshare.Data identification numbers: VMKH00000000 ( |
| Related research article | P. K. Montso, C. C. Bezuidenhout, C. Mienie, Y. M. Somorin, O. A. Odeyemi, V. Mlambo, C. N. Ateba |