| Literature DB >> 35146093 |
Jobin John Jacob1, Tharani Priya T1, Dhanalakshmi Solaimalai1, Yesudoss M1, Jansi Rani Malaiyappan1, Tanya Rachel1, Aravind V1, Monisha Priya T1, Praveena Jeslin1, Biju George2, Balaji Veeraraghavan1.
Abstract
We report here the draft genome sequence of two rare Salmonella serotypes, isolated from human faecal samples in India. The isolates were identified as Salmonella enterica subsp. enterica serovar Ceyco and serovar Hillegersberg by Wole genome sequencing (WGS) based serotype prediction. The genomic similarity of study isolates was identified by clustering with the global collection of Salmonella sp. available in EnteroBase and SISTR based on their cgMLST profile. Phylogenetic analysis showed the study isolates were closer to S. Detmold and other unknown serovars from serogroup D2 . The information generated from genome sequencing of two rare S. enterica serovar will improve the overall understanding of the epidemiology of this clinically relevant pathogen.Entities:
Keywords: Diarrhea; Genomics; India; Non-Typhoidal Salmonella; Whole genome sequencing
Year: 2022 PMID: 35146093 PMCID: PMC8816656 DOI: 10.1016/j.dib.2022.107875
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
General genome characteristics of S. Ceyco strain FC2085 and S. Hillegersberg strain FC2223.
| Features | FC2085 | FC2223 |
|---|---|---|
| Status | Draft | Draft |
| NCBI BioSample no. | SAMN17348777 | SAMN21988699 |
| SRA accession no. | SRR16229856 | SRR16474915 |
| GenBank accession no. | GCA_016745495.1 | GCA_020551995.1 |
| No. of contigs | 37 | 177 |
| Total length (bp) | 4,691,294 | 4,744,996 |
| Total No. of CDS | 4,408 | 4,586 |
| N50 (bp) | 417,690 | 51,438 |
| GC content (%) | 51.8 | 52 |
| Coverage (×) | 71.0x | 23.0x |
| No. of reads | 778972 | 270818 |
| Serogroup | D2 | D2 |
| Serotype | Ceyco | Hillegersberg |
| AMR genes | ||
| SPI | SPI-1, 2, 3, 9 & C63PI | SPI-1, 2, 3 & C63PI |
| CRISPR Arrays | 2 | 3 |
Fig. 1Genetic distance-based tree generated by Mash tree of Salmonella genomes grouped into the same clusters by both GrapeTree MLST and SISTR cgMLST analysis. The phylogenetic tree was annotated tree with Interactive Tree Of Life. Serogroups and MLST are displayed with colour strips.
| Subject | Biological Sciences |
| Specific subject area | Clinical Microbiology; Bacterial genomics |
| Type of data | Genome sequence data, table, figure |
| How the data were acquired | Whole genome sequencing: Illumina iSeq-100 |
| Data format | Raw paired-end Illumina sequences assembled and annotated |
| Description of data collection | The clinical isolates were cultured on Xylose Lysine Deoxycholate (XLD) agar and the Genomic DNA was isolated using Wizard DNA purification kit for Whole genome sequencing. The Raw reads were assembled using SPAdes and annotated by NCBI Prokaryotic Genome Annotation Pipeline (PGAP v. 4.1) |
| Data source location | |
| Data accessibility | The Raw reads and assembled genome sequences are deposited in GenBank under the BioProject number PRJNA692535 and PRJNA767943. |