| Literature DB >> 31886370 |
Maria Siniagina1, Maria Markelova1, Alexander Laikov1, Eugenia Boulygina1, Dilyara Khusnutdinova1, Anastasia Kharchenko1, Albina Misbakhova2, Tatiana Grigoryeva1.
Abstract
Dysbiosis of the gut microbiota in inflammatory bowel disease (IBD) patients is of great interest. It has been reported that Crohn's disease (CD) is associated with a general decrease in microbial diversity [1]. Altered microbial composition and function in CD results in imbalance in host-bacteria interaction and increased immune stimulation [2]. It is shown that microbiota in CD is characterized by increased proportion of E. coli in human gut in contrast to healthy individuals [3]. However, the overall qualitative and quantitative diversity of E. coli strains in CD is not fully understood. Here, we present a dataset of whole-genome sequences of E. coli's.Entities:
Keywords: Crohn's disease; Escherichia coli; Human gut microbiota; Whole-genome sequencing
Year: 2019 PMID: 31886370 PMCID: PMC6923290 DOI: 10.1016/j.dib.2019.104948
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Phylogenetic tree of E. coli strains from CD patients (red), healthy individuals (black) and reference genomes (blue) based on accessory gene content in genome assemblies.
Fig. 2Phylogenetic tree of E. coli strains from CD patients (red), healthy individuals (black) and reference genomes (blue) based on core genes in genome assemblies.
Fig. 3Distribution of virulence and pathogenicity genes of E. coli from CD patients (red), healthy individuals (black) and reference strains LF82 and Nissle 1917 (blue). Genes present or absent in all analyzed strains are not displayed. Gene with differential distribution in strains from patients with ileitis vs patients with colitis and ileocolitis is marked with asterisk (*).
Specifications Table
| Subject | Immunology and microbiology |
| Specific subject area | Microbiology |
| Type of data | Whole-genome sequencing data, table, figure |
| How data were acquired | Whole-genome sequencing on Illumina MiSeq platform. Bioinformatics approaches: genome assembler SPAdes v.3.11.1, rapid prokaryotic genome annotation Prokka v.1.12, pan genome Roary pipeline v.3.12.0, FastTree v.2.1.11 tool, SerotypeFinder-2.0 tool. |
| Data format | Raw, analyzed, deposited data |
| Parameters for data collection | Whole genomes of |
| Description of data collection | Dataset covers 64 samples ( |
| Data source location | Kazan Federal University, Kazan, Russian Federation |
| Data accessibility | The whole genome sequencing data have been deposited to NCBI BioProject with the dataset identifier PRJNA560176 |
| Related research article | Miquel S., Peyretaillade E., Claret L., De Vallée A., Dossat C., Vacherie B., Zineb E., Segurens B., Barbe V., Sauvanet P., Neut C., Colombel, J., Medigue C., Mojica F., Peyret P., Bonnet R., Darfeuille-Michaud A. Complete genome sequence of Crohn's disease-associated adherent-invasive |
The sequence data will be useful for comparative genomic and transcriptomic studies of The complete genome sequences of |