| Literature DB >> 28724357 |
Daria V Rakitina1, Alexander I Manolov2, Alexandra V Kanygina3, Sofya K Garushyants4,5, Julia P Baikova2, Dmitry G Alexeev2,3, Valentina G Ladygina2, Elena S Kostryukova2, Andrei K Larin2, Tatiana A Semashko2, Irina Y Karpova2, Vladislav V Babenko2, Ruzilya K Ismagilova4,6, Sergei Y Malanin4,6, Mikhail S Gelfand4,5,7,8, Elena N Ilina2, Roman B Gorodnichev2, Eugenia S Lisitsyna2, Gennady I Aleshkin9, Petr L Scherbakov10, Igor L Khalif11, Marina V Shapina11, Igor V Maev12, Dmitry N Andreev12, Vadim M Govorun2,3.
Abstract
BACKGROUND: Escherichia coli (E. coli) has been increasingly implicated in the pathogenesis of Crohn's disease (CD). The phylogeny of E. coli isolated from Crohn's disease patients (CDEC) was controversial, and while genotyping results suggested heterogeneity, the sequenced strains of E. coli from CD patients were closely related.Entities:
Keywords: Crohn’s disease; E. coli; Genome; Propanediol
Mesh:
Substances:
Year: 2017 PMID: 28724357 PMCID: PMC5517970 DOI: 10.1186/s12864-017-3917-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Samples and patients
| patient № | sex | age | disease | isolate name and origin | number of isolates | clinic | assembly | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| localisation | clinical activity | endoscopic activity | biopsy | aspirate from ileum lumen | feces | ||||||
| 1 | M | 33 | ileitis | low | 10 | RCE01–01 RCE01–02 RCE01–03 (ileum) | RCE01–04 RCE01–05 | RCE01–06 | 6 | C | RCE01 |
| 2 | M | 23 | ileocolitis | low | 13 | RCE02–01 RCE02–02 RCE02–03 | 3 | C | RCE02 | ||
| 3 | F | 37 | ileocolitis | medium | 14 | RCE03–01 RCE03–02 RCE03–03 (ileum, caecum, sigmoid) | 3 | C | RCE03 | ||
| 4 | F | 40 | ileocolitis-perianal | high | 0 | RCE04–01 (caecum) | RCE04–02 RCE04–03 RCE04–04 RCE04–05 RCE04–06 | 6 | C | RCE04 | |
| 5 | M | 32 | ileitis-jejunitis | high | 9 | RCE05 | 1 | C | RCE05 | ||
| 6 | M | 47 | ileocolitis | low | 15 | RCE06–01 RCE06–02 RCE06–03 RCE06–04 RCE06–05 | 5 | C | RCE06 | ||
| 7 | M | 32 | ileocolitis | remission | 3 | RCE07 | 1 | C | RCE07 | ||
| 8 | F | 29 | ileocolitis-perianal | low | 6 | RCE08 | 1 | S | RCE08 | ||
| 9 | M | 25 | colitis | low | 5 | RCE10 | 1 | S | RCE10 | ||
| 10 | M | 29 | ileocolitis | high | 8 | RCE11 | 1 | C | RCE11 | ||
| Total isolates number | 28 | ||||||||||
C - Central Scientific Institute of Gastroenterology, Moscow
S - State Scientific Center of Coloproctology, Moscow
Fig. 1Genomic similarity of E. coli from individual colonies. Heatmap colors represent the number of SNPs per nucleotide (all-vs-all method). Lighter colors mean higher sequence similarity
Fig. 2Phylogenetic analysis of E. coli strains. The phylogenetic tree of all universal single-copy genes was constructed by the maximum-likelihood algorithm with 100 bootstrap replicates for E. coli isolates from ten patients (this study), 32 E. coli and Shigella strains from [42] from phylogroups A (yellow), B1(light green), B2 (green), D(cyan), E (blue), S (violet), previously published [27–30] CD-associated strains (red), and uropathogenic strain JJ1886. Escherichia albertii KF1 and Escherichia fergusonii ECD227 were used as outgroups
Fig. 3Genomic comparison of 14 CD-associated strains with pathogenic and non-pathogenic E. coli genomes. Maximum likelihood unrooted tree is based on the core genes. Strains from this study are colored black (here and on the images below indicated as Crohn); previously published CDEC (here and on the images below indicated as CrohnLit) are grey; nonpathogenic, green; pathogenic, pink; strains of undetermined pathogenicity, white. Derivatives of one laboratory strain are merged. Strains containing a plasmid homologous to plLF82 and pJJ1886 are indicated. Pdu operons from LF82 are marked with red
Fig. 4Full-length alignment of plasmids shared by CDEC strains from the present study: E. coli LF82 plasmid (a), and JJ1886 plasmid 4 (b). The first row in each case represents the plasmid map; other rows show homologous regions and rearrangments (MAUVE 2.4.0, default parameters) between the plasmid of interest and meta-assemblies for specific patients. Each homologous region is shown by a specific color
Fig. 5Multidimensional scaling plot of distances between the PFAM-domain content in CDEC, pathogenic, non pathogenic, and commensal E. coli. The colors of strains are as in Fig. 3
Fig. 6Functions of overrepresented OGs (Fisher’s test p-value <0.05 prior to Holm’s correction). The number of overrepresented OGs with a given function is shown on the horizontal axis for commensal strains (grey, left panel) and CDEC (yellow, right panel)
Fig. 7Operons and gene groups enriched in CDEC (yellow) and commensal E. coli (grey) (Fisher test p-value <0.05 prior to Holm’s correction). OGs form horisontal rows, strains – vertical columns. Genomes with similar OG patterns were clustered together using a custom R script (see Methods). that can be achieved at GitHub repository https://github.com/paraslonic/Rakitina_etal_Crohn_paper/tree/master/ogEnrichment [41]. Phylogroups of strains are indicated (A, B2, B1, D)
Fig. 8Schematic representation of the propanediol and galactitol operons in CDEC genomes. For each operon the reference strain and the percent of genomes containing it is indicated (CDEC vs commensal)