| Literature DB >> 29426864 |
Carla Camprubí-Font1, Mireia Lopez-Siles1, Meritxell Ferrer-Guixeras1, Laura Niubó-Carulla1, Carles Abellà-Ametller1, Librado Jesús Garcia-Gil1, Margarita Martinez-Medina2.
Abstract
Adherent-invasive Escherichia coli (AIEC) have been involved in Crohn's disease (CD). Currently, AIEC are identified by time-consuming techniques based on in vitro infection of cell lines to determine their ability to adhere to and invade intestinal epithelial cells as well as to survive and replicate within macrophages. Our aim was to find signature sequences that can be used to identify the AIEC pathotype. Comparative genomics was performed between three E. coli strain pairs, each pair comprised one AIEC and one non-AIEC with identical pulsotype, sequence type and virulence gene carriage. Genetic differences were further analysed in 22 AIEC and 28 non-AIEC isolated from CD patients and controls. The strain pairs showed similar genome structures, and no gene was specific to AIEC. Three single nucleotide polymorphisms displayed different nucleotide distributions between AIEC and non-AIEC, and four correlated with increased adhesion and/or invasion indices. Here, we present a classification algorithm based on the identification of three allelic variants that can predict the AIEC phenotype with 84% accuracy. Our study corroborates the absence of an AIEC-specific genetic marker distributed across all AIEC strains. Nonetheless, point mutations putatively involved in the AIEC phenotype can be used for the molecular identification of the AIEC pathotype.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29426864 PMCID: PMC5807354 DOI: 10.1038/s41598-018-20843-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Characteristics of the three sequenced AIEC/non-AIEC strain pairs.
| Strain | Phylogroup | Serotype | STa | Virulence genes | Adhesion indexbφ | Invasion indexc ϕ | Intramacrophage replication index in J774dΦ | Intramacrophage replication index in THP-1d¥ |
|---|---|---|---|---|---|---|---|---|
| AIEC17 | D | ONT:HNT | 569 | 21.6 ± 17.5 | 0.266 ± 0.055 | 1053 ± 75 | 213 ± 60 | |
| ECG28 | 0.6 ± 0.3 | 0.005 ± 0.001 | 774 ± 129 | 228 ± 68 | ||||
| AIEC01 | B2 | O6:H1 | 73 | 15.9 ± 9.3 | 0.284 ± 0.106 | 1567 ± 1060 | 173 ± 99 | |
| ECG11 | 1.1 ± 0.8 | 0.004 ± 0.002 | 716 ± 315 | 74 ± 29 | ||||
| AIEC07 | B1 | O22:H7 | 3232 | 20.0 ± 13.4 | 0.565 ± 0.392 | 1693 ± 297 | 189 ± 71 | |
| ECG04 | 1.8 ± 0.7 | 0.036 ± 0.029 | 527 ± 194 | 77 ± 17 |
aSequence type. bNumber of bacteria per I-407 cell. cPercentage of intracellular bacteria after 1 h gentamicin treatment relative to the inoculum. The percentages of reduction of invasion for AIEC17, AIEC01 and AIEC07 were 99.8%, 99.4% and 99.8%, respectively, in the presence of cytochalasin D and 90.4%, 99% and 95%, respectively, in the presence of colchicine. dPercentage of intracellular bacteria present at 24 h post-infection relative to the number of intracellular bacteria present after 1 h of gentamicin treatment. Results of control strains LF82 and K-12 strains respectively: φ25.66 ± 15.7 and 0.70 ± 0.02. ϕ2.26 ± 1.349 and 0.019 ± 0.020. Φ777 ± 304.8 and 11 ± 5. ¥121 ± 59 and 10 ± 7. Strains with an adhesion index ≥ 1, an invasion index ≥ 0.1% and an intramacrophage replication index ≥ 100% were classified as AIEC.
Figure 1Genome similarities among the six strains and within each pair by analysis of orthologous clusters of genes (OCG) analysis. (a) Edward’s Venn diagram indicating the number of OCG. The shadowed areas correspond to clusters exclusively shared between at least two AIEC strains. (b) Percentage of OCG between phylogenetically distinct strain pairs and among AIEC strains. Percentages are calculated in relation to the number of variable OCG for each strain. Those OCG that are not present in all six strains are considered variable. Other combinations include gene clusters shared by 5 strains or 3 strains from two or three different phylogroups. There were no common OCG among AIEC strains.
Number of SNPs in the sequenced AIEC/non-AIEC strain pairs.
| AIEC17-ECG28 | AIEC01-ECG11 | AIEC07-ECG04 | |
|---|---|---|---|
| Total SNPs | 51 | 126 | 109 |
| Total SNPs in genes | 40 | 91 | 82 |
| Selected SNPs*1 | 20 (7) | 10 (5) | 30 (19) |
| Confirmed SNPs*2 | 4 (4) | 7 (3) | 9 (4) |
| SNPs studied in a strain collection*3 | 4 (4) | 4 (2) | 8 (3) |
*The number of genes in which the SNPs are located is indicated in parenthesis.
1SNPs that conform to the following criteria: (I) cause a non-synonymous amino acid change; (II) are not located at the end of a contig; and (III) were validated bioinformatically by ClustalW.
2Validated by Sanger method.
3Confirmed SNPs that were not strain-specific (and four strain-specific SNPs for validation).
Location of the confirmed SNPs, nucleotide variants and gene functions.
| Gene ID | Strain pair where identified | SNP location in AIEC genome (Contig: position) | Nucleotide variant*1 (AIEC/non-AIEC) | Protein name | Protein family | Gene Ontology |
|---|---|---|---|---|---|---|
| E1-E2_3.4 | AIEC17 vs ECG28 | 8:204059 | C/T | GntR family transcriptional regulator | PF00392; PF07702 | GO:0003677; GO:0003700; GO:0006351 |
| E1-E2_3.6 | 105:325 | C/T | Phage protein | PF06174 | ||
| E1-E2_3.7 | 3:50414 | T/C | Serine peptidase DegQ | PF13365; PF13180; PF00595 | GO:0004252 | |
| E1-E2_5 | 51:69 | G/T | Vitamin B12 transporter BtuB | PF07715 | GO:0009279; GO:0015235; GO:0006811; GO:0046872; GO:0046930; GO:0015288; GO:0004872 | |
| E3-E4_4.3 | AIEC01 vs ECG11 | 3:167, 173, 209 | C/Y, Y/Y, T/K | Putative uncharacterised protein | PF06174 | |
| E3-E4_4.4 | 84:1126 | R/R | dGTPase | PF00350 | GO:0005525 | |
| E3-E4_4.7 | 80:920, 932, 1013 | S/S, S/S, Y/Y | Chemotaxis protein | PF13990 | ||
| E5-E6_3.1 | AIEC07 vs ECG04 | 3:6356 | A/C | FimH | PF00419; PF09160 | GO:0007155; GO:0009289 |
| E5-E6_3.12 | 83:442 | A/G | Succinyl –CoA ligase subunit beta | PF08442 | GO:0005524; GO:0000287; GO:0030145; GO:0004775; GO:0006099 | |
| E5-E6_3.16 = 3.22 | 51:418, 544, 545, 633, 646, 650 | Y/Y, S/S, R/R M/M, Y/Y, S/S | Enterobacterial Ail/Lom family protein | PF06316 | GO:0009279; GO:0016021 | |
| E5-E6_3.17 | 62:583 | R/R | Putative prophage component | PF00877 |
*1A:adenine; C:cytosine; G:guanine; K:guanine or thymine; M:adenine or cytosine; R:adenine or guanine; S:guanine or cytosine; T:thymine; W:adenine or thymine; Y:cytosine or thymine.
Prevalence of genes encompassing SNPs in a collection of AIEC/non-AIEC strains and the frequency of particular nucleotide variants with respect to AIEC phenotype and phylogroup origin of the strains. Only SNPs presenting statistically significant differences regarding pathotype are presented. Values are given in percentages with respect to the total number of AIEC or non-AIEC strains. Statistically significant differences for each variant are presented in bold type. *For phylogroup analysis, the atypical non-AIEC strain was discarded.
| SNP ID | PCR amplification | SNP base vs AIEC phenotype | SNP base vs phylogroup | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| AIEC (n = 22) | non-AIEC (n = 28) | Ntd (N strains) | AIEC | non-AIEC | p-value | A (n = 9) | B1 (n = 7) | B2 (n = 28) | D (n = 5) | p-value | |
| E3-E4_4.3(2) | 100 | 100 | C (33) | 77.78 | 71.43 | 60.71 | 60.00 | 0.667 | |||
| T (9) | 22.22 | 0.00 | 21.43 | 20.00 | |||||||
| Y (8) | 13.64 | 17.86 | 0.00 | 28.57 | 17.86 | 20.00 | |||||
| E3-E4_4.4 | 77.28 | 71.43 | A (5) | 13.64 | 7.14 | 0.00 | 14.29 | 14.29 | 0.00 | 0.636 | |
| G (14) | 33.33 | 28.57 | 32.14 | 0.00 | |||||||
| R (18) | 33.33 | 14.29 | 42.86 | 40.00 | |||||||
| E5-E6_3.16 = 3.22(2) | 90.91 | 89.28 | C (14) | 44.45 | 28.57 | 17.86 | 40.00 | ||||
| G (8) | 22.22 | 0.00 | 14.29 | 40.00 | |||||||
| T (13) | 36.36 | 17.86 | |||||||||
| S (5) | 9.09 | 10.71 | |||||||||
| K (3) | 0.00 | 10.71 | 0.00 | 0.00 | 10.71 | 0.00 | |||||
| Y (2) | 0.00 | 7.14 | 11.11 | 14.29 | 0.00 | 0.00 | |||||
A:adenine; C:cytosine; G:guanine; K:guanine or thymine; M:adenine or cytosine; R:adenine or guanine; S:guanine or cytosine; T:thymine; W:adenine or thymine; Y:cytosine or thymine.
Figure 2Adhesion (a) and invasion (b) abilities of the strains according to specific nucleotide variants of SNPs. Only SNPs associated with significant differences (p < 0.05 using the Mann-Whitney U-test) in the adhesion or invasion abilities of variants are shown. Homogeneous subgroups (p > 0.05) within each panel are indicated by the same superscripts. The median of the data is indicated by the horizontal line in each box, boxes cover the 25% and 75% quantiles, and bars show the 10% and 90% quantiles. Outliers are marked as dots.
Binary logistic regression model for the SNPs associated with the AIEC pathotype. The equation variables, the risk of being AIEC (odds ratio), the p-value of the regression model and the percentage of successfully classified strains are indicated.
| Equation variables | Observed | Predicted | |||||||
|---|---|---|---|---|---|---|---|---|---|
| B | p-value | Odds ratio | 95% CI | Non-AIEC | AIEC | % Correct | Global % | ||
| Not to have G in E3-E4_4.4 | 2.420 | 0.006 | 11.250 | 2.004–63.168 | Non-AIEC | 12 | 8 | 60.0 | 73.0 |
| Constant | −1.792 | 0.019 | 0.167 | AIEC | 2 | 15 | 88.2 | ||
| To have G in E5-E6_3.16 = 3.22(2) | 2.559 | 0.023 | 12.923 | 1.430–116.785 | Non-AIEC | 24 | 1 | 96.0 | 68.9 |
| Constant | −0.613 | 0.075 | 0.542 | AIEC | 13 | 7 | 35.0 | ||
Figure 3Classification algorithm for AIEC identification. Percentages represent the proportion of strains that are correctly predicted as AIEC or non-AIEC based on the result for each SNP combination. The number of total strains corresponding to each condition is indicated. (−): no amplification; other: a nucleotide different from guanine (G) or overlapping peaks.