| Literature DB >> 31100356 |
Valentina Galata1, Cédric C Laczny1, Christina Backes1, Georg Hemmrich-Stanisak2, Susanne Schmolke3, Andre Franke2, Eckart Meese4, Mathias Herrmann5, Lutz von Müller5, Achim Plum6, Rolf Müller7, Cord Stähler1, Andreas E Posch8, Andreas Keller9.
Abstract
Emerging antibiotic resistance is a major global health threat. The analysis of nucleic acid sequences linked to susceptibility phenotypes facilitates the study of genetic antibiotic resistance determinants to inform molecular diagnostics and drug development. We collected genetic data (11,087 newly-sequenced whole genomes) and culture-based resistance profiles (10,991 out of the 11,087 isolates comprehensively tested against 22 antibiotics in total) of clinical isolates including 18 main species spanning a time period of 30 years. Species and drug specific resistance patterns were observed including increased resistance rates for Acinetobacter baumannii to carbapenems and for Escherichia coli to fluoroquinolones. Species-level pan-genomes were constructed to reflect the genetic repertoire of the respective species, including conserved essential genes and known resistance factors. Integrating phenotypes and genotypes through species-level pan-genomes allowed to infer gene-drug resistance associations using statistical testing. The isolate collection and the analysis results have been integrated into GEAR-base, a resource available for academic research use free of charge at https://gear-base.com.Entities:
Keywords: Antibiotic resistance; Bacteria; Pan-genome; Whole-genome sequencing
Mesh:
Year: 2019 PMID: 31100356 PMCID: PMC6624217 DOI: 10.1016/j.gpb.2018.11.002
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Figure 1GEAR-base workflow and structure
Schematic overview of data collection, processing and integration into GEAR-base.
Figure 2Overview of resistance profiles
Heatmaps of log-transformed (base 2) median MIC values (A) and resistance rates (B) for all species with at least 50 isolates. Drugs labels were grouped relative to their class. The cells are coded in color gradient from blue to red with blue for lower values and red for higher values. White color in panel B corresponds to the cases where no breakpoints are available from the used guidelines. MIC, minimum inhibitory concentration.
Figure 3Assembly quality overview
Assembly summary statistics for the 11,062 isolates with a de novo assembly. The isolates were grouped by their species taxon, and isolates not belonging to any of the main 18 species used for pan-genome construction were grouped into ”Other”. The box plots show the GC content (A), mean assembly coverage (B), number of contigs (C), L50 value (D), and N50 value (E) for contigs of at least 200 bp. The assembly quality cut off values are illustrated by dotted lines (1000 for the number of contigs; 200 for L50; and 5000 bp for N50). The plot area satisfying the respective filtering criterion is colored in green. Percentages of isolates passing the respective criterion as well as all criteria are shown to the right.
Figure 4Centroid frequency
Number of centroids in each pan-genome of the 18 main species in relation to their frequency. The first column contains centroids that are present in <10% of the isolates, and the last one contains centroids that are present in ≥90% of the isolates. Cells are coded in color gradient to indicate the log10-transformed number of centroids. The bar plot on the right shows the number of isolates used to construct the respective pan-genomes.
Figure 5Number of significant results of the resistance association analysis
Significant results (adjusted P < 1E−5) of the resistance association analysis based on the presence/absence of centroids. The heatmap shows the number of significant results (in color gradient with lighter blue for smaller numbers and darker blue for larger numbers) per taxon and drug. Drugs are sorted according to their class.