| Literature DB >> 28232822 |
Alex Orlek1, Nicole Stoesser2, Muna F Anjum3, Michel Doumith4, Matthew J Ellington5, Tim Peto1, Derrick Crook1, Neil Woodford5, A Sarah Walker1, Hang Phan1, Anna E Sheppard1.
Abstract
Plasmids are extra-chromosomal genetic elements ubiquitous in bacteria, and commonly transmissible between host cells. Their genomes include variable repertoires of 'accessory genes,' such as antibiotic resistance genes, as well as 'backbone' loci which are largely conserved within plasmid families, and often involved in key plasmid-specific functions (e.g., replication, stable inheritance, mobility). Classifying plasmids into different types according to their phylogenetic relatedness provides insight into the epidemiology of plasmid-mediated antibiotic resistance. Current typing schemes exploit backbone loci associated with replication (replicon typing), or plasmid mobility (MOB typing). Conventional PCR-based methods for plasmid typing remain widely used. With the emergence of whole-genome sequencing (WGS), large datasets can be analyzed using in silico plasmid typing methods. However, short reads from popular high-throughput sequencers can be challenging to assemble, so complete plasmid sequences may not be accurately reconstructed. Therefore, localizing resistance genes to specific plasmids may be difficult, limiting epidemiological insight. Long-read sequencing will become increasingly popular as costs decline, especially when resolving accurate plasmid structures is the primary goal. This review discusses the application of plasmid classification in WGS-based studies of antibiotic resistance epidemiology; novel in silico plasmid analysis tools are highlighted. Due to the diverse and plastic nature of plasmid genomes, current typing schemes do not classify all plasmids, and identifying conserved, phylogenetically concordant genes for subtyping and phylogenetics is challenging. Analyzing plasmids as nodes in a network that represents gene-sharing relationships between plasmids provides a complementary way to assess plasmid diversity, and allows inferences about horizontal gene transfer to be made.Entities:
Keywords: antibiotic resistance; genomic epidemiology; network analysis; plasmid typing; whole-genome sequencing
Year: 2017 PMID: 28232822 PMCID: PMC5299020 DOI: 10.3389/fmicb.2017.00182
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary of plasmid typing and subtyping schemes.
| Plasmid typing schemes | Comments | |
|---|---|---|
| Replicon typing | Inc grouping | Plasmids with similar replication machinery are often unable to stably co-exist within the same host cell ( |
| Replicon probe hybridization | ||
| PCR-based replicon typing (PBRT) | PBRT for plasmids of the well-studied Enterobacteriaceae family currently detects 28 replicons (based on various genetic loci including | |
| Replicon subtyping | Allelic profiles are assessed at 2–6 core loci (depending on the specific scheme). Plasmids are assigned a pMLST subtype nesting within the broader replicon type. pMLST schemes are available for six common replicon types of Enterobacteriaceae plasmids (IncF, HI1, HI2, I1, N, A/C). PCR-based and | |
| Replicon and pMLST allele databases can be downloaded for local use, but user-friendly web-tools (PlasmidFinder/pMLST for replicon typing/subtyping) can run the analysis pipeline, including read assembly. The PlasmidFinder replicon database currently contains 121 reference replicons for Enterobacteriaceae plasmids; a dataset of replicons for gram-positive plasmids based on the scheme devised by | ||
| MOB typing | PCR-based MOB typing | PCR-based ‘degenerate primer MOB typing’ (DPMT) is used to type γ-Proteobacterial plasmids; 19 degenerate primer pairs target relaxase sequences to partition plasmids into five of the main MOB types identified by |
| Six N-terminal relaxase sequences are used as PSI-BLAST probes to detect relaxase sequences of transmissible plasmids, and partition plasmids into six possible MOB types ( | ||
| Other locus-targeting schemes | Other locus-based methods for plasmid typing and subtyping exist, but tend to be applicable to a more restricted set of plasmids ( | |
| Plasmid ‘fingerprinting’ (RFLP typing) | Restriction fragment length polymorphism (RFLP) is sometimes used to subtype plasmids, especially when pMLST is unavailable. However, band patterns can be difficult to interpret, and do not provide a reliable phylogenetic marker ( | |
Summary of common in silico tools used for plasmid analysis.
| Goal | Tool(s); reference(s) | Comments |
|---|---|---|
| Detect loci of interest from reads | SRST2 ( | Reads are mapped to a reference database using bowtie2 ( |
| Detect resistance genes from k-mers | KmerResistance ( | Identifies resistance genes from WGS data by examining co-occurrence of k-mers (DNA substrings of length k) between the query WGS data and a reference database of resistance genes. |
| Comparative plasmid genomics | ACT; BRIG ( | Tools such as ACT and BRIG can be used to order contigs against a reference plasmid using BLAST, allowing homologies and gene content similarity to be visualized. |
| Detect replicon type/subtype from contigs | PlasmidFinder; pMLST ( | See |
| Detect resistance genes from contigs | ResFinder ( | Contigs are BLAST searched against a database of horizontally acquired resistance genes; resistance-conferring mutations are not accounted for. |
| CARD ( | Contigs are BLAST searched against the CARD database; resistance genes are associated with an ontology allowing resistance gene metadata to be retrieved. CARD also provides the Resistance Gene Identifier tool for resistance prediction. | |
| ARG-ANNOT ( | BLAST-based tool for detection of resistance genes and resistance mutations. | |
| Localize specific genes of interest from a contig assembly | Bandage ( | Assembly graph visualization and annotation tool (can be used for manual repeat resolution). |
| ISMapper ( | Mapping-based tool which uses paired-end sequencing data to localize insertion sequences. Can be used for localizing a particular resistance locus, given a known association with a specific insertion sequence. | |
| Distinguish plasmid from chromosomal sequences | cBar ( | Plasmid and chromosomal sequences are distinguished based on pentamer frequencies. |
| Other tools | Tools such as plasmidSPAdes and PlasmidFinder may also be used to distinguish plasmid and chromosomal sequences ( | |
| Resolve plasmid structures from ambiguous assembly graphs | PLACNET ( | An input assembly graph is reconfigured according to the homology of contigs to reference sequences; the assembly graph can be visualized to allow manual pruning and correction. |
| Recycler ( | Cycles in an assembly graph are identified and sequentially extracted from the graph, favoring cycles with minimal coverage variation across constituent contigs. Assuming different genetic units have distinct copy numbers, retrieved cycles should represent individual circular elements (plasmids, circular phages). Information from paired-end reads is used to exclude cycles that do not correspond to a single circular element, but arise from repeat elements shared across different molecules. | |
| plasmidSPAdes ( | Median coverage of longer contigs is calculated to estimate chromosomal coverage; this estimate is used as a basis for filtering putative chromosomal contigs from the assembly graph. Connected components within the filtered graph are reported as putative plasmids. This approach assumes that chromosomal contig coverage differs from plasmid contig coverage. |