Literature DB >> 26586809

probeBase--an online resource for rRNA-targeted oligonucleotide probes and primers: new features 2016.

Daniel Greuter1, Alexander Loy2, Matthias Horn3, Thomas Rattei1.   

Abstract

probeBase http://www.probebase.net is a manually maintained and curated database of rRNA-targeted oligonucleotide probes and primers. Contextual information and multiple options for evaluating in silico hybridization performance against the most recent rRNA sequence databases are provided for each oligonucleotide entry, which makes probeBase an important and frequently used resource for microbiology research and diagnostics. Here we present a major update of probeBase, which was last featured in the NAR Database Issue 2007. This update describes a complete remodeling of the database architecture and environment to accommodate computationally efficient access. Improved search functions, sequence match tools and data output now extend the opportunities for finding suitable hierarchical probe sets that target an organism or taxon at different taxonomic levels. To facilitate the identification of complementary probe sets for organisms represented by short rRNA sequence reads generated by amplicon sequencing or metagenomic analysis with next generation sequencing technologies such as Illumina and IonTorrent, we introduce a novel tool that recovers surrogate near full-length rRNA sequences for short query sequences and finds matching oligonucleotides in probeBase.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26586809      PMCID: PMC4702872          DOI: 10.1093/nar/gkv1232

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Our understanding of the diversity and role of microorganisms on our planet is to a great extent based on exploiting the ribosomal RNA as phylogenetic marker molecule in diagnostic molecular biology and microscopy assays. While rRNA-targeted oligonucleotides have been applied in different sorts of diagnostic formats such as DNA microarrays (PhyloChips) (1,2) and denaturing gradient gel electrophoresis (3), they are now most widely used for amplicon sequencing and fluorescence in situ hybridization (FISH) (4). Today, highly multiplexed amplicon sequencing with rRNA-targeted primers enables surveying microbial diversity across numerous samples (5,6), which provides unprecedented insights into the spatial distribution and temporal dynamics of the diverse microbial communities that thrive in the environment (7) or are associated with eukaryotic hosts (8,9). Furthermore, FISH with rRNA-targeted probes and quantitative microscopy is a standard tool for revealing the identity, abundance and spatial localization of microbial cells in complex samples. More than two decades of development of FISH probes and techniques for microbial diagnostics have established a variety of methods, such as DOPE-FISH (10), CARD-FISH (11), CLASI-FISH (12) and HCR-FISH (13), and a wealth of tested probes that target diverse phylogenetic and/or taxonomic groups of microorganisms. probeBase was originally established in 2002 (14) to provide a common, freely accessible repository for rRNA-targeted oligonucleotide sequences, including contextual information and multiple options for testing in silico specificity and coverage (15) against up-to-date rRNA sequence databases such as RDP-II (16) and SILVA (17). To date (September 2015), probeBase contains 2788 probes, 175 domain-specific PCR primers (18) and 16 microarrays from 499 publications and is an online resource that is frequently used by the scientific community (180 000 average page views per year). Finding appropriate oligonucleotides with a suite of ‘Search’ and sequence ‘Match’ tools provides convenient access to the information in the database. Probes, primers, microarray layouts or references can further be retrieved through the ‘Lists’ service, including dynamic lists of all probes, all primers, all references or oligonucleotides that target microorganisms from specific environments (e.g. intestinal microbiota) or with specific functions (e.g. sulfate-reducing microorganisms). This update describes recent improvements and new features added since the last update in 2007 (19), including (i) extended ‘Search’ and ‘Match’ options, (ii) a new ‘Proxy’ tool that finds probe sets for short query sequences based on corresponding near full-length rRNA sequences and (iii) suggestion of taxonomically informed hierarchical probe sets for applications using multiple probes such as multi-color FISH and DNA microarrays.

NEW DATABASE BACKEND, SEARCH ENVIRONMENT AND WEBSITE FOR probeBase

The probeBase database has been moved to a new, more scalable database backend. This dramatically reduces the retrieval times when the database is queried and it can also handle a much larger number of requests simultaneously. A new database scheme was developed, which links probeBase entries with the NCBI taxonomy database (20). Database procedures were implemented to retrieve the taxonomic lineages (up and down) of each specificity term instantly from the NCBI taxonomy database. Thereby all changes in the reference taxonomy will be automatically adopted by probeBase. The continuously growing number of sequences in probeBase and in rRNA sequence databases (16,17) made it necessary to refine the search environment of probeBase. The core of the new search and match tools are sequence indexes based on enhanced suffix array data structures. These suffix arrays allow very short retrieval times by rapid exact string matching. They are used by VMATCH (21) for probe/primer search (‘Search’) and sequence match (‘Match’), and by LAST (22) for the proxy sequence match (‘Proxy’) functionality (see below for a description of the ‘Search’, ‘Match’ and ‘Proxy’ tools). The exact string matching in VMATCH is not aware of DNA ambiguity characters, such as R, Y, W, etc. probeBase therefore refines the alignments calculated by VMATCH for the sequence match (‘Match’) function and considers such ambiguity positions to determine the correct number of mismatches also for these positions. The probeBase web page has been moved to a new content management system to facilitate maintenance and more rapid adaptations of the web page. Another advantage of the content management system is the responsible layout, which considers the size of the browsing device. Hence, the page will be optimized for smaller displays if users access probeBase via their mobile phone or tablet computer. Result lists are now fully sortable by just clicking on the header of the respective column. In addition, longer tables are being split into multi-page tables to give the user a convenient overview even if a certain database query returns many results. This feature is particularly important due to the increased number of supported sequences per user query. The multi-page table views are accompanied by an export function. Users are able to export results in .xls and .tsv format, which allows performing further analysis in any other suitable software, such as Excel or OpenOffice.

A TARGET TAXONOMY FOR EACH OLIGONUCLEOTIDE

Detailed information is provided for each oligonucleotide (14,19), including its specificity, which indicates the intended target organism(s) of the respective probe/primer as described in the original publication or during user submissions to probeBase. Based on the information in the specificity field, we have automatically mapped each oligonucleotide to the NCBI taxonomy (20). Where necessary, assigned taxonomic names (i.e. NCBI taxonomy IDs) were manually corrected and curated to contain one taxonomic assignment per oligonucleotide. For probes targeting multiple taxa (e.g. two different species), we chose either the taxon that is predominantly covered by the probe or the next higher taxonomic rank that included all taxa (e.g. the genus, family, order, phylum or domain). Probe entries for which the specificity field did not contain any (e.g. ‘clone XYZ’) or only limited taxonomic information (e.g. ‘deltaproteobacterial symbiont of…’) were assigned to the root in the NCBI taxonomy or to the lowest meaningful taxonomic rank. The new taxonomy field in the probe details view shows the entire taxonomic hierarchy—from the assigned taxon to its highest taxonomic rank. It is noteworthy that this taxonomic assignment does not necessarily mean that an oligonucleotide is highly specific for a given taxon. Instead, it represents a systematic classification for all oligonucleotides in probeBase and allows for more advanced searches.

SEARCH probeBase

The ‘Search’ tool comprises multiple options for finding appropriate oligonucleotides and further information (Figure 1). Oligonucleotides can be recovered by the name of the target organism or taxon, by their specific target sites on the rRNA molecule, by the reference that originally described the oligonucleotide or simply by the name or sequence of the oligonucleotide itself. The output list of oligonucleotides that matched the search criteria can be restricted to primers, probes used successfully for FISH, and probes used on microarrays.
Figure 1.

Structure and main tools of the probeBase database.

Structure and main tools of the probeBase database. Because each probe is now assigned to a taxon in the NCBI taxonomy (20), a search for a specific target organism or taxon systematically returns all available oligonucleotides. The search target organism option is facilitated by an auto-complete function for taxonomic names to minimize the probability of typing errors. The search target organism option can be further adjusted by including oligonucleotides that target higher and/or lower taxonomic ranks than the query taxon. The corresponding output list of probes that target the query organism at different taxonomic levels helps researchers in identifying sets of hierarchically nested probes for application in multiple-probe hybridization formats such as multi-color FISH (10,12) and DNA microarrays (23). The search for a probe/primer sequence not only yields perfectly complementary hits, but also oligonucleotides in probeBase with up to two mismatches to the query oligonucleotide. This allows, for example, to identify if a newly designed rRNA-targeted oligonucleotide and/or closely related variants of it have already been developed, tested for FISH and published before.

MATCH rRNA QUERY SEQUENCES AGAINST OLIGONUCLEOTIDES IN probeBase

The original sequence match tool was developed to find all oligonucleotides in probeBase that perfectly match to up to 150 query rRNA sequences (14). However, due to the increased database size this tool was not usable and thus disabled for the past years. We have reimplemented and redesigned the sequence match tool (Figure 1), which is now based on a similarity search and able to process up to 1000 query sequences. Users can paste a (multi) fasta file into the text field, upload a (multi) fasta file or combine both options to query probeBase for complementary probes or primers with up to two mismatches. Results are either grouped by oligonucleotide, showing all matching sequences from the query per oligonucleotide, or by query sequence, showing all matching oligonucleotides per query sequence. A typical application of this tool is to quickly retrieve a set of available FISH probes that target rRNA sequences determined in an environmental microbial diversity survey without the need for extensive comparative sequence analysis. This probe set can then be readily applied for FISH to determine the abundance and spatial organization of the target organism in the sample (24).

FIND FULL-LENGTH PROXY SEQUENCES FOR SHORT rRNA READS AND MATCH AGAINST OLIGONUCLEOTIDES IN probeBase

Traditional surveys of microbial diversity by PCR amplification of near full-length rRNA gene sequences from environmental DNA, cloning and Sanger sequencing of clone inserts have been almost completely replaced by highly parallel next generation sequencing of rRNA gene amplicons because of higher sample throughput and sequencing depth (i.e. number of sequences per sample). However, common technologies for multiplexed amplicon sequencing, such as Illumina MiSeq, produce only short (paired-end) reads that are typically less than 500 nucleotides in length. The short rRNA reads limit the selection of complementary oligonucleotides (e.g. by the ‘Match’ tool) for follow-up hybridization applications (25). Here, we provide a new ‘Proxy’ tool that finds corresponding near full-length rRNA sequences for short query sequences (Figure 1). These long proxy sequences are retrieved from the small or large subunit rRNA reference database of SILVA (17) and matched against the oligonucleotides in probeBase analogous to the ‘Match’ tool. The output includes the identified proxy sequences for each short query sequence and shows the probes or primers that have up to two mismatches to the proxy sequences.

SUBMISSION OF MISSING OR NEWLY DEVELOPED OLIGONUCLEOTIDES

New or missing oligonucleotides can be submitted using an online form. The reference details (e.g. journal, authors, title, abstract, year) will now be automatically filled in by entering the PubMed-ID (PMID) (26) of a publication that contains new or missing probes/primers.

AVAILABILITY

probeBase is maintained by the Department of Microbiology and Ecosystem Science, University of Vienna, Wien, Austria and available at http://www.probebase.net. We welcome comments concerning probeBase and highly appreciate reports of bugs, errors or missing probes. You may contact us by email to probebase@microbial-ecology.net.
  24 in total

1.  probeBase: an online resource for rRNA-targeted oligonucleotide probes.

Authors:  Alexander Loy; Matthias Horn; Michael Wagner
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

2.  Incomplete recovery and individualized responses of the human distal gut microbiota to repeated antibiotic perturbation.

Authors:  Les Dethlefsen; David A Relman
Journal:  Proc Natl Acad Sci U S A       Date:  2010-09-16       Impact factor: 11.205

3.  16S rRNA gene-based phylogenetic microarray for simultaneous identification of members of the genus Burkholderia.

Authors:  Susan Schönmann; Alexander Loy; Céline Wimmersberger; Jens Sobek; Catharine Aquino; Peter Vandamme; Beat Frey; Hubert Rehrauer; Leo Eberl
Journal:  Environ Microbiol       Date:  2009-04       Impact factor: 5.491

4.  Global patterns in the biogeography of bacterial taxa.

Authors:  Diana R Nemergut; Elizabeth K Costello; Micah Hamady; Catherine Lozupone; Lin Jiang; Steven K Schmidt; Noah Fierer; Alan R Townsend; Cory C Cleveland; Lee Stanish; Rob Knight
Journal:  Environ Microbiol       Date:  2011-01       Impact factor: 5.491

5.  Phylotype-level 16S rRNA analysis reveals new bacterial indicators of health state in acute murine colitis.

Authors:  David Berry; Clarissa Schwab; Gabriel Milinovich; Jochen Reichert; Karim Ben Mahfoudh; Thomas Decker; Marion Engel; Brigitte Hai; Eva Hainzl; Susanne Heider; Lukas Kenner; Mathias Müller; Isabella Rauch; Birgit Strobl; Michael Wagner; Christa Schleper; Tim Urich; Alexander Loy
Journal:  ISME J       Date:  2012-05-10       Impact factor: 10.302

6.  Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies.

Authors:  Anna Klindworth; Elmar Pruesse; Timmy Schweer; Jörg Peplies; Christian Quast; Matthias Horn; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2012-08-28       Impact factor: 16.971

7.  Type material in the NCBI Taxonomy Database.

Authors:  Scott Federhen
Journal:  Nucleic Acids Res       Date:  2014-11-14       Impact factor: 19.160

8.  probeBase--an online resource for rRNA-targeted oligonucleotide probes: new features 2007.

Authors:  Alexander Loy; Frank Maixner; Michael Wagner; Matthias Horn
Journal:  Nucleic Acids Res       Date:  2006-11-11       Impact factor: 16.971

9.  The SILVA ribosomal RNA gene database project: improved data processing and web-based tools.

Authors:  Christian Quast; Elmar Pruesse; Pelin Yilmaz; Jan Gerken; Timmy Schweer; Pablo Yarza; Jörg Peplies; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2012-11-28       Impact factor: 16.971

10.  A flexible and economical barcoding approach for highly multiplexed amplicon sequencing of diverse target genes.

Authors:  Craig W Herbold; Claus Pelikan; Orest Kuzyk; Bela Hausmann; Roey Angel; David Berry; Alexander Loy
Journal:  Front Microbiol       Date:  2015-07-16       Impact factor: 5.640

View more
  36 in total

1.  Phylogenetic and Structural Identification of a Novel Magnetotactic Deltaproteobacteria Strain, WYHR-1, from a Freshwater Lake.

Authors:  Jinhua Li; Heng Zhang; Peiyu Liu; Nicolas Menguy; Andrew P Roberts; Haitao Chen; Yinzhao Wang; Yongxin Pan
Journal:  Appl Environ Microbiol       Date:  2019-07-01       Impact factor: 4.792

2.  Visualization of Microbiota in Tick Guts by Whole-mount In Situ Hybridization.

Authors:  Caitlin E Moss; Andrew Robson; Erol Fikrig; Sukanya Narasimhan
Journal:  J Vis Exp       Date:  2018-06-01       Impact factor: 1.355

3.  Bioremoval of priority polycyclic aromatic hydrocarbons by a microbial community with high sorption ability.

Authors:  Sandra Sanches; Mónica Martins; Ana F Silva; Claudia F Galinha; Maria A Santos; Inês A C Pereira; Maria Teresa Barreto Crespo
Journal:  Environ Sci Pollut Res Int       Date:  2016-11-23       Impact factor: 4.223

4.  Quality Control in Diagnostic Fluorescence In Situ Hybridization (FISH) in Microbiology.

Authors:  Judith Kikhney; Annette Moter
Journal:  Methods Mol Biol       Date:  2021

5.  A collection of bacterial isolates from the pig intestine reveals functional and taxonomic diversity.

Authors:  David Wylensek; Thomas C A Hitch; Thomas Riedel; Afrizal Afrizal; Neeraj Kumar; Esther Wortmann; Tianzhe Liu; Saravanan Devendran; Till R Lesker; Sara B Hernández; Viktoria Heine; Eva M Buhl; Paul M D'Agostino; Fabio Cumbo; Thomas Fischöder; Marzena Wyschkon; Torey Looft; Valeria R Parreira; Birte Abt; Heidi L Doden; Lindsey Ly; João M P Alves; Markus Reichlin; Krzysztof Flisikowski; Laura Navarro Suarez; Anthony P Neumann; Garret Suen; Tomas de Wouters; Sascha Rohn; Ilias Lagkouvardos; Emma Allen-Vercoe; Cathrin Spröer; Boyke Bunk; Anja J Taverne-Thiele; Marcel Giesbers; Jerry M Wells; Klaus Neuhaus; Angelika Schnieke; Felipe Cava; Nicola Segata; Lothar Elling; Till Strowig; Jason M Ridlon; Tobias A M Gulder; Jörg Overmann; Thomas Clavel
Journal:  Nat Commun       Date:  2020-12-15       Impact factor: 14.919

6.  Rapid antibiotic susceptibility testing and species identification for mixed samples.

Authors:  Vinodh Kandavalli; Praneeth Karempudi; Jimmy Larsson; Johan Elf
Journal:  Nat Commun       Date:  2022-10-20       Impact factor: 17.694

7.  "Candidatus Gortzia shahrazadis", a Novel Endosymbiont of Paramecium multimicronucleatum and a Revision of the Biogeographical Distribution of Holospora-Like Bacteria.

Authors:  Valentina Serra; Sergei I Fokin; Michele Castelli; Charan K Basuri; Venkatamahesh Nitla; Franco Verni; Bhagavatula V Sandeep; Chaganti Kalavati; Giulio Petroni
Journal:  Front Microbiol       Date:  2016-11-04       Impact factor: 5.640

8.  Rare Freshwater Ciliate Paramecium chlorelligerum Kahl, 1935 and Its Macronuclear Symbiotic Bacterium "Candidatus Holospora parva".

Authors:  Olivia Lanzoni; Sergei I Fokin; Natalia Lebedeva; Alexandra Migunova; Giulio Petroni; Alexey Potekhin
Journal:  PLoS One       Date:  2016-12-16       Impact factor: 3.240

9.  Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments.

Authors:  Elizabeth Trembath-Reichert; David H Case; Victoria J Orphan
Journal:  PeerJ       Date:  2016-04-18       Impact factor: 2.984

10.  The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection.

Authors:  Daniel J Rigden; Xosé M Fernández-Suárez; Michael Y Galperin
Journal:  Nucleic Acids Res       Date:  2016-01-04       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.