Comprehensive and relaxed search for oligonucleotide signatures in hierarchically clustered sequence datasets.

Kai Christian Bader, Christian Grothoff, Harald Meier.

Abstract

MOTIVATION: PCR, hybridization, DNA sequencing and other important methods in molecular diagnostics rely on both sequence-specific and sequence group-specific oligonucleotide primers and probes. Their design depends on the identification of oligonucleotide signatures in whole genome or marker gene sequences. Although genome and gene databases are generally available and regularly updated, collections of valuable signatures are rare. Even for single requests, the search for signatures becomes computationally expensive when working with large collections of target (and non-target) sequences. Moreover, with growing dataset sizes, the chance of finding exact group-matching signatures decreases, necessitating the application of relaxed search methods. The resultant substantial increase in complexity is exacerbated by the dearth of algorithms able to solve these problems efficiently.
RESULTS: We have developed CaSSiS, a fast and scalable method for computing comprehensive collections of sequence- and sequence group-specific oligonucleotide signatures from large sets of hierarchically clustered nucleic acid sequence data. Based on the ARB Positional Tree (PT-)Server and a newly developed BGRT data structure, CaSSiS not only determines sequence-specific signatures and perfect group-covering signatures for every node within the cluster (i.e. target groups), but also signatures with maximal group coverage (sensitivity) within a user-defined range of non-target hits (specificity) for groups lacking a perfect common signature. An upper limit of tolerated mismatches within the target group, as well as the minimum number of mismatches with non-target sequences, can be predefined. Test runs with one of the largest phylogenetic gene sequence datasets available indicate good runtime and memory performance, and in silico spot tests have shown the usefulness of the resulting signature sequences as blueprints for group-specific oligonucleotide probes.
AVAILABILITY: Software and Supplementary Material are available at http://cassis.in.tum.de/.
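
The relaxed search described in RESULTS (maximal group coverage within a user-defined limit on non-target hits) can be illustrated with a small brute-force sketch. This is not the CaSSiS implementation: it does not use the ARB PT-Server or the BGRT data structure, it considers exact matches only (no mismatch tolerance), and the function names `kmer_index` and `best_signatures`, the parameter `max_outgroup_hits`, and the toy sequences are all assumptions made for illustration.

```python
from collections import defaultdict

def kmer_index(sequences, k):
    """Map each k-mer to the set of sequence IDs that contain it exactly."""
    index = defaultdict(set)
    for seq_id, seq in sequences.items():
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(seq_id)
    return index

def best_signatures(index, target_ids, max_outgroup_hits=0):
    """Return (coverage, signatures) for a target group.

    coverage is the largest number of target sequences matched by any
    k-mer that hits at most max_outgroup_hits non-target sequences;
    signatures lists every k-mer reaching that coverage, sorted.
    """
    target = set(target_ids)
    best_cov, best = 0, []
    for kmer, hits in index.items():
        ingroup = len(hits & target)    # sensitivity: target sequences matched
        outgroup = len(hits - target)   # specificity: non-target sequences matched
        if ingroup == 0 or outgroup > max_outgroup_hits:
            continue
        if ingroup > best_cov:
            best_cov, best = ingroup, [kmer]
        elif ingroup == best_cov:
            best.append(kmer)
    return best_cov, sorted(best)

# Hypothetical toy data: two sequences in group "a", one in group "b".
sequences = {
    "a1": "ACGTACGT",
    "a2": "ACGTTTGA",
    "b1": "TTGACCCA",
}
index = kmer_index(sequences, k=4)

# Perfect group-covering signature for group "a" with no non-target hits.
strict = best_signatures(index, {"a1", "a2"}, max_outgroup_hits=0)

# Signatures for the single sequence "a1", tolerating one non-target hit.
relaxed = best_signatures(index, {"a1"}, max_outgroup_hits=1)
```

In a hierarchically clustered dataset, the same query would be repeated for the leaf set under every node of the tree; avoiding that repeated work on large collections is the kind of cost the abstract attributes to dedicated data structures.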

Year:  2011        PMID: 21471017     DOI: 10.1093/bioinformatics/btr161

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Neptune: a bioinformatics tool for rapid discovery of genomic variation in bacterial populations.

Authors:  Eric Marinier; Rahat Zaheer; Chrystal Berry; Kelly A Weedmark; Michael Domaratzki; Philip Mabon; Natalie C Knox; Aleisha R Reimer; Morag R Graham; Linda Chui; Laura Patterson-Fortin; Jian Zhang; Franco Pagotto; Jeff Farber; Jim Mahony; Karine Seyer; Sadjia Bekal; Cécile Tremblay; Judy Isaac-Renton; Natalie Prystajecky; Jessica Chen; Peter Slade; Gary Van Domselaar
Journal:  Nucleic Acids Res       Date:  2017-10-13       Impact factor: 16.971

2.  Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences.

Authors:  Paul M Ruegger; Gianluca Della Vedova; Tao Jiang; James Borneman
Journal:  BMC Bioinformatics       Date:  2011-10-10       Impact factor: 3.169

3.  A robust PCR primer design platform applied to the detection of Acidobacteria Group 1 in soil.

Authors:  Jason D Gans; John Dunbar; Stephanie A Eichorst; La Verne Gallegos-Graves; Murray Wolinsky; Cheryl R Kuske
Journal:  Nucleic Acids Res       Date:  2012-03-20       Impact factor: 16.971

4.  PRISE2: software for designing sequence-selective PCR primers and probes.

Authors:  Yu-Ting Huang; Jiue-in Yang; Marek Chrobak; James Borneman
Journal:  BMC Bioinformatics       Date:  2014-09-25       Impact factor: 3.169

5.  Cluster oligonucleotide signatures for rapid identification by sequencing.

Authors:  Manuel Zahariev; Wen Chen; Cobus M Visagie; C André Lévesque
Journal:  BMC Bioinformatics       Date:  2018-10-29       Impact factor: 3.169

6.  PhylOPDb: a 16S rRNA oligonucleotide probe database for prokaryotic identification.

Authors:  Faouzi Jaziri; Nicolas Parisot; Anis Abid; Jérémie Denonfoux; Céline Ribière; Cyrielle Gasc; Delphine Boucher; Jean-François Brugère; Antoine Mahul; David R C Hill; Eric Peyretaillade; Pierre Peyret
Journal:  Database (Oxford)       Date:  2014-04-26       Impact factor: 3.451

7.  An algorithm of discovering signatures from DNA databases on a computer cluster.

Authors:  Hsiao Ping Lee; Tzu-Fang Sheu
Journal:  BMC Bioinformatics       Date:  2014-10-05       Impact factor: 3.169

8.  HTSFinder: Powerful Pipeline of DNA Signature Discovery by Parallel and Distributed Computing.

Authors:  Ramin Karimi; Andras Hajdu
Journal:  Evol Bioinform Online       Date:  2016-02-10       Impact factor: 1.625
