
Unbiased probabilistic taxonomic classification for DNA barcoding.

Panu Somervuo, Sonja Koskela, Juho Pennanen, R Henrik Nilsson, Otso Ovaskainen.

Abstract

MOTIVATION: When targeted to a barcoding region, high-throughput sequencing can be used to identify species or operational taxonomic units from environmental samples, and thus to study the diversity and structure of species communities. Although many methods provide confidence scores for assigning taxonomic affiliations, it is not straightforward to translate these values into unbiased probabilities. We present a probabilistic method for taxonomical classification (PROTAX) of DNA sequences. Given a pre-defined taxonomical tree structure that is partially populated by reference sequences, PROTAX decomposes a total probability of one over the set of all possible outcomes. PROTAX accounts for species that are present in the taxonomy but that do not have reference sequences, for the possibility of unknown taxonomical units, and for mislabeled reference sequences. PROTAX is based on a statistical multinomial regression model, and it can use any kind of sequence similarity measure or the outputs of other classifiers as predictors.
RESULTS: We demonstrate the performance of PROTAX by using as predictors the output from BLAST, the phylogenetic classification software TIPP, and the RDP classifier. We show that PROTAX improves the predictions of the baseline implementations of the TIPP and RDP classifiers, and that it is able to combine complementary information provided by BLAST and TIPP, resulting in accurate and unbiased classifications even in very challenging cases, such as 50% mislabeling of reference sequences.
AVAILABILITY AND IMPLEMENTATION: Perl/R implementation of PROTAX is available at http://www.helsinki.fi/science/metapop/Software.htm
CONTACT: panu.somervuo@helsinki.fi
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
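
The decomposition described in the abstract can be illustrated with a minimal sketch. This is not the authors' Perl/R implementation. It assumes a toy two-level taxonomy, a single predictor per taxon (the best sequence similarity to any reference sequence under it), and hand-picked coefficients; the names `BETA`, `UNKNOWN_SCORE`, `decompose`, and the example taxa are all hypothetical. At each node, a multinomial (softmax) model splits the probability reaching that node among its child taxa and an "unknown" outcome, so the probabilities of all outcomes sum to one.

```python
import math

# Hypothetical coefficients; a real model would estimate these from training data.
BETA = {"intercept": -4.0, "similarity": 8.0, "no_reference": -2.0}
UNKNOWN_SCORE = 0.0  # the "unknown taxon" outcome serves as the baseline


def softmax(scores):
    """Convert real-valued scores into probabilities that sum to one."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def best_similarity(node):
    """Highest similarity to any reference sequence under a node, or None."""
    if not isinstance(node, dict):
        return node  # leaf: a similarity value, or None if no reference exists
    values = [best_similarity(child) for child in node.values()]
    values = [v for v in values if v is not None]
    return max(values) if values else None


def score(node):
    """Linear predictor for one taxon; taxa without references get a fixed score."""
    sim = best_similarity(node)
    if sim is None:
        return BETA["no_reference"]
    return BETA["intercept"] + BETA["similarity"] * sim


def decompose(node, prob=1.0, path=()):
    """Split `prob` among the children of `node` plus an 'unknown' outcome."""
    names = list(node)
    probs = softmax([score(node[n]) for n in names] + [UNKNOWN_SCORE])
    outcomes = {path + ("unknown",): prob * probs[-1]}
    for name, p in zip(names, probs):
        child = node[name]
        if isinstance(child, dict):
            outcomes.update(decompose(child, prob * p, path + (name,)))
        else:
            outcomes[path + (name,)] = prob * p
    return outcomes


# Toy taxonomy: leaf values are similarities of the query to a species'
# reference sequences; None marks a species with no reference sequence.
taxonomy = {
    "GenusA": {"species_a1": 0.98, "species_a2": None},
    "GenusB": {"species_b1": 0.80},
}

outcomes = decompose(taxonomy)
for path, p in sorted(outcomes.items(), key=lambda kv: -kv[1]):
    print("/".join(path), round(p, 3))
```

In this sketch a species without reference sequences and the "unknown" outcome at each level both receive non-zero probability, which is the property the abstract emphasizes. Mislabeled reference sequences and multiple predictors are not modeled here.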

Year:  2016        PMID: 27296980     DOI: 10.1093/bioinformatics/btw346

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  18 in total

1.  A comparison of eDNA to camera trapping for assessment of terrestrial mammal diversity.

Authors:  Kevin Leempoel; Trevor Hebert; Elizabeth A Hadly
Journal:  Proc Biol Sci       Date:  2020-01-15       Impact factor: 5.349

Review 2.  Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

Authors:  Bilal Wajid; Faria Anwar; Imran Wajid; Haseeb Nisar; Sharoze Meraj; Ali Zafar; Mustafa Kamal Al-Shawaqfeh; Ali Riza Ekti; Asia Khatoon; Jan S Suchodolski
Journal:  Funct Integr Genomics       Date:  2021-10-18       Impact factor: 3.410

3.  Performance of DNA metabarcoding, standard barcoding, and morphological approach in the identification of host-parasitoid interactions.

Authors:  Martin Šigut; Martin Kostovčík; Hana Šigutová; Jiří Hulcr; Pavel Drozd; Jan Hrček
Journal:  PLoS One       Date:  2017-12-13       Impact factor: 3.240

4.  Accuracy of taxonomy prediction for 16S rRNA and fungal ITS sequences.

Authors:  Robert C Edgar
Journal:  PeerJ       Date:  2018-04-18       Impact factor: 2.984

5.  Expanding and testing fluorescent amplified fragment length polymorphisms for identifying roots of boreal forest plant species.

Authors:  Paul Metzler; Marc La Flèche; Justine Karst
Journal:  Appl Plant Sci       Date:  2019-04-08       Impact factor: 1.936

6.  An efficient and robust laboratory workflow and tetrapod database for larger scale environmental DNA studies.

Authors:  Jan Axtner; Alex Crampton-Platt; Lisa A Hörig; Azlan Mohamed; Charles C Y Xu; Douglas W Yu; Andreas Wilting
Journal:  Gigascience       Date:  2019-04-01       Impact factor: 6.524

7.  funbarRF: DNA barcode-based fungal species prediction using multiclass Random Forest supervised learning model.

Authors:  Prabina Kumar Meher; Tanmaya Kumar Sahu; Shachi Gahoi; Ruchi Tomar; Atmakuri Ramakrishna Rao
Journal:  BMC Genet       Date:  2019-01-07       Impact factor: 2.797

Review 8.  Current Knowledge and Computational Techniques for Grapevine Meta-Omics Analysis.

Authors:  Salvatore Alaimo; Gioacchino P Marceca; Rosalba Giugno; Alfredo Ferro; Alfredo Pulvirenti
Journal:  Front Plant Sci       Date:  2018-01-09       Impact factor: 5.753

9.  PROTAX-Sound: A probabilistic framework for automated animal sound identification.

Authors:  Ulisses Moliterno de Camargo; Panu Somervuo; Otso Ovaskainen
Journal:  PLoS One       Date:  2017-09-01       Impact factor: 3.240

10.  A reference cytochrome c oxidase subunit I database curated for hierarchical classification of arthropod metabarcoding data.

Authors:  Rodney T Richardson; Johan Bengtsson-Palme; Mary M Gardiner; Reed M Johnson
Journal:  PeerJ       Date:  2018-06-26       Impact factor: 2.984

