Literature DB >> 28011783

A new method for decontamination of de novo transcriptomes using a hierarchical clustering algorithm.

Joël Lafond-Lapalme1,2, Marc-Olivier Duceppe1, Shengrui Wang3, Peter Moffett2, Benjamin Mimee1.   

Abstract

Motivation: The identification of contaminating sequences in a de novo assembly is challenging because of the absence of information on the target species. For sample types where the target organism is impossible to isolate from its matrix, such as endoparasites, endosymbionts and soil-harvested samples, contamination is unavoidable. A few post-assembly decontamination methods are currently available but are based only on alignments to databases, which can lead to poor decontamination.
Results: We present a new decontamination method based on a hierarchical clustering algorithm called MCSC. This method uses frequent patterns found in sequences to create clusters. These clusters are then linked to the target species or tagged as contaminants using classic alignment tools. The main advantage of this decontamination method is that it allows sequences to be tagged correctly even if they are unknown or misaligned to a database. Availability and Implementation: Scripts and documentation about the MCSC decontamination method are available at https://github.com/Lafond-LapalmeJ/MCSC_Decontamination . Contact: : benjamin.mimee@agr.gc.ca. Supplementary information: Supplementary data are available at Bioinformatics online. © Crown copyright 2016.

Mesh:

Year:  2017        PMID: 28011783     DOI: 10.1093/bioinformatics/btw793

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Tracing foreign sequences in plant transcriptomes and genomes using OCT4, a POU domain protein.

Authors:  Adeleh Saffar; Maryam M Matin
Journal:  Mol Genet Genomics       Date:  2021-03-18       Impact factor: 3.291

2.  A software tool 'CroCo' detects pervasive cross-species contamination in next generation sequencing data.

Authors:  Paul Simion; Khalid Belkhir; Clémentine François; Julien Veyssier; Jochen C Rink; Michaël Manuel; Hervé Philippe; Maximilian J Telford
Journal:  BMC Biol       Date:  2018-03-05       Impact factor: 7.431

3.  The myxozoan minicollagen gene repertoire was not simplified by the parasitic lifestyle: computational identification of a novel myxozoan minicollagen gene.

Authors:  Jiří Kyslík; Anush Kosakyan; Serafim Nenarokov; Astrid S Holzer; Ivan Fiala
Journal:  BMC Genomics       Date:  2021-03-20       Impact factor: 3.969

4.  The time course of molecular acclimation to seawater in a euryhaline fish.

Authors:  Lucrezia C Bonzi; Alison A Monroe; Robert Lehmann; Michael L Berumen; Timothy Ravasi; Celia Schunter
Journal:  Sci Rep       Date:  2021-09-13       Impact factor: 4.379

5.  Inflammation and convergent placenta gene co-option contributed to a novel reproductive tissue.

Authors:  Leon Hilgers; Olivia Roth; Arne W Nolte; Alina Schüller; Tobias Spanke; Jana M Flury; Ilham V Utama; Janine Altmüller; Daisy Wowor; Bernhard Misof; Fabian Herder; Astrid Böhne; Julia Schwarzer
Journal:  Curr Biol       Date:  2021-12-20       Impact factor: 10.834

6.  A resource for sustainable management: De novo assembly and annotation of the liver transcriptome of the Atlantic chub mackerel, Scomber colias.

Authors:  André M Machado; Mónica Felício; Elza Fonseca; Rute R da Fonseca; L Filipe C Castro
Journal:  Data Brief       Date:  2018-03-13

7.  "Out of the Can": A Draft Genome Assembly, Liver Transcriptome, and Nutrigenomics of the European Sardine, Sardina pilchardus.

Authors:  André M Machado; Ole K Tørresen; Naoki Kabeya; Alvarina Couto; Bent Petersen; Mónica Felício; Paula F Campos; Elza Fonseca; Narcisa Bandarra; Mónica Lopes-Marques; Renato Ferraz; Raquel Ruivo; Miguel M Fonseca; Sissel Jentoft; Óscar Monroig; Rute R da Fonseca; L Filipe C Castro
Journal:  Genes (Basel)       Date:  2018-10-09       Impact factor: 4.096

8.  Shifting evolutionary sands: transcriptome characterization of the Aptostichus atomarius species complex.

Authors:  Nicole L Garrison; Michael S Brewer; Jason E Bond
Journal:  BMC Evol Biol       Date:  2020-06-15       Impact factor: 3.260

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.