Literature DB >> 32180366

Error, noise and bias in de novo transcriptome assemblies.

Adam H Freedman1, Michele Clamp1, Timothy B Sackton1.   

Abstract

De novo transcriptome assembly is a powerful tool, and has been widely used over the last decade for making evolutionary inferences. However, it relies on two implicit assumptions: that the assembled transcriptome is an unbiased representation of the underlying expressed transcriptome, and that expression estimates from the assembly are good, if noisy approximations of the relative abundance of expressed transcripts. Using publicly available data for model organisms, we demonstrate that, across assembly algorithms and data sets, these assumptions are consistently violated. Bias exists at the nucleotide level, with genotyping error rates ranging from 30% to 83%. As a result, diversity is underestimated in transcriptome assemblies, with consistent underestimation of heterozygosity in all but the most inbred samples. Even at the gene level, expression estimates show wide deviations from map-to-reference estimates, and positive bias at lower expression levels. Standard filtering of transcriptome assemblies improves the robustness of gene expression estimates but leads to the loss of a meaningful number of protein-coding genes, including many that are highly expressed. We demonstrate a computational method, length-rescaled CPM, to partly alleviate noise and bias in expression estimates. Researchers should consider ways to minimize the impact of bias in transcriptome assemblies.
© 2020 John Wiley & Sons Ltd.

Entities:  

Keywords:  adaptation; bioinformatics/phyloinfomatics; genomics/proteomics; population genetics; transcriptomics

Year:  2020        PMID: 32180366     DOI: 10.1111/1755-0998.13156

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  13 in total

Review 1.  A simple guide to de novo transcriptome assembly and annotation.

Authors:  Venket Raghavan; Louis Kraft; Fantin Mesny; Linda Rigerte
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

Review 2.  Proteotranscriptomics - A facilitator in omics research.

Authors:  Michal Levin; Falk Butter
Journal:  Comput Struct Biotechnol J       Date:  2022-07-09       Impact factor: 6.155

3.  Improving the Annotation of the Venom Gland Transcriptome of Pamphobeteus verdolaga, Prospecting Novel Bioactive Peptides.

Authors:  Cristian Salinas-Restrepo; Elizabeth Misas; Sebastian Estrada-Gómez; Juan Carlos Quintana-Castillo; Fanny Guzman; Juan C Calderón; Marco A Giraldo; Cesar Segura
Journal:  Toxins (Basel)       Date:  2022-06-15       Impact factor: 5.075

4.  Species and population specific gene expression in blood transcriptomes of marine turtles.

Authors:  Shreya M Banerjee; Jamie Adkins Stoll; Camryn D Allen; Jennifer M Lynch; Heather S Harris; Lauren Kenyon; Richard E Connon; Eleanor J Sterling; Eugenia Naro-Maciel; Kathryn McFadden; Margaret M Lamont; James Benge; Nadia B Fernandez; Jeffrey A Seminoff; Scott R Benson; Rebecca L Lewison; Tomoharu Eguchi; Tammy M Summers; Jessy R Hapdei; Marc R Rice; Summer Martin; T Todd Jones; Peter H Dutton; George H Balazs; Lisa M Komoroske
Journal:  BMC Genomics       Date:  2021-05-13       Impact factor: 3.969

5.  Prospects for sociogenomics in avian cooperative breeding and parental care.

Authors:  Flavia Termignoni-Garcia; Matthew I M Louder; Christopher N Balakrishnan; Lauren O'Connell; Scott V Edwards
Journal:  Curr Zool       Date:  2019-12-04       Impact factor: 2.624

6.  The Developmental Transcriptome of Bagworm, Metisa plana (Lepidoptera: Psychidae) and Insights into Chitin Biosynthesis Genes.

Authors:  Nur Lina Rahmat; Anis Nadyra Zifruddin; Cik Mohd Rizuan Zainal Abidin; Nor-Azlan Nor Muhammad; Maizom Hassan
Journal:  Genes (Basel)       Date:  2020-12-23       Impact factor: 4.096

7.  RNAflow: An Effective and Simple RNA-Seq Differential Gene Expression Pipeline Using Nextflow.

Authors:  Marie Lataretu; Martin Hölzer
Journal:  Genes (Basel)       Date:  2020-12-10       Impact factor: 4.096

8.  Proteomic Identification and Meta-Analysis in Salvia hispanica RNA-Seq de novo Assemblies.

Authors:  Ashwil Klein; Lizex H H Husselmann; Achmat Williams; Liam Bell; Bret Cooper; Brent Ragar; David L Tabb
Journal:  Plants (Basel)       Date:  2021-04-14

9.  Tripartite parasitic and symbiotic interactions as a possible mechanism of horizontal gene transfer.

Authors:  Chaoyang Zhao; Shaoming Miao; Yanfang Yin; Yanjuan Zhu; Paul Nabity; Raman Bansal; Chenxi Liu
Journal:  Ecol Evol       Date:  2021-04-06       Impact factor: 2.912

10.  Annotated genome sequences of the carnivorous plant Roridula gorgonias and a non-carnivorous relative, Clethra arborea.

Authors:  Stefanie Hartmann; Michaela Preick; Silke Abelt; André Scheffel; Michael Hofreiter
Journal:  BMC Res Notes       Date:  2020-09-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.