Literature DB >> 26331235

Genomic limitations to RNA sequencing expression profiling.

Cory D Hirsch1, Nathan M Springer1, Candice N Hirsch2.   

Abstract

The field of genomics has grown rapidly with the advent of massively parallel sequencing technologies, allowing for novel biological insights with regards to genomic, transcriptomic, and epigenomic variation. One widely utilized application of high-throughput sequencing is transcriptional profiling using RNA sequencing (RNAseq). Understanding the limitations of a technology is critical for accurate biological interpretations, and clear interpretation of RNAseq data can be difficult in species with complex genomes. To understand the limitations of accurate profiling of expression levels we simulated RNAseq reads from annotated gene models in several plant species including Arabidopsis, brachypodium, maize, potato, rice, soybean, and tomato. The simulated reads were aligned using various parameters such as unique versus multiple read alignments. This allowed the identification of genes recalcitrant to RNAseq analyses by having over- and/or under-estimated expression levels. In maize, over 25% of genes deviated by more than 20% from the expected count values, suggesting the need for cautious interpretation of RNAseq data for certain genes. The reasons identified for deviation from expected expression varied between species due to differences in genome structure including, but not limited to, genes encoding short transcripts, overlapping gene models, and gene family size. Utilizing existing empirical datasets we demonstrate the potential for biological misinterpretation resulting from inclusion of 'flagged genes' in analyses. While RNAseq is a powerful tool for understanding biology, there are limitations to this technology that need to be understood in order to improve our biological interpretations.
© 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

Entities:  

Keywords:  Arabidopsis; RNAseq; expression profile; maize; structural annotation

Mesh:

Year:  2015        PMID: 26331235     DOI: 10.1111/tpj.13014

Source DB:  PubMed          Journal:  Plant J        ISSN: 0960-7412            Impact factor:   6.417


  8 in total

1.  Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize.

Authors:  Candice N Hirsch; Cory D Hirsch; Alex B Brohammer; Megan J Bowman; Ilya Soifer; Omer Barad; Doron Shem-Tov; Kobi Baruch; Fei Lu; Alvaro G Hernandez; Christopher J Fields; Chris L Wright; Klaus Koehler; Nathan M Springer; Edward Buckler; C Robin Buell; Natalia de Leon; Shawn M Kaeppler; Kevin L Childs; Mark A Mikel
Journal:  Plant Cell       Date:  2016-11-01       Impact factor: 11.277

2.  Simulome: a genome sequence and variant simulator.

Authors:  Adam Price; Cynthia Gibas
Journal:  Bioinformatics       Date:  2017-02-10       Impact factor: 6.937

3.  Slow darkening of pinto bean seed coat is associated with significant metabolite and transcript differences related to proanthocyanidin biosynthesis.

Authors:  Kishor Duwadi; Ryan S Austin; Hemanta R Mainali; Kirstin Bett; Frédéric Marsolais; Sangeeta Dhaubhadel
Journal:  BMC Genomics       Date:  2018-04-16       Impact factor: 3.969

4.  RNA-seq assistant: machine learning based methods to identify more transcriptional regulated genes.

Authors:  Likai Wang; Yanpeng Xi; Sibum Sung; Hong Qiao
Journal:  BMC Genomics       Date:  2018-07-20       Impact factor: 3.969

Review 5.  The Role of Artificial Intelligence in the Diagnosis and Prognosis of Renal Cell Tumors.

Authors:  Matteo Giulietti; Monia Cecati; Berina Sabanovic; Andrea Scirè; Alessia Cimadamore; Matteo Santoni; Rodolfo Montironi; Francesco Piva
Journal:  Diagnostics (Basel)       Date:  2021-01-30

6.  Osteogenic transdifferentiation of primary human fibroblasts to osteoblast-like cells with human platelet lysate.

Authors:  Ferdy K Cayami; Lauria Claeys; Ruben de Ruiter; Bernard J Smilde; Lisanne Wisse; Natalija Bogunovic; Elise Riesebos; Lyra Eken; Irsan Kooi; Erik A Sistermans; Nathalie Bravenboer; Gerard Pals; Sultana M H Faradz; Daoud Sie; E Marelise W Eekhoff; Dimitra Micha
Journal:  Sci Rep       Date:  2022-08-29       Impact factor: 4.996

7.  SPLICE-q: a Python tool for genome-wide quantification of splicing efficiency.

Authors:  Verônica R de Melo Costa; Julianus Pfeuffer; Annita Louloupi; Ulf A V Ørom; Rosario M Piro
Journal:  BMC Bioinformatics       Date:  2021-07-15       Impact factor: 3.169

8.  WheatExp: an RNA-seq expression database for polyploid wheat.

Authors:  Stephen Pearce; Hans Vazquez-Gross; Sayer Y Herin; David Hane; Yi Wang; Yong Q Gu; Jorge Dubcovsky
Journal:  BMC Plant Biol       Date:  2015-12-24       Impact factor: 4.215

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.