| Literature DB >> 28295162 |
Chun Chen1, Huong Le1, Chetan T Goudar1.
Abstract
RNA-Seq is a powerful transcriptomics tool for mammalian cell culture process development. Successful RNA-Seq data analysis requires a high quality reference for read mapping and gene expression quantification. Currently, there are two public genome references for Chinese hamster ovary (CHO) cells, the predominant mammalian cell line in the biopharmaceutical industry. In this study, we compared these two references by analyzing 60 RNA-Seq samples from a variety of CHO cell culture conditions. Among the 20,891 common genes in both references, we observed that 31.5% have more than 7.1% quantification differences, implying gene definition differences in the two references. We propose a framework to quantify this difference using two metrics, Consistency and Stringency, which account for the average quantification difference between the two references over all samples, and the sample-specific effect on the quantification result, respectively. These two metrics can be used to identify potential genes for future gene model improvement and to understand the reliability of differentially expressed genes identified by RNA-Seq data analysis. Before a more comprehensive genome reference for CHO cells emerges, the strategy proposed in this study can enable more robust transcriptome analysis from CHO cell RNA-Seq data. Biotechnol. Bioeng. 2017;114: 1603-1613.Entities:
Keywords: CHO cells; RNA-Seq; gene expression analysis; gene model; genome reference
Mesh:
Substances:
Year: 2017 PMID: 28295162 DOI: 10.1002/bit.26290
Source DB: PubMed Journal: Biotechnol Bioeng ISSN: 0006-3592 Impact factor: 4.530