| Literature DB >> 31865918 |
Chengyou Liu1, Leilei Zhou1, Yuhe Wang1, Shuchang Tian1, Junlin Zhu2, Hang Qin1, Yong Ding3,4, Hongbing Jiang5,6.
Abstract
Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processes have been proved successful, the probability that the number of non-differentially expressed genes measured by false discovery rate (FDR) has a large standard deviation, and the misidentification rate (type I error) grows rapidly when the number of genes to be detected become larger. In this study we developed a new method, Unit Gamma Measurement (UGM), accounting for multiple hypotheses test statistics distribution, which could reduce the dependency problem. Simulated expression profile data and breast cancer RNA-Seq data were utilized to testify the accuracy of UGM. The results show that the number of non-differentially expressed genes identified by the UGM is very close to the real-evidence data, and the UGM also has a smaller standard error, range, quartile range and RMS error. In addition, the UGM can be used to screen many breast cancer-associated genes, such as BRCA1, BRCA2, PTEN, BRIP1, etc., provides better accuracy, robustness and efficiency, the method of identification differentially expressed genes in high-throughput sequencing.Entities:
Keywords: Cancer-associated genes; Differentially expressed genes; False discovery rate; RNA-Seq data; Root mean square error; Standard deviation
Mesh:
Year: 2019 PMID: 31865918 PMCID: PMC6927121 DOI: 10.1186/s12976-019-0117-1
Source DB: PubMed Journal: Theor Biol Med Model ISSN: 1742-4682 Impact factor: 2.432