| Literature DB >> 28843784 |
Bi Zhao1, Aqeela Erwin1, Bin Xue2.
Abstract
Identifying differentially expressed genes is critical in microarray data analysis. Many methods have been developed by combining p-value, fold-change, and various statistical models to determine these genes. When using these methods, it is necessary to set up various pre-determined cutoff values. However, many of these cutoff values are somewhat arbitrary and may not have clear connections to biology. In this study, a genetic distance method based on gene expression level was developed to analyze eight sets of microarray data extracted from the GEO database. Since the genes used in distance calculation have been ranked by fold-change, the genetic distance becomes more stable when adding more genes in the calculation, indicating there is an optimal set of genes which are sufficient to characterize the stable difference between samples. This set of genes is differentially expressed genes representing both the genotypic and phenotypic differences between samples.Keywords: Differentially expressed genes; Genetic distance; Genotypically and phenotypically significant DEGs; Microarray data analysis; gpsDEGs
Mesh:
Year: 2017 PMID: 28843784 DOI: 10.1016/j.ygeno.2017.08.007
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736