| Literature DB >> 24039299 |
Amin Zollanvari1, Jianping Hua, Edward R Dougherty.
Abstract
This paper provides exact analytical expressions for the first and second moments of the true error for linear discriminant analysis (LDA) when the data are univariate and taken from two stochastic Gaussian processes. The key point is that we assume a general setting in which the sample data from each class do not need to be identically distributed or independent within or between classes. We compare the true errors of designed classifiers under the typical i.i.d. model and when the data are correlated, providing exact expressions and demonstrating that, depending on the covariance structure, correlated data can result in classifiers with either greater error or less error than when training with uncorrelated data. The general theory is applied to autoregressive and moving-average models of the first order, and it is demonstrated using real genomic data.Entities:
Keywords: Auto-regressive models; Correlated data; Expected error; Gaussian processes; Linear discriminant analysis; Moving-average models; Non-i.i.d data; Stochastic settings
Year: 2013 PMID: 24039299 PMCID: PMC3769149 DOI: 10.1016/j.patcog.2013.04.002
Source DB: PubMed Journal: Pattern Recognit ISSN: 0031-3203 Impact factor: 7.740