Literature DB >> 25894321

Combining Models is More Likely to Give Better Predictions than Single Models.

Xiaoping Hu1, Laurence V Madden1, Simon Edwards1, Xiangming Xu1.   

Abstract

In agricultural research, it is often difficult to construct a single "best" predictive model based on data collected under field conditions. We studied the relative prediction performance of combining empirical linear models over the single best model in relation to number of models to be combined, number of variates in the models, magnitude of residual errors, and weighting schemes. Two scenarios were simulated: the modeler did or did not know the relative of performance of the models to be combined. For the former case, model averaging is achieved either through weights based on the Akaike Information Criterion (AIC) statistic or with arithmetic averaging; for the latter case, only the arithmetic averaging is possible (because the relative model predictive performance is not known for a common dataset). In addition to two experimental datasets on oat mycotoxins in relation to environmental variables, two datasets were generated assuming a consistent correlation structure among explanatory variates with two magnitudes of residual errors. For the majority of cases, model averaging resulted in improved prediction performance over the single-model predictions, especially when a modeler does not have the information of relative model performance. The fewer variates in the models to be combined, the greater is improvement of model averaging over the single-model predictions. Combining models led to very little improvement over individual models when there were many variates in individual models. Overall, simple arithmetic averaging resulted in slightly better performance than the AIC-based weighted averaging. The advantage in model averaging is also noticeable for larger residual errors. This study suggests that model averaging generally performs better than single-model predictions, especially when a modeler does not have information on the relative performance of the candidate models.

Mesh:

Year:  2015        PMID: 25894321     DOI: 10.1094/PHYTO-11-14-0315-R

Source DB:  PubMed          Journal:  Phytopathology        ISSN: 0031-949X            Impact factor:   4.025


  6 in total

1.  Estimating the Prevalence of Opioid use Disorder in the Cincinnati Region using Probabilistic Multiplier Methods and Model Averaging.

Authors:  Peter J Mallow; Nila Sathe; Michael Topmiller; Jennifer Chubinski; Dillon Carr; Roni Christopher
Journal:  J Health Econ Outcomes Res       Date:  2019-04-03

2.  Predicting plant disease epidemics from functionally represented weather series.

Authors:  D A Shah; P A Paul; E D De Wolf; L V Madden
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2019-06-24       Impact factor: 6.237

3.  Accuracy in the prediction of disease epidemics when ensembling simple but highly correlated models.

Authors:  Denis A Shah; Erick D De Wolf; Pierce A Paul; Laurence V Madden
Journal:  PLoS Comput Biol       Date:  2021-03-15       Impact factor: 4.475

4.  Improved two-stage model averaging for high-dimensional linear regression, with application to Riboflavin data analysis.

Authors:  Juming Pan
Journal:  BMC Bioinformatics       Date:  2021-03-25       Impact factor: 3.169

5.  Development of a Decision Support System for the Management of Mummy Berry Disease in Northwestern Washington.

Authors:  Mladen Cucak; Dalphy O C Harteveld; Lisa Wasko DeVetter; Tobin L Peever; Rafael de Andrade Moral; Chakradhar Mattupalli
Journal:  Plants (Basel)       Date:  2022-08-04

6.  Natural Co-Occurrence of Multiple Mycotoxins in Unprocessed Oats Grown in Ireland with Various Production Systems.

Authors:  Lorenzo De Colli; Karl De Ruyck; Mohamed F Abdallah; John Finnan; Ewen Mullins; Steven Kildea; John Spink; Christopher Elliott; Martin Danaher
Journal:  Toxins (Basel)       Date:  2021-03-04       Impact factor: 4.546

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.