Literature DB >> 32380076

Evaluating missing value imputation methods for food composition databases.

Gordana Ispirova1, Tome Eftimov2, Barbara Koroušić Seljak3.   

Abstract

Missing data are a common problem in most research fields and introduce an element of ambiguity into data analysis. They can arise due to different reasons: mishandling of samples, measurement error, deleted aberrant value or simply lack of analysis. The nutrition domain is no exception to the problem of missing data. This paper addresses the problem of missing data in food composition databases (FCDBs). Missing data in FCDBs results in incomplete FCDBs, which have limited usage, because any dietary assessment can be performed only on a complete dataset. Most often, this problem is resolved by calculating means/medians from excising data in the same database or borrowing data from other FCDBs. These solutions introduce significant error. We focus on missing data imputation techniques based on methods for substituting missing values with statistical prediction: Non-Negative Matrix Factorization (NMF), Multiple Imputations by Chained Equations (MICE), Nonparametric Missing Value Imputation using Random Forest (MissForest), and K-Nearest Neighbors (KNN), and compared them with commonly used approaches - fill-in with mean, fill-in with median. The data used was from national FCDBs collected by EuroFIR (European Food Information Resource Network). The results show that the state-of-the-art methods for imputation yield better results than the traditional approaches.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Food composition databases; Missing data; Missing-data imputation; Nutrient values; food composition data

Mesh:

Year:  2020        PMID: 32380076     DOI: 10.1016/j.fct.2020.111368

Source DB:  PubMed          Journal:  Food Chem Toxicol        ISSN: 0278-6915            Impact factor:   6.023


  3 in total

1.  Feature Optimization Method of Material Identification for Loose Particles Inside Sealed Relays.

Authors:  Zhigang Sun; Aiping Jiang; Guotao Wang; Min Zhang; Huizhen Yan
Journal:  Sensors (Basel)       Date:  2022-05-07       Impact factor: 3.847

Review 2.  Statistical Methods for the Analysis of Food Composition Databases: A Review.

Authors:  Yusentha Balakrishna; Samuel Manda; Henry Mwambi; Averalda van Graan
Journal:  Nutrients       Date:  2022-05-25       Impact factor: 6.706

3.  Children's Neurological Status Epilepticus and Poor Prognostic Factors through Electroencephalogram Image under Composite Domain Analysis Algorithm.

Authors:  Runhan Zhang; Chao Gao; Junting Liu; Manting Zhao; Yongli Wu
Journal:  J Healthc Eng       Date:  2021-11-25       Impact factor: 2.682

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.