Literature DB >> 35434472

An Open-Access Data Platform: Global Nutrition and Health Atlas (GNHA).

Bingjie Zhou1, Shiwei Liang1, Kyle M Monahan2, Naglaa El-Abbadi1, Melissa S Cruz1, Yutong Chen1, Annie DeVane1, Julia Reedy1, Jianyi Zhang1, Iaroslava Semenova3, Ivan Montoliu3, Dariush Mozaffarian1, Dantong Wang3, Elena N Naumova1.   

Abstract

The rapid development of nutrition science is embracing digital transformation to generate large amounts of data. Precision nutrition and "Big Data" place increasing demand for data repositories and visualization, which enhances the digital transformation. We defined the need for an integrated nutrition data platform as a web-based platform that can collect, store, track, analyze, monitor, and visually display key metrics in nutrition and health while allowing users to interact with visuals and download data provided in the platform. Interactive dashboards create new opportunities for scholars and practitioners to generate and test hypotheses. We present the development and implementation of the Global Nutrition and Health Atlas (GNHA; https://sites.tufts.edu/gnha/), an open-access online platform covering nutrition and health data with 26 themes and 500+ indicators from 190+ countries up to 30 y. We view GNHA as an interactive tool aiming to share information and perspectives and foster collaborations and innovations.
© The Author(s) 2022. Published by Oxford University Press on behalf of the American Society for Nutrition.

Entities:  

Keywords:  dashboard; data repository; food system; nutrition; visualization

Year:  2022        PMID: 35434472      PMCID: PMC9007240          DOI: 10.1093/cdn/nzac031

Source DB:  PubMed          Journal:  Curr Dev Nutr        ISSN: 2475-2991


Introduction

With the rapid development of nutrition surveillance and intensive research in public health and personalized nutrition, large volumes of data are generated from national prospective cohorts, longitudinal studies, household surveys, and other sources. Nutrition data platforms present a growing opportunity for this rapid development (1–4). Over the last decades, interactive dashboards, commonly used in business data analytics, are being frequently applied to health and nutrition domains (5, 6). In the field of food and nutrition, the Hunger Map Live dashboard, produced by the World Food Program, attempts to monitor nutrition outcomes with an emphasis on predicting hunger on a global scale (7). The Joint Child Malnutrition Estimates interactive dashboard developed by UNICEF aims to support countries in generating, analyzing, and using data to monitor malnutrition in children and women (8). These comprehensive data platforms allow a broad range of users to explore multiple data sources through interactive graphs, maps, and data summaries (9–11). Only a few nutrition dashboards to date are equipped with all functionalities needed for effective research and communication. Moreover, knowledge and information gaps among main stakeholders in data collection, data compilation, and data analysis are substantial. As expected, most existing nutrition data platforms are highly specialized and focus on specific nutritional topics—for example, vitamin deficiency, hunger, or food fortification. A few have attempted to cover global nutrition and health, yet the data quality, especially data granularity and completeness, were not emphasized. Interactive visualizations in modern nutrition data platforms are still limited, and visualization quality could be further improved.

Methods

Database content

We used several datasets from the Nutrition and Health Atlas (NHA) that Nestlé Research developed in 2015 as the foundation of Global Nutrition and Health Atlas (GNHA) 2021 version. We checked the accessibility of the data sources and references from the original NHA data. We updated links with new ones if they had been moved to other locations. We then reformatted the data structure by adding gender (male, female), multiple age groups, and residential areas (urban and rural) into separate columns. This step is allowed to reduce duplication and improve data granularity. Finally, according to original sources, all compiled variables were updated to the most recent data (as of August 2021). The overall data structure is shown in Supplemental Figure 1.

Software implementation and query tools

We chose Harvard Dataverse, a protected and manageable research data repository, for storing and sharing. In Dataverse (Dataverse Project, dataverse.org), datasets are accompanied by well-documented metadata and a citation with a digital object identifier (DOI) for each dataset; data can be previewed and downloaded as comma-separated values (CSV) files. We created a GNHA Dataverse, uploaded all data to the GNHA Dataverse page, and embedded this specific page into the GNHA website. We also made a guestbook to monitor the downloaded entries. A sample of a Dataverse interface is shown in Supplemental Figure 2. We also saved data in the Box cloud service to provide high flexibility for previewing and filtering data. We connected the Box folder to an interactive data visualization software, Tableau, and used iFrame to embed the Tableau tabular dashboard to our website. In this data preview tool, filters were added to every column of a dataset and can be applied to multiple columns simultaneously. An example of Tableau query tool interface is shown in Supplemental Figure 3.

Visualizations

To create the visualization gallery, we connected the database to Tableau, created interactive visualization in Tableau, and then used iFrame to embed the Tableau dashboards to the GNHA platform. Visualizations are grouped into 7 categories: 6 dimensions and 1 global comparison. Each dimension has a Tableau dashboard with 2 tabs. The first tab is a uniform template across all dimensions that presents 1 indicator in all countries with available data using several plots to show different perspectives. The second tab is a country profile, offering a unique design according to specific content in each dimension. The country profile includes several key indicators within 1 dimension. Users can change the view to show a summary of this dimension in different countries. Finally, the global comparison dashboard contains a scatterplot, as an intersection in which data can be pulled from any dimension. All visualizations are interactive, supported with filters and pop-up windows, and can be downloaded in PDF or image format.

GNHA

We have developed a comprehensive nutrition and health data platform covering 6 dimensions: Demographics, Dietary Intake, Nutritional Status, Health Status, Health Economics, and Food Sustainability. Twenty-six themes and 500 indicators are organized under these 6 dimensions to cover global health and nutrition data in more than 190 countries over the past 30 y. In the present version, data are collected from more than 250 established public sources, including international institutions, governmental reports, nongovernmental organizations and academic institutions, and peer-reviewed research papers. The GNHA platform serves 3 major missions: enabling public education, enhancing collaboration, and driving innovation. We strive to support these missions by improving data literacy and science communication, embedded in the platform's dashboard features. The platform also allows to download data for analysis, recreate results claimed by different studies, and compare data originating from various sources (12). Therefore, we focused on broad information coverage, data credibility, visualization quality, and dashboard functionalities. Specifically, we ensured that all content on our platform—the data, visualizations, and descriptions—was accessible for various audiences, from students, nutritionists, and scientists to the public and policymakers. We produced tutorials to guide users through every element of the GNHA website, including the dashboard gallery, data portal, and data sources. We embedded detailed instructions within the visualization dashboard to help users with data and figures interpretation. Finally, we provided comprehensive metadata in an interactive and downloadable manner to help users check the reliability and credibility of the selected data sources. Data visualizations are the key component of the GNHA platform. In the visualization gallery, we first have a Global Overview tab () to present data on 1 variable from a specific theme. This tab is implemented a uniform design with a combination of several interconnected graphs to present 1 indicator from multiple angles. For example, information on geographic distribution, time trend, and country rankings of a presented variable can be all viewed in 1 page. Compared with Global Overview, which presents 1 indicator at a time, the Country Profile tab provides an overview in a single country, which includes several key metrics from a selected dimension (Figure 1B). Finally, we created a Comparison dashboard (), which enables indicators from different themes and dimensions to be viewed simultaneously. The Comparison dashboard serves as a bridge that connects all GNHA data together and gives users a quick way to review the relation between any 2 variables of interest. Furthermore, we provided extensive instructions on applying filters and using interactive features to get a comprehensive understanding of the graphs. All visualizations are equipped with an explanation of why we apply those types of plots and how to correctly interpret them. One of the important goals for the GNHA visualization gallery is to serve as an easy and quick tool for students, scholars, and all other users to teach and learn essential information and generate insights through visualizations (13).
FIGURE 1

Examples of Tableau dashboard screenshots: (A) Global Overview as tab 1 and (B) Country Profile as tab 2. The Global Overview tab displays information of 1 indicator at a time from a specific dimension. In this example, any indicator within the “Dietary Intake” dimension can be selected from the dropdown menu “Choose an indicator.” This tab uses a uniform template for all dimensions with several plots to show data from different perspectives. This dashboard contains the following: 1) a choropleth map at global scale on the top left showing the geographical distribution of this indicator; 2) a histogram on the top right showing countries’ overall distributions with the color scheme representing the quartiles aligned with the color scheme in the world map; 3) a bar graph below the histogram showing the rankings and specific values of all countries for this indicator; 4) a line chart under the world map showing the available time points and the temporal trend of the chosen indicator; and 5) a boxplot at the bottom showing the countries’ distribution by year sharing the same time axis with the line chart. This organization with interconnected plots forms a cohesive and comprehensive data representation. The country-specific view provides a country profile with representative indicators for each dimension. Compared with Global Overview, which presents 1 indicator at a time, the Country Profile includes several key indicators within a selected dimension. In this example, we chose to show data from various sources on food supply, food security indicators, along with daily food consumption in the United States. Note that this dashboard allows to depict information that is limited or not available. GDD, Global Dietary Database.

FIGURE 2

An example of the Tableau comparison dashboard—Global Comparison. The scatter plot is the central part of this dashboard screenshot. Users can choose indicators for the horizontal and vertical axes from any dimension. In this example, we chose to show the relation between total processed meats and disability-adjusted life-years lost due to cancers in 2018. We also provide the data completeness metric for each indicator by estimating the number of countries which have data available in a given year. Once indicators on 2 axes are chosen, users can also click the Year “Play” button on the right side to create an animation that shows a dynamic relation between variables over time. Points on the scatter plot can be colored by country's income level or world region at the users’ choice. GDD, Global Dietary Database.

Examples of Tableau dashboard screenshots: (A) Global Overview as tab 1 and (B) Country Profile as tab 2. The Global Overview tab displays information of 1 indicator at a time from a specific dimension. In this example, any indicator within the “Dietary Intake” dimension can be selected from the dropdown menu “Choose an indicator.” This tab uses a uniform template for all dimensions with several plots to show data from different perspectives. This dashboard contains the following: 1) a choropleth map at global scale on the top left showing the geographical distribution of this indicator; 2) a histogram on the top right showing countries’ overall distributions with the color scheme representing the quartiles aligned with the color scheme in the world map; 3) a bar graph below the histogram showing the rankings and specific values of all countries for this indicator; 4) a line chart under the world map showing the available time points and the temporal trend of the chosen indicator; and 5) a boxplot at the bottom showing the countries’ distribution by year sharing the same time axis with the line chart. This organization with interconnected plots forms a cohesive and comprehensive data representation. The country-specific view provides a country profile with representative indicators for each dimension. Compared with Global Overview, which presents 1 indicator at a time, the Country Profile includes several key indicators within a selected dimension. In this example, we chose to show data from various sources on food supply, food security indicators, along with daily food consumption in the United States. Note that this dashboard allows to depict information that is limited or not available. GDD, Global Dietary Database. An example of the Tableau comparison dashboard—Global Comparison. The scatter plot is the central part of this dashboard screenshot. Users can choose indicators for the horizontal and vertical axes from any dimension. In this example, we chose to show the relation between total processed meats and disability-adjusted life-years lost due to cancers in 2018. We also provide the data completeness metric for each indicator by estimating the number of countries which have data available in a given year. Once indicators on 2 axes are chosen, users can also click the Year “Play” button on the right side to create an animation that shows a dynamic relation between variables over time. Points on the scatter plot can be colored by country's income level or world region at the users’ choice. GDD, Global Dietary Database. The platform is expandable and customizable at different levels. As we included data from multiple sources with varying formatting styles, we standardized them to adhere to a proper format with clearly labeled time, location, topic, and demographic characteristics. This standardization allows data to be filtered, selected with various combinations; it supports data sharing and fulfills the requirements for open data access and to promote research reproducibility. Our previous studies identified that data quality represents a major gap for many existing nutrition and health dashboards and platforms (14). Thus, we supplemented our platform by providing well-documented metadata on data completeness, a glossary of terms and their definition, the original sources, and the spatial-temporal characteristics of the data. Overall, data literacy is highly emphasized as the main GNHA feature throughout the entire process. With the proposed organization of the compiled data and detailed guidance for data visualization, we aim to offer tools for effective data literacy programming in nutrition and health. We view GNHA as not just a data-sharing platform but, more essentially, as a visual literature review tool that connects data published in different platforms, from individual studies to large-scale organizations like WHO and the World Bank.

Conclusions

The GNHA platform is hosted by the Friedman School of Nutrition Science and Policy, which aims to generate trusted science, educate future leaders, and produce a real-world impact in nutrition science and policy. We have invested heavily in data organization, standardization, and harmonization, so the data structure is flexible to incorporate data with diverse types and formats and updated regularly. Facing the growing challenges of keeping data up to date, we aim to form communities of practice and open for data contribution. Click here for additional data file.
  6 in total

1.  Evaluating visual analytics for health informatics applications: a systematic review from the American Medical Informatics Association Visual Analytics Working Group Task Force on Evaluation.

Authors:  Danny T Y Wu; Annie T Chen; John D Manning; Gal Levy-Fix; Uba Backonja; David Borland; Jesus J Caban; Dawn W Dowding; Harry Hochheiser; Vadim Kagan; Swaminathan Kandaswamy; Manish Kumar; Alexis Nunez; Eric Pan; David Gotz
Journal:  J Am Med Inform Assoc       Date:  2019-04-01       Impact factor: 4.497

2.  Reproducible Research: A Retrospective.

Authors:  Roger D Peng; Stephanie C Hicks
Journal:  Annu Rev Public Health       Date:  2021-01-19       Impact factor: 21.981

3.  Big data and systematic reviews in nutritional epidemiology.

Authors:  Ambika Satija; Frank B Hu
Journal:  Nutr Rev       Date:  2014-11-18       Impact factor: 7.110

4.  Nutritional surveillance.

Authors:  J B Mason; J T Mitchell
Journal:  Bull World Health Organ       Date:  1983       Impact factor: 9.408

5.  Completeness of open access FluNet influenza surveillance data for Pan-America in 2005-2019.

Authors:  Ryan B Simpson; Jordyn Gottlieb; Bingjie Zhou; Meghan A Hartwick; Elena N Naumova
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

6.  An analecta of visualizations for foodborne illness trends and seasonality.

Authors:  Ryan B Simpson; Bingjie Zhou; Tania M Alarcon Falconi; Elena N Naumova
Journal:  Sci Data       Date:  2020-10-13       Impact factor: 6.444

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.