Literature DB >> 34902551

Deep graph convolutional network for US birth data harmonization.

Lishan Yu1, Hamisu M Salihu2, Deepa Dongarwar3, Luyao Chen4, Xiaoqian Jiang4.   

Abstract

In this paper, we developed a feasible and efficient deep-learning-based framework to combine the United States (US) natality data for the last five decades, with changing variables and factors, into a consistent database. We constructed a graph based on the property and elements of databases, including variables, and conducted a graph convolutional network (GCN) to learn the embeddings of variables on the constructed graph, where the learned embeddings implied the similarity of variables. Specifically, we devised a loss function with a slack margin and a banlist mechanism (for a random walk) to learn the desired structure (two nodes sharing more information were more similar to each other.), and developed an active learning mechanism to conduct the harmonization. Toward a total of 9,321 variables from 49 databases (i.e., 783 stemmed variables, from 1970 to 2018), we applied our model iteratively together with human reviews for four rounds, then obtained 323 hyperchains of variables. During the harmonization, the first round of our model achieved recall and precision of 87.56%, 57.70%, respectively. Our harmonized graph neural network (HGNN) method provides a feasible and efficient way to connect relevant databases at a meta-level. Adapting to the database's property and characteristics, HGNN can learn patterns globally, which is powerful to discover the similarity between variables among databases. Our proposed method provides an effective way to reduce the manual effort in database harmonization and integration of fragmented data into useful databases for future research.
Copyright © 2021 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Database harmonization; Deep learning; Graph neural network; Natality data

Mesh:

Year:  2021        PMID: 34902551      PMCID: PMC8766952          DOI: 10.1016/j.jbi.2021.103974

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  4 in total

1.  Birth defects surveillance, epidemiology, and significance in public health.

Authors:  Julianne S Collins; Russell S Kirby
Journal:  Birth Defects Res A Clin Mol Teratol       Date:  2009-11

2.  Graph convolutional networks for computational drug development and discovery.

Authors:  Mengying Sun; Sendong Zhao; Coryandar Gilvary; Olivier Elemento; Jiayu Zhou; Fei Wang
Journal:  Brief Bioinform       Date:  2020-05-21       Impact factor: 11.622

3.  Harmonized representation learning on dynamic EHR graphs.

Authors:  Dongha Lee; Xiaoqian Jiang; Hwanjo Yu
Journal:  J Biomed Inform       Date:  2020-04-25       Impact factor: 6.317

4.  Modeling polypharmacy side effects with graph convolutional networks.

Authors:  Marinka Zitnik; Monica Agrawal; Jure Leskovec
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.