Xiayuan Huang1, Nicholas Tatonetti2, Katie LaRow2, Brooke Delgoffee3, John Mayer3, David Page4, Scott J Hebbring5. 1. Department of Biostatistics & Medical Informatics, University of Wisconsin-Madison, Madison, United States Wisconsin. 2. Department of Biomedical Informatics, Columbia University, New York, New York, United States. 3. Office of Research Computing and Analytics, Marshfield Clinic Research Foundation, Marshfield, United States Wisconsin. 4. Department of Biostatistics & Bioinformatics, Duke University, Durham, North Carolina, United States. 5. Center for Precision Medicine Research, Marshfield Clinic Research Foundation, Marshfield, United States Wisconsin.
Abstract
MOTIVATION: The use and functionality of Electronic Health Records (EHR) have increased rapidly in the past few decades. EHRs are becoming an important depository of patient health information and can capture family data. Pedigree analysis is a longstanding and powerful approach that can gain insight into the underlying genetic and environmental factors in human health, but traditional approaches to identifying and recruiting families are low-throughput and labor-intensive. Therefore, high-throughput methods to automatically construct family pedigrees are needed. RESULTS: We developed a stand-alone application: Electronic Pedigrees, or E-Pedigrees, which combines two validated family prediction algorithms into a single software package for high throughput pedigrees construction. The convenient platform considers patients' basic demographic information and/or emergency contact data to infer high-accuracy parent-child relationship. Importantly, E-Pedigrees allows users to layer in additional pedigree data when available and provides options for applying different logical rules to improve accuracy of inferred family relationships. This software is fast and easy to use, is compatible with different EHR data sources, and its output is a standard PED file appropriate for multiple downstream analyses. AVAILABILITY: The Python 3.3+ version E-Pedigrees application is freely available on: https://github.com/xiayuan-huang/E-pedigrees. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The use and functionality of Electronic Health Records (EHR) have increased rapidly in the past few decades. EHRs are becoming an important depository of patient health information and can capture family data. Pedigree analysis is a longstanding and powerful approach that can gain insight into the underlying genetic and environmental factors in human health, but traditional approaches to identifying and recruiting families are low-throughput and labor-intensive. Therefore, high-throughput methods to automatically construct family pedigrees are needed. RESULTS: We developed a stand-alone application: Electronic Pedigrees, or E-Pedigrees, which combines two validated family prediction algorithms into a single software package for high throughput pedigrees construction. The convenient platform considers patients' basic demographic information and/or emergency contact data to infer high-accuracy parent-child relationship. Importantly, E-Pedigrees allows users to layer in additional pedigree data when available and provides options for applying different logical rules to improve accuracy of inferred family relationships. This software is fast and easy to use, is compatible with different EHR data sources, and its output is a standard PED file appropriate for multiple downstream analyses. AVAILABILITY: The Python 3.3+ version E-Pedigrees application is freely available on: https://github.com/xiayuan-huang/E-pedigrees. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Catherine A McCarty; Russell A Wilke; Philip F Giampietro; Steve D Wesbrook; Michael D Caldwell Journal: Per Med Date: 2005-03 Impact factor: 2.512
Authors: Benjamin A Goldstein; Ann Marie Navar; Michael J Pencina; John P A Ioannidis Journal: J Am Med Inform Assoc Date: 2016-05-17 Impact factor: 4.497
Authors: Heidi A Hanson; Claire L Leiser; Brock O'Neil; Christopher Martin; Sumati Gupta; Ken R Smith; Christopher Dechet; William T Lowrance; Michael J Madsen; Nicola J Camp Journal: Cancer Epidemiol Biomarkers Prev Date: 2020-02-25 Impact factor: 4.254
Authors: Fernanda C G Polubriaginof; Rami Vanguri; Kayla Quinnies; Gillian M Belbin; Alexandre Yahi; Hojjat Salmasian; Tal Lorberbaum; Victor Nwankwo; Li Li; Mark M Shervey; Patricia Glowe; Iuliana Ionita-Laza; Mary Simmerling; George Hripcsak; Suzanne Bakken; David Goldstein; Krzysztof Kiryluk; Eimear E Kenny; Joel Dudley; David K Vawdrey; Nicholas P Tatonetti Journal: Cell Date: 2018-05-17 Impact factor: 41.582
Authors: Yik Y Teo; Andrew E Fry; Miguel A Sanjoaquin; Bonnie Pederson; Kerrin S Small; Kirk A Rockett; Dominic P Kwiatkowski; Taane G Clark Journal: Hum Hered Date: 2008-10-17 Impact factor: 0.444