MOTIVATION: Genome-wide association studies (GWASs) are effective for describing genetic complexities of common diseases. Phenome-wide association studies (PheWASs) offer an alternative and complementary approach to GWAS using data embedded in the electronic health record (EHR) to define the phenome. International Classification of Disease version 9 (ICD9) codes are used frequently to define the phenome, but using ICD9 codes alone misses other clinically relevant information from the EHR that can be used for PheWAS analyses and discovery. RESULTS: As an alternative to ICD9 coding, a text-based phenome was defined by 23 384 clinically relevant terms extracted from Marshfield Clinic's EHR. Five single nucleotide polymorphisms (SNPs) with known phenotypic associations were genotyped in 4235 individuals and associated across the text-based phenome. All five SNPs genotyped were associated with expected terms (P<0.02), most at or near the top of their respective PheWAS ranking. Raw association results indicate that text data performed equivalently to ICD9 coding and demonstrate the utility of information beyond ICD9 coding for application in PheWAS.
MOTIVATION: Genome-wide association studies (GWASs) are effective for describing genetic complexities of common diseases. Phenome-wide association studies (PheWASs) offer an alternative and complementary approach to GWAS using data embedded in the electronic health record (EHR) to define the phenome. International Classification of Disease version 9 (ICD9) codes are used frequently to define the phenome, but using ICD9 codes alone misses other clinically relevant information from the EHR that can be used for PheWAS analyses and discovery. RESULTS: As an alternative to ICD9 coding, a text-based phenome was defined by 23 384 clinically relevant terms extracted from Marshfield Clinic's EHR. Five single nucleotide polymorphisms (SNPs) with known phenotypic associations were genotyped in 4235 individuals and associated across the text-based phenome. All five SNPs genotyped were associated with expected terms (P<0.02), most at or near the top of their respective PheWAS ranking. Raw association results indicate that text data performed equivalently to ICD9 coding and demonstrate the utility of information beyond ICD9 coding for application in PheWAS.
Authors: A L Goldberger; L A Amaral; L Glass; J M Hausdorff; P C Ivanov; R G Mark; J E Mietus; G B Moody; C K Peng; H E Stanley Journal: Circulation Date: 2000-06-13 Impact factor: 29.690
Authors: Mark I McCarthy; Gonçalo R Abecasis; Lon R Cardon; David B Goldstein; Julian Little; John P A Ioannidis; Joel N Hirschhorn Journal: Nat Rev Genet Date: 2008-05 Impact factor: 53.242
Authors: Catherine A McCarty; Donna Chapman-Stone; Teresa Derfus; Philip F Giampietro; Norman Fost Journal: Am J Med Genet A Date: 2008-12-01 Impact factor: 2.802
Authors: Ishna Neamatullah; Margaret M Douglass; Li-wei H Lehman; Andrew Reisner; Mauricio Villarroel; William J Long; Peter Szolovits; George B Moody; Roger G Mark; Gari D Clifford Journal: BMC Med Inform Decis Mak Date: 2008-07-24 Impact factor: 2.796
Authors: Xiayuan Huang; Robert C Elston; Guilherme J Rosa; John Mayer; Zhan Ye; Terrie Kitchner; Murray H Brilliant; David Page; Scott J Hebbring Journal: Bioinformatics Date: 2018-02-15 Impact factor: 6.937
Authors: Anurag Verma; Anastasia Lucas; Shefali S Verma; Yu Zhang; Navya Josyula; Anqa Khan; Dustin N Hartzel; Daniel R Lavage; Joseph Leader; Marylyn D Ritchie; Sarah A Pendergrass Journal: Am J Hum Genet Date: 2018-03-29 Impact factor: 11.025
Authors: Elizabeth Blue; Tin L Louie; Jessica X Chong; Scott J Hebbring; Kathleen C Barnes; Nicholas M Rafaels; Michael R Knowles; Ronald L Gibson; Michael J Bamshad; Mary J Emond Journal: Ann Am Thorac Soc Date: 2018-04
Authors: Pedro L Teixeira; Wei-Qi Wei; Robert M Cronin; Huan Mo; Jacob P VanHouten; Robert J Carroll; Eric LaRose; Lisa A Bastarache; S Trent Rosenbloom; Todd L Edwards; Dan M Roden; Thomas A Lasko; Richard A Dart; Anne M Nikolai; Peggy L Peissig; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2016-08-07 Impact factor: 4.497