BACKGROUND: Fetal hemoglobin (HbF) is the major modifier of the clinical course of sickle cell anemia. Its levels are highly heritable, and its interpersonal variability is modulated in part by 3 quantitative trait loci that affect HbF gene expression. Genome-wide association studies have identified single-nucleotide polymorphisms (SNPs) in these quantitative trait loci that are highly associated with HbF but explain only 10% to 12% of the variance of HbF. Combining SNPs into a genetic risk score can help to explain a larger amount of the variability of HbF level, but the challenge of this approach is to select the optimal number of SNPs to be included in the genetic risk score. METHODS AND RESULTS: We developed a collection of 14 models with genetic risk score composed of different numbers of SNPs and used the ensemble of these models to predict HbF in patients with sickle cell anemia. The models were trained in 841 patients with sickle cell anemia and were tested in 3 independent cohorts. The ensemble of 14 models explained 23.4% of the variability in HbF in the discovery cohort, whereas the correlation between predicted and observed HbF in the 3 independent cohorts ranged between 0.28 and 0.44. The models included SNPs in BCL11A, the HBS1L-MYB intergenic region, and the site of the HBB gene cluster, quantitative trait loci previously associated with HbF. CONCLUSIONS: An ensemble of 14 genetic risk models can predict HbF levels with accuracy between 0.28 and 0.44, and the approach may also prove useful in other applications.
BACKGROUND: Fetal hemoglobin (HbF) is the major modifier of the clinical course of sickle cell anemia. Its levels are highly heritable, and its interpersonal variability is modulated in part by 3 quantitative trait loci that affect HbF gene expression. Genome-wide association studies have identified single-nucleotide polymorphisms (SNPs) in these quantitative trait loci that are highly associated with HbF but explain only 10% to 12% of the variance of HbF. Combining SNPs into a genetic risk score can help to explain a larger amount of the variability of HbF level, but the challenge of this approach is to select the optimal number of SNPs to be included in the genetic risk score. METHODS AND RESULTS: We developed a collection of 14 models with genetic risk score composed of different numbers of SNPs and used the ensemble of these models to predict HbF in patients with sickle cell anemia. The models were trained in 841 patients with sickle cell anemia and were tested in 3 independent cohorts. The ensemble of 14 models explained 23.4% of the variability in HbF in the discovery cohort, whereas the correlation between predicted and observed HbF in the 3 independent cohorts ranged between 0.28 and 0.44. The models included SNPs in BCL11A, the HBS1L-MYB intergenic region, and the site of the HBB gene cluster, quantitative trait loci previously associated with HbF. CONCLUSIONS: An ensemble of 14 genetic risk models can predict HbF levels with accuracy between 0.28 and 0.44, and the approach may also prove useful in other applications.
Authors: Paola Sebastiani; Marco F Ramoni; Vikki Nolan; Clinton T Baldwin; Martin H Steinberg Journal: Nat Genet Date: 2005-03-20 Impact factor: 38.330
Authors: Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich Journal: Nat Genet Date: 2006-07-23 Impact factor: 38.330
Authors: Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio Journal: Proc Natl Acad Sci U S A Date: 2009-05-27 Impact factor: 11.205
Authors: M H Steinberg; H Hsu; R L Nagel; P F Milner; J G Adams; L Benjamin; S Fryd; P Gillette; J Gilman; O Josifovska Journal: Am J Hematol Date: 1995-03 Impact factor: 10.047
Authors: Aaron R Folsom; Lloyd E Chambless; Christie M Ballantyne; Josef Coresh; Gerardo Heiss; Kenneth K Wu; Eric Boerwinkle; Thomas H Mosley; Paul Sorlie; Guoqing Diao; A Richey Sharrett Journal: Arch Intern Med Date: 2006-07-10
Authors: Amanda E Sedgewick; Nadia Timofeev; Paola Sebastiani; Jason C C So; Edmond S K Ma; Li Chong Chan; Goonnapa Fucharoen; Supan Fucharoen; Cynara G Barbosa; Badri N Vardarajan; Lindsay A Farrer; Clinton T Baldwin; Martin H Steinberg; David H K Chui Journal: Blood Cells Mol Dis Date: 2008-08-08 Impact factor: 3.039
Authors: Zhi Wei; Kai Wang; Hui-Qi Qu; Haitao Zhang; Jonathan Bradfield; Cecilia Kim; Edward Frackleton; Cuiping Hou; Joseph T Glessner; Rosetta Chiavacci; Charles Stanley; Dimitri Monos; Struan F A Grant; Constantin Polychronakos; Hakon Hakonarson Journal: PLoS Genet Date: 2009-10-09 Impact factor: 5.917
Authors: Kate Gardner; Tony Fulford; Nicholas Silver; Helen Rooks; Nikolaos Angelis; Marlene Allman; Siana Nkya; Julie Makani; Jo Howard; Rachel Kesse-Adu; David C Rees; Sara Stuart-Smith; Tullie Yeghen; Moji Awogbade; Raphael Z Sangeda; Josephine Mgaya; Hamel Patel; Stephen Newhouse; Stephan Menzel; Swee Lay Thein Journal: Blood Adv Date: 2018-02-13
Authors: Paula J Griffin; Paola Sebastiani; Heather Edward; Clinton T Baldwin; Mark T Gladwin; Victor R Gordeuk; David H K Chui; Martin H Steinberg Journal: Am J Hematol Date: 2014-08-04 Impact factor: 10.047
Authors: Swati Padhee; Amanuel Alambo; Tanvi Banerjee; Arvind Subramaniam; Daniel M Abrams; Gary K Nave; Nirmish Shah Journal: Pattern Recognit (2021) Date: 2021-02-23
Authors: Li Liu; Alexander Pertsemlidis; Liang-Hao Ding; Michael D Story; Martin H Steinberg; Paola Sebastiani; Carolyn Hoppe; Samir K Ballas; Betty S Pace Journal: Exp Biol Med (Maywood) Date: 2016-03-27
Authors: Mengtian Du; Sarah Van Ness; Victor Gordeuk; Sayed M Nouraie; Sergei Nekhai; Mark Gladwin; Martin H Steinberg; Paola Sebastiani Journal: Blood Cells Mol Dis Date: 2018-05-16 Impact factor: 3.039