OBJECTIVE: High-throughput gene expression profiling of tissue samples from patients with systemic sclerosis (SSc) has identified 4 "intrinsic" gene expression subsets: inflammatory, fibroproliferative, normal-like, and limited. Prior methods required agglomerative clustering of many samples. In order to classify individual patients in clinical trials or for diagnostic purposes, supervised methods that can assign single samples to molecular subsets are required. We undertook this study to introduce a novel machine learning classifier as a robust accurate intrinsic subset predictor. METHODS: Three independent gene expression cohorts were curated and merged to create a data set covering 297 skin biopsy samples from 102 unique patients and controls, which was used to train a machine learning algorithm. We performed external validation using 3 independent SSc cohorts, including a gene expression data set generated by an independent laboratory on a different microarray platform. In total, 413 skin biopsy samples from 213 individuals were analyzed in the training and testing cohorts. RESULTS: Repeated cross-fold validation identified consistent and discriminative markers using multinomial elastic net, performing with an average classification accuracy of 87.1% with high sensitivity and specificity. In external validation, the classifier achieved an average accuracy of 85.4%. Reanalyzing data from a previous study, we identified subsets of patients that represent the canonical inflammatory, fibroproliferative, and normal-like subsets. CONCLUSION: We developed a highly accurate classifier for SSc molecular subsets for individual patient samples. The method can be used in SSc clinical trials to identify an intrinsic subset on individual samples. Our method provides a robust data-driven approach to aid clinical decision-making and interpretation of heterogeneous molecular information in SSc patients.
OBJECTIVE: High-throughput gene expression profiling of tissue samples from patients with systemic sclerosis (SSc) has identified 4 "intrinsic" gene expression subsets: inflammatory, fibroproliferative, normal-like, and limited. Prior methods required agglomerative clustering of many samples. In order to classify individual patients in clinical trials or for diagnostic purposes, supervised methods that can assign single samples to molecular subsets are required. We undertook this study to introduce a novel machine learning classifier as a robust accurate intrinsic subset predictor. METHODS: Three independent gene expression cohorts were curated and merged to create a data set covering 297 skin biopsy samples from 102 unique patients and controls, which was used to train a machine learning algorithm. We performed external validation using 3 independent SSc cohorts, including a gene expression data set generated by an independent laboratory on a different microarray platform. In total, 413 skin biopsy samples from 213 individuals were analyzed in the training and testing cohorts. RESULTS: Repeated cross-fold validation identified consistent and discriminative markers using multinomial elastic net, performing with an average classification accuracy of 87.1% with high sensitivity and specificity. In external validation, the classifier achieved an average accuracy of 85.4%. Reanalyzing data from a previous study, we identified subsets of patients that represent the canonical inflammatory, fibroproliferative, and normal-like subsets. CONCLUSION: We developed a highly accurate classifier for SSc molecular subsets for individual patient samples. The method can be used in SSc clinical trials to identify an intrinsic subset on individual samples. Our method provides a robust data-driven approach to aid clinical decision-making and interpretation of heterogeneous molecular information in SSc patients.
Authors: Jessica K Gordon; Viktor Martyanov; Jennifer M Franks; Elana J Bernstein; Jackie Szymonifka; Cynthia Magro; Horatio F Wildman; Tammara A Wood; Michael L Whitfield; Robert F Spiera Journal: Arthritis Rheumatol Date: 2017-12-29 Impact factor: 10.995
Authors: Sarah A Pendergrass; Raphael Lemaire; Ian P Francis; J Matthew Mahoney; Robert Lafyatis; Michael L Whitfield Journal: J Invest Dermatol Date: 2012-02-09 Impact factor: 8.551
Authors: Jaclyn N Taroni; Casey S Greene; Viktor Martyanov; Tammara A Wood; Romy B Christmann; Harrison W Farber; Robert A Lafyatis; Christopher P Denton; Monique E Hinchcliff; Patricia A Pioli; J Matthew Mahoney; Michael L Whitfield Journal: Genome Med Date: 2017-03-23 Impact factor: 11.117
Authors: Jaclyn N Taroni; Viktor Martyanov; Chiang-Ching Huang; J Matthew Mahoney; Ikuo Hirano; Brandon Shetuni; Guang-Yu Yang; Darren Brenner; Barbara Jung; Tammara A Wood; Swati Bhattacharyya; Orit Almagor; Jungwha Lee; Arlene Sirajuddin; John Varga; Rowland W Chang; Michael L Whitfield; Monique Hinchcliff Journal: Arthritis Res Ther Date: 2015-07-29 Impact factor: 5.156
Authors: Jüri Reimand; Tambet Arak; Priit Adler; Liis Kolberg; Sulev Reisberg; Hedi Peterson; Jaak Vilo Journal: Nucleic Acids Res Date: 2016-04-20 Impact factor: 16.971
Authors: Dinesh Khanna; Cathie Spino; Sindhu Johnson; Lorinda Chung; Michael L Whitfield; Christopher P Denton; Veronica Berrocal; Jennifer Franks; Bhavan Mehta; Jerry Molitor; Virginia D Steen; Robert Lafyatis; Robert W Simms; Anna Gill; Suzanne Kafaja; Tracy M Frech; Vivien Hsu; Robyn T Domsic; Janet E Pope; Jessica K Gordon; Maureen D Mayes; Elena Schiopu; Amber Young; Nora Sandorfi; Jane Park; Faye N Hant; Elana J Bernstein; Soumya Chatterjee; Flavia V Castelino; Ali Ajam; Yue Wang; Tammara Wood; Yannick Allanore; Marco Matucci-Cerinic; Oliver Distler; Ora Singer; Erica Bush; David A Fox; Daniel E Furst Journal: Arthritis Rheumatol Date: 2019-12-10 Impact factor: 10.995
Authors: Lynette Keyes-Elstein; Ashley Pinckney; Ellen Goldmuntz; Beverly Welch; Jennifer M Franks; Viktor Martyanov; Tammara A Wood; Leslie Crofford; Maureen Mayes; Peter McSweeney; Richard Nash; George Georges; M E Csuka; Robert Simms; Daniel Furst; Dinesh Khanna; E William St Clair; Michael L Whitfield; Keith M Sullivan Journal: Arthritis Care Res (Hoboken) Date: 2021-09-17 Impact factor: 4.794
Authors: Brian Skaug; Dinesh Khanna; William R Swindell; Monique E Hinchcliff; Tracy M Frech; Virginia D Steen; Faye N Hant; Jessica K Gordon; Ami A Shah; Lisha Zhu; W Jim Zheng; Jeffrey L Browning; Alexander M S Barron; Minghua Wu; Sudha Visvanathan; Patrick Baum; Jennifer M Franks; Michael L Whitfield; Victoria K Shanmugam; Robyn T Domsic; Flavia V Castelino; Elana J Bernstein; Nancy Wareing; Marka A Lyons; Jun Ying; Julio Charles; Maureen D Mayes; Shervin Assassi Journal: Ann Rheum Dis Date: 2019-11-25 Impact factor: 19.103
Authors: Jörg H W Distler; Andrea-Hermina Györfi; Meera Ramanujam; Michael L Whitfield; Melanie Königshoff; Robert Lafyatis Journal: Nat Rev Rheumatol Date: 2019-11-11 Impact factor: 20.543
Authors: Dana E Orange; Jessica K Gordon; Kimberly Showalter; Robert Spiera; Cynthia Magro; Phaedra Agius; Viktor Martyanov; Jennifer M Franks; Roshan Sharma; Heather Geiger; Tammara A Wood; Yaxia Zhang; Caryn R Hale; Jackie Finik; Michael L Whitfield Journal: Ann Rheum Dis Date: 2020-10-07 Impact factor: 19.103