| Literature DB >> 34889978 |
Chiara Fallerini1,2, Nicola Picchiotti3,4, Margherita Baldassarri1,2, Francesca Mari1,2,5, Alessandra Renieri6,7,8,9, Simone Furini1, Kristina Zguro1, Sergio Daga1,2, Francesca Fava1,2,5, Elisa Benetti1, Sara Amitrano5, Mirella Bruttini1,2,5, Maria Palmieri1,2, Susanna Croci1,2, Mirjam Lista1,2, Giada Beligni1,2, Floriana Valentino1,2, Ilaria Meloni1,2, Marco Tanfoni3, Francesca Minnai10, Francesca Colombo10, Enrico Cabri11, Maddalena Fratelli11, Chiara Gabbi12, Stefania Mantovani13, Elisa Frullanti1,2, Marco Gori3,14, Francis P Crawley15, Guillaume Butler-Laporte16,17, Brent Richards16,18,19, Hugo Zeberg20, Miklos Lipcsey21,22, Michael Hultström21,23, Kerstin U Ludwig24, Eva C Schulte25,26,27, Erola Pairo-Castineira28,29, John Kenneth Baillie28,29,30, Axel Schmidt24, Robert Frithiof21.
Abstract
The combined impact of common and rare exonic variants in COVID-19 host genetics is currently insufficiently understood. Here, common and rare variants from whole-exome sequencing data of about 4000 SARS-CoV-2-positive individuals were used to define an interpretable machine-learning model for predicting COVID-19 severity. First, variants were converted into separate sets of Boolean features, depending on the absence or the presence of variants in each gene. An ensemble of LASSO logistic regression models was used to identify the most informative Boolean features with respect to the genetic bases of severity. The Boolean features selected by these logistic models were combined into an Integrated PolyGenic Score that offers a synthetic and interpretable index for describing the contribution of host genetics in COVID-19 severity, as demonstrated through testing in several independent cohorts. Selected features belong to ultra-rare, rare, low-frequency, and common variants, including those in linkage disequilibrium with known GWAS loci. Noteworthily, around one quarter of the selected genes are sex-specific. Pathway analysis of the selected genes associated with COVID-19 severity reflected the multi-organ nature of the disease. The proposed model might provide useful information for developing diagnostics and therapeutics, while also being able to guide bedside disease management.Entities:
Mesh:
Year: 2021 PMID: 34889978 PMCID: PMC8661833 DOI: 10.1007/s00439-021-02397-7
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 4.132