Literature DB >> 28087895

Crohn disease risk prediction-Best practices and pitfalls with exome data.

Manuel Giollo1, David T Jones1, Marco Carraro2, Emanuela Leonardi3, Carlo Ferrari4, Silvio C E Tosatto3,5.   

Abstract

The Critical Assessment of Genome Interpretation (CAGI) experiment is the first attempt to evaluate the state-of-the-art in genetic data interpretation. Among the proposed challenges, Crohn disease (CD) risk prediction has become the most classic problem spanning three editions. The scientific question is very hard: can anybody assess the risk to develop CD given the exome data alone? This is one of the ultimate goals of genetic analysis, which motivated most CAGI participants to look for powerful new methods. In the 2016 CD challenge, we implemented all the best methods proposed in the past editions. This resulted in 10 algorithms, which were evaluated fairly by CAGI organizers. We also used all the data available from CAGI 11 and 13 to maximize the amount of training samples. The most effective algorithms used known genes associated with CD from the literature. No method could evaluate effectively the importance of unannotated variants by using heuristics. As a downside, all CD datasets were strongly affected by sample stratification. This affected the performance reported by assessors. Therefore, we expect that future datasets will be normalized in order to remove population effects. This will improve methods comparison and promote algorithms focused on causal variants discovery.
© 2017 Wiley Periodicals, Inc.

Entities:  

Keywords:  Crohn disease; SNV evaluation; disease risk prediction; exome data; genetic analysis; linear models; machine learning; methods comparison; next-generation sequencing; variants prioritization

Mesh:

Year:  2017        PMID: 28087895      PMCID: PMC5509518          DOI: 10.1002/humu.23177

Source DB:  PubMed          Journal:  Hum Mutat        ISSN: 1059-7794            Impact factor:   4.878


  31 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease.

Authors:  Jeffrey C Barrett; Sarah Hansoul; Dan L Nicolae; Judy H Cho; Richard H Duerr; John D Rioux; Steven R Brant; Mark S Silverberg; Kent D Taylor; M Michael Barmada; Alain Bitton; Themistocles Dassopoulos; Lisa Wu Datta; Todd Green; Anne M Griffiths; Emily O Kistner; Michael T Murtha; Miguel D Regueiro; Jerome I Rotter; L Philip Schumm; A Hillary Steinhart; Stephan R Targan; Ramnik J Xavier; Cécile Libioulle; Cynthia Sandor; Mark Lathrop; Jacques Belaiche; Olivier Dewit; Ivo Gut; Simon Heath; Debby Laukens; Myriam Mni; Paul Rutgeerts; André Van Gossum; Diana Zelenika; Denis Franchimont; Jean-Pierre Hugot; Martine de Vos; Severine Vermeire; Edouard Louis; Lon R Cardon; Carl A Anderson; Hazel Drummond; Elaine Nimmo; Tariq Ahmad; Natalie J Prescott; Clive M Onnie; Sheila A Fisher; Jonathan Marchini; Jilur Ghori; Suzannah Bumpstead; Rhian Gwilliam; Mark Tremelling; Panos Deloukas; John Mansfield; Derek Jewell; Jack Satsangi; Christopher G Mathew; Miles Parkes; Michel Georges; Mark J Daly
Journal:  Nat Genet       Date:  2008-06-29       Impact factor: 38.330

3.  Incidence of inflammatory bowel disease across Europe: is there a difference between north and south? Results of the European Collaborative Study on Inflammatory Bowel Disease (EC-IBD).

Authors:  S Shivananda; J Lennard-Jones; R Logan; N Fear; A Price; L Carpenter; M van Blankenstein
Journal:  Gut       Date:  1996-11       Impact factor: 23.059

4.  Bioconductor: open software development for computational biology and bioinformatics.

Authors:  Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang
Journal:  Genome Biol       Date:  2004-09-15       Impact factor: 13.583

5.  Phenotype-Genotype Integrator (PheGenI): synthesizing genome-wide association study (GWAS) data with existing genomic resources.

Authors:  Erin M Ramos; Douglas Hoffman; Heather A Junkins; Donna Maglott; Lon Phan; Stephen T Sherry; Mike Feolo; Lucia A Hindorff
Journal:  Eur J Hum Genet       Date:  2013-05-22       Impact factor: 4.246

6.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

7.  Phenopedia and Genopedia: disease-centered and gene-centered views of the evolving knowledge of human genetic associations.

Authors:  W Yu; M Clyne; M J Khoury; M Gwinn
Journal:  Bioinformatics       Date:  2009-10-27       Impact factor: 6.937

8.  Association between variants of PRDM1 and NDP52 and Crohn's disease, based on exome sequencing and functional studies.

Authors:  David Ellinghaus; Hu Zhang; Sebastian Zeissig; Simone Lipinski; Andreas Till; Tao Jiang; Björn Stade; Yana Bromberg; Eva Ellinghaus; Andreas Keller; Manuel A Rivas; Jurgita Skieceviciene; Nadezhda T Doncheva; Xiao Liu; Qing Liu; Fuman Jiang; Michael Forster; Gabriele Mayr; Mario Albrecht; Robert Häsler; Bernhard O Boehm; Jane Goodall; Carlo R Berzuini; James Lee; Vibeke Andersen; Ulla Vogel; Limas Kupcinskas; Manfred Kayser; Michael Krawczak; Susanna Nikolaus; Rinse K Weersma; Cyriel Y Ponsioen; Miquel Sans; Cisca Wijmenga; David P Strachan; Wendy L McArdle; Séverine Vermeire; Paul Rutgeerts; Jeremy D Sanderson; Christopher G Mathew; Morten H Vatn; Jun Wang; Markus M Nöthen; Richard H Duerr; Carsten Büning; Stephan Brand; Jürgen Glas; Juliane Winkelmann; Thomas Illig; Anna Latiano; Vito Annese; Jonas Halfvarson; Mauro D'Amato; Mark J Daly; Michael Nothnagel; Tom H Karlsen; Suresh Subramani; Philip Rosenstiel; Stefan Schreiber; Miles Parkes; Andre Franke
Journal:  Gastroenterology       Date:  2013-04-25       Impact factor: 22.682

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  Combining information from common type 2 diabetes risk polymorphisms improves disease prediction.

Authors:  Michael N Weedon; Mark I McCarthy; Graham Hitman; Mark Walker; Christopher J Groves; Eleftheria Zeggini; N William Rayner; Beverley Shields; Katharine R Owen; Andrew T Hattersley; Timothy M Frayling
Journal:  PLoS Med       Date:  2006-10       Impact factor: 11.069

View more
  6 in total

Review 1.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09

2.  Reports from CAGI: The Critical Assessment of Genome Interpretation.

Authors:  Roger A Hoskins; Susanna Repo; Daniel Barsky; Gaia Andreoletti; John Moult; Steven E Brenner
Journal:  Hum Mutat       Date:  2017-09       Impact factor: 4.878

Review 3.  Genome interpretation using in silico predictors of variant impact.

Authors:  Panagiotis Katsonis; Kevin Wilhelm; Amanda Williams; Olivier Lichtarge
Journal:  Hum Genet       Date:  2022-04-30       Impact factor: 5.881

4.  A Systematic Review of Artificial Intelligence and Machine Learning Applications to Inflammatory Bowel Disease, with Practical Guidelines for Interpretation.

Authors:  Imogen S Stafford; Mark M Gosink; Enrico Mossotto; Sarah Ennis; Manfred Hauben
Journal:  Inflamm Bowel Dis       Date:  2022-10-03       Impact factor: 7.290

Review 5.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09

Review 6.  Machine Learning Modeling from Omics Data as Prospective Tool for Improvement of Inflammatory Bowel Disease Diagnosis and Clinical Classifications.

Authors:  Biljana Stankovic; Nikola Kotur; Gordana Nikcevic; Vladimir Gasic; Branka Zukic; Sonja Pavlovic
Journal:  Genes (Basel)       Date:  2021-09-18       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.