Literature DB >> 29218890

Democratizing data science through data science training.

John Darrell Van Horn1, Lily Fierro, Jeana Kamdar, Jonathan Gordon, Crystal Stewart, Avnish Bhattrai, Sumiko Abe, Xiaoxiao Lei, Caroline O'Driscoll, Aakanchha Sinha, Priyambada Jain, Gully Burns, Kristina Lerman, José Luis Ambite.   

Abstract

The biomedical sciences have experienced an explosion of data which promises to overwhelm many current practitioners. Without easy access to data science training resources, biomedical researchers may find themselves unable to wrangle their own datasets. In 2014, to address the challenges posed such a data onslaught, the National Institutes of Health (NIH) launched the Big Data to Knowledge (BD2K) initiative. To this end, the BD2K Training Coordinating Center (TCC; bigdatau.org) was funded to facilitate both in-person and online learning, and open up the concepts of data science to the widest possible audience. Here, we describe the activities of the BD2K TCC and its focus on the construction of the Educational Resource Discovery Index (ERuDIte), which identifies, collects, describes, and organizes online data science materials from BD2K awardees, open online courses, and videos from scientific lectures and tutorials. ERuDIte now indexes over 9,500 resources. Given the richness of online training materials and the constant evolution of biomedical data science, computational methods applying information retrieval, natural language processing, and machine learning techniques are required - in effect, using data science to inform training in data science. In so doing, the TCC seeks to democratize novel insights and discoveries brought forth via large-scale data science training.

Entities:  

Mesh:

Year:  2018        PMID: 29218890      PMCID: PMC5731238     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  9 in total

1.  Training the translational scientist.

Authors:  Rebecca D Jackson; Sherine Gabriel; Anne Pariser; Peter Feig
Journal:  Sci Transl Med       Date:  2010-12-22       Impact factor: 17.956

2.  The NIH Big Data to Knowledge (BD2K) initiative.

Authors:  Philip E Bourne; Vivien Bonazzi; Michelle Dunn; Eric D Green; Mark Guyer; George Komatsoulis; Jennie Larkin; Beth Russell
Journal:  J Am Med Inform Assoc       Date:  2015-11       Impact factor: 4.497

3.  Sparsey™: event recognition via deep hierarchical sparse distributed codes.

Authors:  Gerard J Rinkus
Journal:  Front Comput Neurosci       Date:  2014-12-15       Impact factor: 2.380

4.  Data science, learning, and applications to biomedical and health sciences.

Authors:  Nabil R Adam; Robert Wieder; Debopriya Ghosh
Journal:  Ann N Y Acad Sci       Date:  2017-01       Impact factor: 5.691

5.  Human neuroimaging as a "Big Data" science.

Authors:  John Darrell Van Horn; Arthur W Toga
Journal:  Brain Imaging Behav       Date:  2014-06       Impact factor: 3.978

6.  THE TRAINING OF NEXT GENERATION DATA SCIENTISTS IN BIOMEDICINE.

Authors:  Lana X Garmire; Stephen Gliske; Quynh C Nguyen; Jonathan H Chen; Shamim Nemati; John D VAN Horn; Jason H Moore; Carol Shreffler; Michelle Dunn
Journal:  Pac Symp Biocomput       Date:  2017

7.  The National Institutes of Health's Big Data to Knowledge (BD2K) initiative: capitalizing on biomedical big data.

Authors:  Ronald Margolis; Leslie Derr; Michelle Dunn; Michael Huerta; Jennie Larkin; Jerry Sheehan; Mark Guyer; Eric D Green
Journal:  J Am Med Inform Assoc       Date:  2014-07-09       Impact factor: 4.497

8.  Large-scale physical activity data reveal worldwide activity inequality.

Authors:  Tim Althoff; Rok Sosič; Jennifer L Hicks; Abby C King; Scott L Delp; Jure Leskovec
Journal:  Nature       Date:  2017-07-10       Impact factor: 49.962

9.  Building the biomedical data science workforce.

Authors:  Michelle C Dunn; Philip E Bourne
Journal:  PLoS Biol       Date:  2017-07-17       Impact factor: 8.029

  9 in total
  6 in total

Review 1.  Using 'collective omics data' for biomedical research training.

Authors:  Damien Chaussabel; Darawan Rinchai
Journal:  Immunology       Date:  2018-05-30       Impact factor: 7.397

2.  Big Data Science Training Program at a Minority Serving Institution: Processes and Initial Outcomes.

Authors:  Archana Jaiswal McEligot; Math P Cuajungco; Sam Behseta; Laura Chandler; Harmanpreet Chauhan; Sinjini Mitra; Pimbucha Rusmevichientong; Shana Charles
Journal:  Calif J Health Promot       Date:  2018

3.  Bridging the Brain and Data Sciences.

Authors:  John Darrell Van Horn
Journal:  Big Data       Date:  2020-11-18       Impact factor: 4.426

4.  The Mastery Rubric for Bioinformatics: A tool to support design and evaluation of career-spanning education and training.

Authors:  Rochelle E Tractenberg; Jessica M Lindvall; Teresa K Attwood; Allegra Via
Journal:  PLoS One       Date:  2019-11-26       Impact factor: 3.240

5.  A behind-the-scenes tour of the IEDB curation process: an optimized process empirically integrating automation and human curation efforts.

Authors:  Nima Salimi; Lindy Edwards; Gabriele Foos; Jason A Greenbaum; Sheridan Martini; Brian Reardon; Deborah Shackelford; Randi Vita; Leora Zalman; Bjoern Peters; Alessandro Sette
Journal:  Immunology       Date:  2020-07-26       Impact factor: 7.397

Review 6.  Opportunities and Challenges in Democratizing Immunology Datasets.

Authors:  Sanchita Bhattacharya; Zicheng Hu; Atul J Butte
Journal:  Front Immunol       Date:  2021-04-16       Impact factor: 7.561

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.