Literature DB >> 31210270

Using association rule mining and ontologies to generate metadata recommendations from multiple biomedical databases.

Marcos Martínez-Romero1, Martin J O'Connor1, Attila L Egyedi1, Debra Willrett1, Josef Hardi1, John Graybeal1, Mark A Musen1.   

Abstract

Metadata-the machine-readable descriptions of the data-are increasingly seen as crucial for describing the vast array of biomedical datasets that are currently being deposited in public repositories. While most public repositories have firm requirements that metadata must accompany submitted datasets, the quality of those metadata is generally very poor. A key problem is that the typical metadata acquisition process is onerous and time consuming, with little interactive guidance or assistance provided to users. Secondary problems include the lack of validation and sparse use of standardized terms or ontologies when authoring metadata. There is a pressing need for improvements to the metadata acquisition process that will help users to enter metadata quickly and accurately. In this paper, we outline a recommendation system for metadata that aims to address this challenge. Our approach uses association rule mining to uncover hidden associations among metadata values and to represent them in the form of association rules. These rules are then used to present users with real-time recommendations when authoring metadata. The novelties of our method are that it is able to combine analyses of metadata from multiple repositories when generating recommendations and can enhance those recommendations by aligning them with ontology terms. We implemented our approach as a service integrated into the CEDAR Workbench metadata authoring platform, and evaluated it using metadata from two public biomedical repositories: US-based National Center for Biotechnology Information BioSample and European Bioinformatics Institute BioSamples. The results show that our approach is able to use analyses of previously entered metadata coupled with ontology-based mappings to present users with accurate recommendations when authoring metadata.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2019        PMID: 31210270      PMCID: PMC6866600          DOI: 10.1093/database/baz059

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


  15 in total

1.  Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations.

Authors:  Marcos Martínez-Romero; Martin J O'Connor; Ravi D Shankar; Maryam Panahiazar; Debra Willrett; Attila L Egyedi; Olivier Gevaert; John Graybeal; Mark A Musen
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  RightField: embedding ontology annotation in spreadsheets.

Authors:  Katy Wolstencroft; Stuart Owen; Matthew Horridge; Olga Krebs; Wolfgang Mueller; Jacky L Snoep; Franco du Preez; Carole Goble
Journal:  Bioinformatics       Date:  2011-05-26       Impact factor: 6.937

3.  Annotare--a tool for annotating high-throughput biomedical investigations and resulting data.

Authors:  Ravi Shankar; Helen Parkinson; Tony Burdett; Emma Hastings; Junmin Liu; Michael Miller; Rashmi Srinivasa; Joseph White; Alvis Brazma; Gavin Sherlock; Christian J Stoeckert; Catherine A Ball
Journal:  Bioinformatics       Date:  2010-08-23       Impact factor: 6.937

4.  The center for expanded data annotation and retrieval.

Authors:  Mark A Musen; Carol A Bean; Kei-Hoi Cheung; Michel Dumontier; Kim A Durante; Olivier Gevaert; Alejandra Gonzalez-Beltran; Purvesh Khatri; Steven H Kleinstein; Martin J O'Connor; Yannick Pouliot; Philippe Rocca-Serra; Susanna-Assunta Sansone; Jeffrey A Wiser
Journal:  J Am Med Inform Assoc       Date:  2015-06-25       Impact factor: 4.497

5.  Who shares? Who doesn't? Factors associated with openly archiving raw research data.

Authors:  Heather A Piwowar
Journal:  PLoS One       Date:  2011-07-13       Impact factor: 3.240

6.  The open biomedical annotator.

Authors:  Clement Jonquet; Nigam H Shah; Mark A Musen
Journal:  Summit Transl Bioinform       Date:  2009-03-01

7.  A sea of standards for omics data: sink or swim?

Authors:  Jessica D Tenenbaum; Susanna-Assunta Sansone; Melissa Haendel
Journal:  J Am Med Inform Assoc       Date:  2013-09-27       Impact factor: 4.497

8.  The variable quality of metadata about biological samples used in biomedical experiments.

Authors:  Rafael S Gonçalves; Mark A Musen
Journal:  Sci Data       Date:  2019-02-19       Impact factor: 6.444

9.  The FAIR Guiding Principles for scientific data management and stewardship.

Authors:  Mark D Wilkinson; Michel Dumontier; I Jsbrand Jan Aalbersberg; Gabrielle Appleton; Myles Axton; Arie Baak; Niklas Blomberg; Jan-Willem Boiten; Luiz Bonino da Silva Santos; Philip E Bourne; Jildau Bouwman; Anthony J Brookes; Tim Clark; Mercè Crosas; Ingrid Dillo; Olivier Dumon; Scott Edmunds; Chris T Evelo; Richard Finkers; Alejandra Gonzalez-Beltran; Alasdair J G Gray; Paul Groth; Carole Goble; Jeffrey S Grethe; Jaap Heringa; Peter A C 't Hoen; Rob Hooft; Tobias Kuhn; Ruben Kok; Joost Kok; Scott J Lusher; Maryann E Martone; Albert Mons; Abel L Packer; Bengt Persson; Philippe Rocca-Serra; Marco Roos; Rene van Schaik; Susanna-Assunta Sansone; Erik Schultes; Thierry Sengstag; Ted Slater; George Strawn; Morris A Swertz; Mark Thompson; Johan van der Lei; Erik van Mulligen; Jan Velterop; Andra Waagmeester; Peter Wittenburg; Katherine Wolstencroft; Jun Zhao; Barend Mons
Journal:  Sci Data       Date:  2016-03-15       Impact factor: 6.444

10.  Reproducibility and Reuse of Adaptive Immune Receptor Repertoire Data.

Authors:  Felix Breden; Eline T Luning Prak; Bjoern Peters; Florian Rubelt; Chaim A Schramm; Christian E Busse; Jason A Vander Heiden; Scott Christley; Syed Ahmad Chan Bukhari; Adrian Thorogood; Frederick A Matsen Iv; Yariv Wine; Uri Laserson; David Klatzmann; Daniel C Douek; Marie-Paule Lefranc; Andrew M Collins; Tania Bubela; Steven H Kleinstein; Corey T Watson; Lindsay G Cowell; Jamie K Scott; Thomas B Kepler
Journal:  Front Immunol       Date:  2017-11-01       Impact factor: 8.786

View more
  3 in total

1.  SKIOME Project: a curated collection of skin microbiome datasets enriched with study-related metadata.

Authors:  Giulia Agostinetto; Davide Bozzi; Danilo Porro; Maurizio Casiraghi; Massimo Labra; Antonia Bruno
Journal:  Database (Oxford)       Date:  2022-05-16       Impact factor: 4.462

Review 2.  Understanding the Nature of Metadata: Systematic Review.

Authors:  Hannes Ulrich; Ann-Kristin Kock-Schoppenhauer; Noemi Deppenwiese; Robert Gött; Jori Kern; Martin Lablans; Raphael W Majeed; Mark R Stöhr; Jürgen Stausberg; Julian Varghese; Martin Dugas; Josef Ingenerf
Journal:  J Med Internet Res       Date:  2022-01-11       Impact factor: 5.428

3.  An open-source framework for neuroscience metadata management applied to digital reconstructions of neuronal morphology.

Authors:  Kayvan Bijari; Masood A Akram; Giorgio A Ascoli
Journal:  Brain Inform       Date:  2020-03-26
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.