| Literature DB >> 32615639 |
Nima Salimi1, Lindy Edwards1, Gabriele Foos1, Jason A Greenbaum1, Sheridan Martini1, Brian Reardon1, Deborah Shackelford1, Randi Vita1, Leora Zalman1, Bjoern Peters1,2, Alessandro Sette1,2.
Abstract
The Immune Epitope Database and Analysis Resource (IEDB) provides the scientific community with open access to epitope data, as well as epitope prediction and analysis tools. The IEDB houses the most extensive collection of experimentally validated B-cell and T-cell epitope data, sourced primarily from published literature by expert curation. The data procurement requires systematic identification, categorization, curation and quality-checking processes. Here, we provide insights into these processes, with particular focus on the dividends they have paid in terms of attaining project milestones, as well as how objective analyses of our processes have identified opportunities for process optimization. These experiences are shared as a case study of the benefits of process implementation and review in biomedical big data, as well as to encourage idea-sharing among players in this ever-growing space.Entities:
Keywords: B cell; T cell; curation; database; epitope
Mesh:
Substances:
Year: 2020 PMID: 32615639 PMCID: PMC7496777 DOI: 10.1111/imm.13234
Source DB: PubMed Journal: Immunology ISSN: 0019-2805 Impact factor: 7.397
Figure 1The IEDB curation process consists of four high‐level steps, each with its own specific sub‐steps designed to ensure data quality and consistency.
Figure 2The assay‐centric, contextual representation of epitope data is rooted in an ontologically based curation system enabling the Curator to extract and input data from text and figures into defined data fields.
Figure 3The distribution of effort required for the major steps in the curation workflow, based on compiled input from IEDB staff. This analysis was used to target steps for process optimization.