| Literature DB >> 30715275 |
Helen Attrill1, Pascale Gaudet2, Rachael P Huntley3, Ruth C Lovering3, Stacia R Engel4, Sylvain Poux5, Kimberly M Van Auken6, George Georghiou7, Marcus C Chibucos8, Tanya Z Berardini9, Valerie Wood10, Harold Drabkin11, Petra Fey12, Penelope Garmiri7, Midori A Harris10, Tony Sawford7, Leonore Reiser9, Rebecca Tauber8, Sabrina Toro13.
Abstract
High-throughput studies constitute an essential and valued source of information for researchers. However, high-throughput experimental workflows are often complex, with multiple data sets that may contain large numbers of false positives. The representation of high-throughput data in the Gene Ontology (GO) therefore presents a challenging annotation problem, when the overarching goal of GO curation is to provide the most precise view of a gene's role in biology. To address this, representatives from annotation teams within the GO Consortium reviewed high-throughput data annotation practices. We present an annotation framework for high-throughput studies that will facilitate good standards in GO curation and, through the use of new high-throughput evidence codes, increase the visibility of these annotations to the research community.Entities:
Mesh:
Year: 2019 PMID: 30715275 PMCID: PMC6355445 DOI: 10.1093/database/baz007
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
High-throughput evidence codes
| ECO ID | Term name | Three-letter GO abbreviation | LTP equivalent ECO ID | LTP equivalent term name | Three-letter GO abbreviation |
|---|---|---|---|---|---|
| ECO:0006056 | High throughput evidence used in manual assertion | HTP | ECO:0000269 | Experimental evidence used in manual assertion | EXP |
| ECO:0007001 | High throughput mutant phenotype evidence used in manual assertion | HMP | ECO:0000315 | Mutant phenotype evidence used in manual assertion | IMP |
| ECO:0007003 | High throughput genetic interaction evidence used in manual assertion | HGI | ECO:0000316 | Genetic interaction evidence used in manual assertion | IGI |
| ECO:0007005 | High throughput direct assay evidence used in manual assertion | HDA | ECO:0000314 | Direct assay evidence used in manual assertion | IDA |
| ECO:0007007 | High throughput expression pattern evidence used in manual assertion | HEP | ECO:0000270 | Expression pattern evidence used in manual assertion | IEP |
Five new high-throughput evidence codes added to the Evidence and Conclusion Ontology for GO curation are shown in the first two columns (ECO identifier and term name). These are shown next to the equivalent conventional (labeled LTP) term. The standard GO three-letter abbreviation is shown for each code (EXP (Inferred from Experiment), IDA (Inferred from Direct Assay), IMP (Inferred from Mutant Phenotype), IGI (Inferred from Genetic Interaction) and IEP (Inferred from Expression Pattern)). The new high-throughput evidence codes were chosen specifically to mirror those used in conventional GO annotation, so that retrofit of data would be straightforward—many databases use the conventional GO evidence codes and the standard GO annotation exchange file, the gene association file, uses the three-letter GO abbreviation (http://geneontology.org/page/go-annotation-file-formats). Going forward, as groups adopt the gene product association data exchange format for GO annotations, which directly uses ECO codes, there is scope to expand the set of high-throughput evidence codes used by GO curators to capture the specific types of assay used in high-throughput studies, such as immunofluorescence confocal microscopy or functional complementation evidence. Curators that wish to expand the set of high-throughput ECO codes should submit requests directly to ECO http://www.evidenceontology.org/submit_term_request/. Abbreviations: Evidence and Conclusion Ontology identifier (ECO ID), low throughput (LTP).
Figure 1High-throughput evidence codes to support GO curation. The five new evidence codes added to the Evidence and Conclusion Ontology for GO curation are shown in red boxes: the parent term `high throughput evidence used in manual assertion’ (ECO:0006056) and four child terms. Is_a relationships with other classes in the Evidence & Conclusion Ontology are shown, including the evidence equivalents used in conventional GO annotation (yellow boxes). The new high-throughput evidence codes should be used by GO curators when annotating high-throughput data in accordance with GOC annotation guidelines. The graph layout was generated using the Ontology Lookup Service (https://www.ebi.ac.uk/ols/index) OLS-graphview.