| Literature DB >> 31744546 |
Naihui Zhou1,2, Yuxiang Jiang3, Timothy R Bergquist4, Alexandra J Lee5, Balint Z Kacsoh6,7, Alex W Crocker8, Kimberley A Lewis8, George Georghiou9, Huy N Nguyen1,10, Md Nafiz Hamid1,2, Larry Davis2, Tunca Dogan11,12, Volkan Atalay13, Ahmet S Rifaioglu13,14, Alperen Dalkıran13, Rengul Cetin Atalay15, Chengxin Zhang16, Rebecca L Hurto17, Peter L Freddolino16,17, Yang Zhang16,17, Prajwal Bhat18, Fran Supek19,20, José M Fernández21,22, Branislava Gemovic23, Vladimir R Perovic23, Radoslav S Davidović23, Neven Sumonja23, Nevena Veljkovic23, Ehsaneddin Asgari24,25, Mohammad R K Mofrad26, Giuseppe Profiti27,28, Castrense Savojardo27, Pier Luigi Martelli27, Rita Casadio27, Florian Boecker29, Heiko Schoof30, Indika Kahanda31, Natalie Thurlby32, Alice C McHardy33,34, Alexandre Renaux35,36,37, Rabie Saidi12, Julian Gough38, Alex A Freitas39, Magdalena Antczak40, Fabio Fabris39, Mark N Wass40, Jie Hou41,42, Jianlin Cheng42, Zheng Wang43, Alfonso E Romero44, Alberto Paccanaro44, Haixuan Yang45,46, Tatyana Goldberg47, Chenguang Zhao48,49,50, Liisa Holm51, Petri Törönen51, Alan J Medlar51, Elaine Zosa52, Itamar Borukhov53, Ilya Novikov54, Angela Wilkins55, Olivier Lichtarge55, Po-Han Chi56, Wei-Cheng Tseng57, Michal Linial58, Peter W Rose59, Christophe Dessimoz60,61,62, Vedrana Vidulin63, Saso Dzeroski64,65, Ian Sillitoe66, Sayoni Das67, Jonathan Gill Lees67,68, David T Jones69,70, Cen Wan71,69, Domenico Cozzetto71,69, Rui Fa71,69, Mateo Torres44, Alex Warwick Vesztrocy70,72, Jose Manuel Rodriguez73, Michael L Tress74, Marco Frasca75, Marco Notaro75, Giuliano Grossi75, Alessandro Petrini75, Matteo Re75, Giorgio Valentini75, Marco Mesiti75,76, Daniel B Roche77, Jonas Reeb77, David W Ritchie78, Sabeur Aridhi78, Seyed Ziaeddin Alborzi78,79, Marie-Dominique Devignes78,80,79, Da Chen Emily Koo81, Richard Bonneau82,83, Vladimir Gligorijević84, Meet Barot85, Hai Fang86, Stefano Toppo87, Enrico Lavezzo87, Marco Falda88, Michele Berselli87, Silvio C E Tosatto89,90, Marco Carraro90, Damiano Piovesan90, Hafeez Ur Rehman91, Qizhong Mao92,93, Shanshan Zhang92, Slobodan Vucetic92, Gage S Black94,95, Dane Jo94,95, Erica Suh94, Jonathan B Dayton94,95, Dallas J Larsen94,95, Ashton R Omdahl94,95, Liam J McGuffin96, Danielle A Brackenridge96, Patricia C Babbitt97,98, Jeffrey M Yunes99,98, Paolo Fontana100, Feng Zhang101,102, Shanfeng Zhu103,104,105, Ronghui You103,104,105, Zihan Zhang103,105, Suyang Dai103,105, Shuwei Yao103,104, Weidong Tian106,107, Renzhi Cao108, Caleb Chandler108, Miguel Amezola108, Devon Johnson108, Jia-Ming Chang109, Wen-Hung Liao109, Yi-Wei Liu109, Stefano Pascarelli110, Yotam Frank111, Robert Hoehndorf112, Maxat Kulmanov112, Imane Boudellioua113,114, Gianfranco Politano115, Stefano Di Carlo115, Alfredo Benso115, Kai Hakala116,117, Filip Ginter116,118, Farrokh Mehryary116,117, Suwisa Kaewphan116,117,119, Jari Björne120,121, Hans Moen118, Martti E E Tolvanen122, Tapio Salakoski120,121, Daisuke Kihara123,124, Aashish Jain125, Tomislav Šmuc126, Adrian Altenhoff127,128, Asa Ben-Hur129, Burkhard Rost47,130, Steven E Brenner131, Christine A Orengo67, Constance J Jeffery132, Giovanni Bosco133, Deborah A Hogan6,8, Maria J Martin9, Claire O'Donovan9, Sean D Mooney4, Casey S Greene134,135, Predrag Radivojac136, Iddo Friedberg137.
Abstract
BACKGROUND: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.Entities:
Keywords: Biofilm; Community challenge; Critical assessment; Long-term memory; Protein function prediction
Mesh:
Year: 2019 PMID: 31744546 PMCID: PMC6864930 DOI: 10.1186/s13059-019-1835-8
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1A comparison in Fmax between the top 5 CAFA2 models against the top 5 CAFA3 models. Colored boxes encode the results such that (1) the colors indicate margins of a CAFA3 method over a CAFA2 method in Fmax and (2) the numbers in the box indicate the percentage of wins. a CAFA2 top 5 models (rows, from top to bottom) against CAFA3 top 5 models (columns, from left to right). b Comparison of the performance (Fmax) of Naïve baselines trained respectively on SwissProt2014 and SwissProt2017. Colored box between the two bars shows the percentage of wins and margin of wins as in a. c Comparison of the performance (Fmax) of BLAST baselines trained on SwissProt2014 and SwissProt2017. Colored box between the two bars shows the percentage of wins and margin of wins as in a. Statistical significance was assessed using 10,000 bootstrap samples of benchmark proteins
Fig. 2Performance evaluation based on the Fmax for the top CAFA1, CAFA2, and CAFA3 methods. The top 12 methods are shown in this barplot ranked in descending order from left to right. The baseline methods are appended to the right; they were trained on training data from 2017, 2014, and 2011, respectively. Coverage of the methods were shown as text inside the bars. Coverage is defined as the percentage of proteins in the benchmark that are predicted by the methods. Color scheme: CAFA2, ivory; CAFA3, green; Naïve, red; BLAST, blue. Note that in MFO and BPO, CAFA1 methods were ranked, but since none made to the top 12 of all 3 CAFA challenges, they were not displayed. The CAFA1 challenge did not collect predictions for CCO. a: molecular function; b: Biological process; c: Cellular Component
Fig. 3Performance evaluation based on the Fmax for the top-performing methods in 3 ontologies. Evaluation was carried out on No knowledge benchmarks in the full mode. a–c: bar plots showing the Fmax of the top 10 methods. The 95% confidence interval was estimated using 10,000 bootstrap iterations on the benchmark set. Coverage of the methods was shown as text inside the bars. Coverage is defined as the percentage of proteins in the benchmark which are predicted by the methods. d–f: precision-recall curves for the top 10 methods. The perfect prediction should have Fmax=1, at the top right corner of the plot. The dot on the curve indicates where the maximum F score is achieved
Fig. 4Performance evaluation based on Smin for the top-performing methods in 3 ontologies. Evaluation was carried out on No knowledge benchmarks in the full mode. a–c: bar plots showing Smin of the top 10 methods. The 95% confidence interval was estimated using 10,000 bootstrap iterations on the benchmark set. Coverage of the methods was shown as text inside the bars. Coverage is defined as the percentage of proteins in the benchmark which are predicted by the methods. d–f: remaining uncertainty-missing information (RU-MI) curves for the top 10 methods. The perfect prediction should have Smin=0, at the bottom left corner of the plot. The dot on the curve indicates where the minimum semantic distance is achieved
Fig. 5Evaluation based on the Fmax for the top-performing methods in eukaryotic and bacterial species
Fig. 6Number of proteins in each benchmark species and ontology
Fig. 7Heatmap of similarity for the top 10 methods in CAFA1, CAFA2, and CAFA3. Similarity is represented by Euclidean distance of the prediction scores from each pair of methods, using the intersection set of benchmarks in the “Top methods have improved from CAFA2 to CAFA3, but improvement was less dramatic than from CAFA1 to CAFA2” section. The higher (darker red color) the euclidean distance, the less similar the methods are. Top 10 methods from each of the CAFA challenges are displayed and ranked by their performance in Fmax. Cells highlighted by black borders are between a pair of methods that come from the same PI. a: Molecular Function; b: Biological Process; c: Cellular Component
Fig. 8Keyword analysis of all CAFA3 participating methods. a–c: both relative frequency of the keywords and weighted frequencies are provided for three respective GO ontologies. The weighted frequencies accounts for the performance of the the particular model using the given keyword. If that model performs well (with high Fmax), then it gives more weight to the calculation of the total weighted average of that keyword. d shows the ratio of relative frequency between the Fmax-weighted and equal-weighted. Red indicates the ratio is greater than one while blue indicates the ratio is less than one. Only the top five keywords ranked by ratio are shown. The larger the ratio, the more difference there is between the Fmax-weighted and the equal-weighted
Number of proteins in Candida albicans and Pseudomonas aeruginosa associated with the GO term “Biofilm formation” (GO:0042710) in the GOA databases versus experimental results
| GOA annotations | ||||
|---|---|---|---|---|
| Total, 2308 | Unannotated | Annotated | ||
| CAFA experiments | False | 2034 | 29 | |
| True | 240 | 5 | ||
| Total, 4056 | Unannotated | Annotated | ||
| CAFA experiments | False | 3491 | 25 | |
| True | 532 | 9 | ||
Fig. 9AUROC of the top five teams in CAFA- π. The best-performing model from each team is picked for the top five teams, regardless of whether that model is submitted as model 1. Four baseline models all based on BLAST were computed for Candida, while six baseline models were computed for Pseudomonas, including two based on expression profiles. All team methods are in gray while BLAST methods are in red, BLAST computational methods are in blue, and expression are in yellow, see Table 3 for the description of the baselines
Baseline methods in term-centric evaluation of protein function prediction
| Model number | Training data | Score assignment | |
|---|---|---|---|
| Expression | 1 | Gene expression compendium for | Highest correlation score out of all pairwise correlations |
| 2 | Top 10 average correlation score | ||
| BLAST | 1 | All experimental annotation in UniProt-GOA. Sequences from Swiss-Prot | Highest sequence identity out of all pairwise BLASTp hits |
| 2 | All experimental annotation in UniProt-GOA. Sequences from Swiss-Prot and TrEMBL | ||
| blastcomp | 1 | All experimental and computational annotations in UniProt-GOA. Sequences from Swiss-Prot | |
| 2 | All experimental and computational annotations in UniProt-GOA. Sequences from Swiss-Prot and TrEMBL |
Number of proteins in Pseudomonas aeruginosa associated with function motility (GO:0001539) in the GOA databases versus experimental results
| GOA annotations | |||
|---|---|---|---|
| Total, 3630 | Unannotated | Annotated | |
| CAFA experiments | False | 3195 | 12 |
| True | 403 | 21 | |
Fig. 10AUROC of top five teams in CAFA3. The best-performing model from each team is picked for the top five teams, regardless of whether that model is submitted as model 1. All team methods are in gray while BLAST methods are in red and BLAST computational methods are in blue, see Table 3 for the description of the baselines
Fig. 11CAFA participation has been growing. Each principal investigator is allowed to head multiple teams, but each member can only belong to one team. Each team can submit up to three models
Fig. 12CAFA3 timeline
Fig. 13Experimental procedure of determining genes associated with the functions biofilm formation (a) and motility (b) in P. aeruginosa
Fig. 14a: different phenotypes in response to doxycycline treatment: low growth, smooth, no growth and intermediate. b: adherence phenotypes. See text for details
Fig. 15AUROC of top 5 teams in CAFA- π. The best-performing model from each team is picked for the top five teams, regardless of whether that model is submitted as model 1. All team methods are in gray while BLAST methods are in red, BLAST computational methods are in blue and expression are in yellow. See Table 3 for description of the baselines