Literature DB >> 34273967

Accurate plant pathogen effector protein classification ab initio with deepredeff: an ensemble of convolutional neural networks.

Ruth Kristianingsih1, Dan MacLean2.   

Abstract

BACKGROUND: Plant pathogens cause billions of dollars of crop loss every year and are a major threat to global food security. Effector proteins are the tools such pathogens use to infect the cell, predicting effectors de novo from sequence is difficult because of the heterogeneity of the sequences. We hypothesised that deep learning classifiers based on Convolutional Neural Networks would be able to identify effectors and deliver new insights.
RESULTS: We created a training set of manually curated effector sequences from PHI-Base and used these to train a range of model architectures for classifying bacteria, fungal and oomycete sequences. The best performing classifiers had accuracies from 93 to 84%. The models were tested against popular effector detection software on our own test data and data provided with those models. We observed better performance from our models. Specifically our models showed greater accuracy and lower tendencies to call false positives on a secreted protein negative test set and a greater generalisability. We used GRAD-CAM activation map analysis to identify the sequences that activated our CNN-LSTM models and found short but distinct N-terminal regions in each taxon that was indicative of effector sequences. No motifs could be observed in these regions but an analysis of amino acid types indicated differing patterns of enrichment and depletion that varied between taxa.
CONCLUSIONS: Small training sets can be used effectively to train highly accurate and sensitive deep learning models without need for the operator to know anything other than sequence and without arbitrary decisions made about what sequence features or physico-chemical properties are important. Biological insight on subsequences important for classification can be achieved by examining the activations in the model.
© 2021. The Author(s).

Entities:  

Keywords:  AI; Deep learning; Effector protein

Mesh:

Substances:

Year:  2021        PMID: 34273967     DOI: 10.1186/s12859-021-04293-3

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  7 in total

1.  EffectorP: predicting fungal effector proteins from secretomes using machine learning.

Authors:  Jana Sperschneider; Donald M Gardiner; Peter N Dodds; Francesco Tini; Lorenzo Covarelli; Karam B Singh; John M Manners; Jennifer M Taylor
Journal:  New Phytol       Date:  2015-12-17       Impact factor: 10.151

2.  Face recognition: a convolutional neural-network approach.

Authors:  S Lawrence; C L Giles; A C Tsoi; A D Back
Journal:  IEEE Trans Neural Netw       Date:  1997

3.  Improved prediction of fungal effector proteins from secretomes with EffectorP 2.0.

Authors:  Jana Sperschneider; Peter N Dodds; Donald M Gardiner; Karam B Singh; Jennifer M Taylor
Journal:  Mol Plant Pathol       Date:  2018-05-11       Impact factor: 5.663

4.  ApoplastP: prediction of effectors and plant proteins in the apoplast using machine learning.

Authors:  Jana Sperschneider; Peter N Dodds; Karam B Singh; Jennifer M Taylor
Journal:  New Phytol       Date:  2017-12-15       Impact factor: 10.151

5.  Genome sequence and analysis of the Irish potato famine pathogen Phytophthora infestans.

Authors:  Brian J Haas; Sophien Kamoun; Michael C Zody; Rays H Y Jiang; Robert E Handsaker; Liliana M Cano; Manfred Grabherr; Chinnappa D Kodira; Sylvain Raffaele; Trudy Torto-Alalibo; Tolga O Bozkurt; Audrey M V Ah-Fong; Lucia Alvarado; Vicky L Anderson; Miles R Armstrong; Anna Avrova; Laura Baxter; Jim Beynon; Petra C Boevink; Stephanie R Bollmann; Jorunn I B Bos; Vincent Bulone; Guohong Cai; Cahid Cakir; James C Carrington; Megan Chawner; Lucio Conti; Stefano Costanzo; Richard Ewan; Noah Fahlgren; Michael A Fischbach; Johanna Fugelstad; Eleanor M Gilroy; Sante Gnerre; Pamela J Green; Laura J Grenville-Briggs; John Griffith; Niklaus J Grünwald; Karolyn Horn; Neil R Horner; Chia-Hui Hu; Edgar Huitema; Dong-Hoon Jeong; Alexandra M E Jones; Jonathan D G Jones; Richard W Jones; Elinor K Karlsson; Sridhara G Kunjeti; Kurt Lamour; Zhenyu Liu; Lijun Ma; Daniel Maclean; Marcus C Chibucos; Hayes McDonald; Jessica McWalters; Harold J G Meijer; William Morgan; Paul F Morris; Carol A Munro; Keith O'Neill; Manuel Ospina-Giraldo; Andrés Pinzón; Leighton Pritchard; Bernard Ramsahoye; Qinghu Ren; Silvia Restrepo; Sourav Roy; Ari Sadanandom; Alon Savidor; Sebastian Schornack; David C Schwartz; Ulrike D Schumann; Ben Schwessinger; Lauren Seyer; Ted Sharpe; Cristina Silvar; Jing Song; David J Studholme; Sean Sykes; Marco Thines; Peter J I van de Vondervoort; Vipaporn Phuntumart; Stephan Wawra; Rob Weide; Joe Win; Carolyn Young; Shiguo Zhou; William Fry; Blake C Meyers; Pieter van West; Jean Ristaino; Francine Govers; Paul R J Birch; Stephen C Whisson; Howard S Judelson; Chad Nusbaum
Journal:  Nature       Date:  2009-09-09       Impact factor: 49.962

Review 6.  Recent trends in control methods for bacterial wilt diseases caused by Ralstonia solanacearum.

Authors:  Yanetri Asi Nion; Koki Toyota
Journal:  Microbes Environ       Date:  2015-03-26       Impact factor: 2.912

7.  Extracting biological age from biomedical data via deep learning: too much of a good thing?

Authors:  Timothy V Pyrkov; Konstantin Slipensky; Mikhail Barg; Alexey Kondrashin; Boris Zhurov; Alexander Zenin; Mikhail Pyatnitskiy; Leonid Menshikov; Sergei Markov; Peter O Fedichev
Journal:  Sci Rep       Date:  2018-03-26       Impact factor: 4.379

  7 in total
  2 in total

1.  PHI-base in 2022: a multi-species phenotype database for Pathogen-Host Interactions.

Authors:  Martin Urban; Alayne Cuzick; James Seager; Valerie Wood; Kim Rutherford; Shilpa Yagwakote Venkatesh; Jashobanta Sahu; S Vijaylakshmi Iyer; Lokanath Khamari; Nishadi De Silva; Manuel Carbajo Martinez; Helder Pedro; Andrew D Yates; Kim E Hammond-Kosack
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

Review 2.  Exploiting Structural Modelling Tools to Explore Host-Translocated Effector Proteins.

Authors:  Sahel Amoozadeh; Jodie Johnston; Claudia-Nicole Meisrimler
Journal:  Int J Mol Sci       Date:  2021-11-30       Impact factor: 5.923

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.