MOTIVATION: N-linked glycosylation occurs predominantly at the N-X-T/S motif, where X is any amino acid except proline. Not all N-X-T/S sequons are glycosylated, and several web servers have been developed to predict N-linked glycan occupancy from sequence and/or residue pattern information. None of the currently available servers, however, uses protein structural information to predict N-glycan occupancy. RESULTS: Here, we describe a novel classifier algorithm, NGlycPred, for the prediction of glycan occupancy at N-X-T/S sequons. The algorithm uses both structural and residue pattern information and was trained on a set of glycosylated protein structures using the Random Forest algorithm. The best predictor achieved a balanced accuracy of 0.687 under 10-fold cross-validation on a curated dataset of 479 N-X-T/S sequons and outperformed sequence-based predictors when evaluated on the same dataset. The incorporation of structural information, including local contact order, surface accessibility/composition and secondary structure, thus improves the prediction accuracy of glycan occupancy at the N-X-T/S consensus sequon. AVAILABILITY AND IMPLEMENTATION: NGlycPred is freely available to non-commercial users as a web-based server at http://exon.niaid.nih.gov/nglycpred/.
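As an illustration of two ideas in the abstract, the sketch below locates N-X-T/S sequons in a protein sequence and computes balanced accuracy, taken here as the mean of sensitivity and specificity. It is not NGlycPred's implementation: the function names `find_sequons` and `balanced_accuracy`, the example sequence and the labels are hypothetical, and the sequon rule is only the one stated above (X is any residue except proline).

```python
import re

# N, then any residue except P, then S or T.
# The lookahead lets overlapping sequons (e.g. "NNSS") all be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(seq):
    """Return 1-based positions of the Asn in each N-X-T/S sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(seq.upper())]


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = occupied)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


if __name__ == "__main__":
    # Hypothetical sequence: "NPS" is skipped because X is proline.
    print(find_sequons("MNGTNPSANNSS"))
    # Hypothetical occupancy labels and predictions.
    print(balanced_accuracy([1, 1, 1, 0, 0], [1, 0, 1, 0, 1]))
```

Balanced accuracy is a natural choice when occupied and unoccupied sequons are unevenly represented, since a predictor that always outputs the majority class scores only 0.5.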