Xiaojing Wang1, Di Wu, Siyuan Zheng, Jing Sun, Lin Tao, Yixue Li, Zhiwei Cao. 1. Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Graduate School of the Chinese Academy of Sciences, 320 YueYang Road, Shanghai 200031, PR China. xjwang03@sibs.ac.cn
Abstract
BACKGROUND: In the adaptive immune system, variable regions of immunoglobulin (IG) are encoded by random recombination of variable (V), diversity (D), and joining (J) gene segments in the germline. Partitioning the functional antibody sequences to their sourcing germline gene segments is vital not only for understanding antibody maturation but also for promoting the potential engineering of the therapeutic antibodies. To date, several tools have been developed to perform such "trace-back" calculations. Yet, the predicting ability and processing volume of those tools vary significantly for different sets of data. Moreover, none of them give a confidence for immunoglobulin heavy diversity (IGHD) identification. Developing fast, efficient and enhanced tools is always needed with the booming of immunological data. RESULTS: Here, a program named Ab-origin is presented. It is designed by batch query against germline databases based on empirical knowledge, optimized scoring scheme and appropriate parameters. Special efforts have been paid to improve the identification accuracy of the short and volatile region, IGHD. In particular, a threshold score for certain sensitivity and specificity is provided to give the confidence level of the IGHD identification. CONCLUSION: When evaluated using different sets of both simulated data and experimental data, Ab-origin outperformed all the other five popular tools in terms of prediction accuracy. The features of batch query and confidence indication of IGHD identification would provide extra help to users. The program is freely available at http://mpsq.biosino.org/ab-origin/supplementary.html.
BACKGROUND: In the adaptive immune system, variable regions of immunoglobulin (IG) are encoded by random recombination of variable (V), diversity (D), and joining (J) gene segments in the germline. Partitioning the functional antibody sequences to their sourcing germline gene segments is vital not only for understanding antibody maturation but also for promoting the potential engineering of the therapeutic antibodies. To date, several tools have been developed to perform such "trace-back" calculations. Yet, the predicting ability and processing volume of those tools vary significantly for different sets of data. Moreover, none of them give a confidence for immunoglobulin heavy diversity (IGHD) identification. Developing fast, efficient and enhanced tools is always needed with the booming of immunological data. RESULTS: Here, a program named Ab-origin is presented. It is designed by batch query against germline databases based on empirical knowledge, optimized scoring scheme and appropriate parameters. Special efforts have been paid to improve the identification accuracy of the short and volatile region, IGHD. In particular, a threshold score for certain sensitivity and specificity is provided to give the confidence level of the IGHD identification. CONCLUSION: When evaluated using different sets of both simulated data and experimental data, Ab-origin outperformed all the other five popular tools in terms of prediction accuracy. The features of batch query and confidence indication of IGHD identification would provide extra help to users. The program is freely available at http://mpsq.biosino.org/ab-origin/supplementary.html.
Authors: Ann Michelle Morrison; Kelly Coughlin; James P Shine; Brent A Coull; Andrea C Rex Journal: Appl Environ Microbiol Date: 2003-11 Impact factor: 4.792
Authors: M Margarida Souto-Carneiro; Nancy S Longo; Daniel E Russ; Hong-wei Sun; Peter E Lipsky Journal: J Immunol Date: 2004-06-01 Impact factor: 5.422
Authors: Simon D W Frost; Ben Murrell; A S Md Mukarram Hossain; Gregg J Silverman; Sergei L Kosakovsky Pond Journal: Philos Trans R Soc Lond B Biol Sci Date: 2015-09-05 Impact factor: 6.237
Authors: Kathryn A K Finton; Della Friend; James Jaffe; Mesfin Gewe; Margaret A Holmes; H Benjamin Larman; Andrew Stuart; Kevin Larimore; Philip D Greenberg; Stephen J Elledge; Leonidas Stamatatos; Roland K Strong Journal: PLoS Pathog Date: 2014-09-25 Impact factor: 6.823
Authors: Inimary T Toby; Mikhail K Levin; Edward A Salinas; Scott Christley; Sanchita Bhattacharya; Felix Breden; Adam Buntzman; Brian Corrie; John Fonner; Namita T Gupta; Uri Hershberg; Nishanth Marthandan; Aaron Rosenfeld; William Rounds; Florian Rubelt; Walter Scarborough; Jamie K Scott; Mohamed Uduman; Jason A Vander Heiden; Richard H Scheuermann; Nancy Monson; Steven H Kleinstein; Lindsay G Cowell Journal: BMC Bioinformatics Date: 2016-10-06 Impact factor: 3.169