Literature DB >> 28207401

Filling a Protein Scaffold With a Reference.

Letu Qingge, Xiaowen Liu, Farong Zhong, Binhai Zhu.   

Abstract

In mass spectrometry-based de novo protein sequencing, it is hard to complete the sequence of the whole protein. Motivated by this, we study the (one-sided) problem of filling a protein scaffold S with some missing amino acids, given a sequence of contigs none of which is allowed to be altered, with respect to a complete reference protein P of length n , such that the BLOSUM62 score between P and the filled sequence S' is maximized. We show that this problem is polynomial-time solvable in O(n26) time. We also consider the case when the contigs are not of high quality and they are concatenated into an (incomplete) sequence I , where the missing amino acids can be inserted anywhere in I to obtain I' , such that the BLOSUM62 score between P and I' is maximized. We show that this problem is polynomial-time solvable in O(n22) time. Due to the high time complexity, both of these algorithms are impractical, we hence present several algorithms based on greedy and local search, trying to solve the problems practically. The empirical results, based on some antibody and mammalian proteins, show that the algorithms can fill protein scaffolds with high quality, provided that a good pair of scaffold and reference are given.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28207401      PMCID: PMC5439369          DOI: 10.1109/TNB.2017.2666780

Source DB:  PubMed          Journal:  IEEE Trans Nanobioscience        ISSN: 1536-1241            Impact factor:   2.935


  11 in total

1.  PEAKS: powerful software for peptide de novo sequencing by tandem mass spectrometry.

Authors:  Bin Ma; Kaizhong Zhang; Christopher Hendrie; Chengzhi Liang; Ming Li; Amanda Doherty-Kirby; Gilles Lajoie
Journal:  Rapid Commun Mass Spectrom       Date:  2003       Impact factor: 2.419

2.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

3.  Scaffold filling under the breakpoint and related distances.

Authors:  Haitao Jiang; Chunfang Zheng; David Sankoff; Binhai Zhu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2012 Jul-Aug       Impact factor: 3.710

4.  Shotgun protein sequencing by tandem mass spectra assembly.

Authors:  Nuno Bandeira; Haixu Tang; Vineet Bafna; Pavel Pevzner
Journal:  Anal Chem       Date:  2004-12-15       Impact factor: 6.986

5.  An improved approximation algorithm for scaffold filling to maximize the common adjacencies.

Authors:  Nan Liu; Haitao Jiang; Daming Zhu; Binhai Zhu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Jul-Aug       Impact factor: 3.710

6.  Automated protein (re)sequencing with MS/MS and a homologous database yields almost full coverage and accuracy.

Authors:  Xiaowen Liu; Yonghua Han; Denis Yuen; Bin Ma
Journal:  Bioinformatics       Date:  2009-06-17       Impact factor: 6.937

7.  Scaffold filling, contig fusion and comparative gene order inference.

Authors:  Adriana Muñoz; Chunfang Zheng; Qian Zhu; Victor A Albert; Steve Rounsley; David Sankoff
Journal:  BMC Bioinformatics       Date:  2010-06-04       Impact factor: 3.169

8.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

9.  The Blocks database--a system for protein classification.

Authors:  S Pietrokovski; J G Henikoff; S Henikoff
Journal:  Nucleic Acids Res       Date:  1996-01-01       Impact factor: 16.971

10.  Automated de novo protein sequencing of monoclonal antibodies.

Authors:  Nuno Bandeira; Victoria Pham; Pavel Pevzner; David Arnott; Jennie R Lill
Journal:  Nat Biotechnol       Date:  2008-12       Impact factor: 54.908

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.