Literature DB >> 22039211

Cascade detection for the extraction of localized sequence features; specificity results for HIV-1 protease and structure-function results for the Schellman loop.

Nicholas E Newell1.   

Abstract

MOTIVATION: The extraction of the set of features most relevant to function from classified biological sequence sets is still a challenging problem. A central issue is the determination of expected counts for higher order features so that artifact features may be screened.
RESULTS: Cascade detection (CD), a new algorithm for the extraction of localized features from sequence sets, is introduced. CD is a natural extension of the proportional modeling techniques used in contingency table analysis into the domain of feature detection. The algorithm is successfully tested on synthetic data and then applied to feature detection problems from two different domains to demonstrate its broad utility. An analysis of HIV-1 protease specificity reveals patterns of strong first-order features that group hydrophobic residues by side chain geometry and exhibit substantial symmetry about the cleavage site. Higher order results suggest that favorable cooperativity is weak by comparison and broadly distributed, but indicate possible synergies between negative charge and hydrophobicity in the substrate. Structure-function results for the Schellman loop, a helix-capping motif in proteins, contain strong first-order features and also show statistically significant cooperativities that provide new insights into the design of the motif. These include a new 'hydrophobic staple' and multiple amphipathic and electrostatic pair features. CD should prove useful not only for sequence analysis, but also for the detection of multifactor synergies in cross-classified data from clinical studies or other sources. AVAILABILITY: Windows XP/7 application and data files available at: https://sites.google.com/site/cascadedetect/home. CONTACT: nacnewell@comcast.net SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22039211     DOI: 10.1093/bioinformatics/btr594

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  The importance of physicochemical characteristics and nonlinear classifiers in determining HIV-1 protease specificity.

Authors:  Timmy Manning; Paul Walsh
Journal:  Bioengineered       Date:  2016-04-02       Impact factor: 3.269

2.  Bi-allelic ADARB1 Variants Associated with Microcephaly, Intellectual Disability, and Seizures.

Authors:  Tiong Yang Tan; Jiří Sedmík; Mark P Fitzgerald; Rivka Sukenik Halevy; Liam P Keegan; Ingo Helbig; Lina Basel-Salmon; Lior Cohen; Rachel Straussberg; Wendy K Chung; Mayada Helal; Reza Maroofian; Henry Houlden; Jane Juusola; Simon Sadedin; Lynn Pais; Katherine B Howell; Susan M White; John Christodoulou; Mary A O'Connell
Journal:  Am J Hum Genet       Date:  2020-03-26       Impact factor: 11.025

3.  Mapping side chain interactions at protein helix termini.

Authors:  Nicholas E Newell
Journal:  BMC Bioinformatics       Date:  2015-07-25       Impact factor: 3.169

4.  Prediction of HIV-1 protease cleavage site using a combination of sequence, structural, and physicochemical features.

Authors:  Onkar Singh; Emily Chia-Yu Su
Journal:  BMC Bioinformatics       Date:  2016-12-23       Impact factor: 3.169

5.  Cleavage entropy as quantitative measure of protease specificity.

Authors:  Julian E Fuchs; Susanne von Grafenstein; Roland G Huber; Michael A Margreiter; Gudrun M Spitzer; Hannes G Wallnoefer; Klaus R Liedl
Journal:  PLoS Comput Biol       Date:  2013-04-18       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.