Literature DB >> 23142965

DLocalMotif: a discriminative approach for discovering local motifs in protein sequences.

Ahmed M Mehdi1, Muhammad Shoaib B Sehgal, Bostjan Kobe, Timothy L Bailey, Mikael Bodén.   

Abstract

MOTIVATION: Local motifs are patterns of DNA or protein sequences that occur within a sequence interval relative to a biologically defined anchor or landmark. Current protein motif discovery methods do not adequately consider such constraints to identify biologically significant motifs that are only weakly over-represented but spatially confined. Using negatives, i.e. sequences known to not contain a local motif, can further increase the specificity of their discovery.
RESULTS: This article introduces the method DLocalMotif that makes use of positional information and negative data for local motif discovery in protein sequences. DLocalMotif combines three scoring functions, measuring degrees of motif over-representation, entropy and spatial confinement, specifically designed to discriminatively exploit the availability of negative data. The method is shown to outperform current methods that use only a subset of these motif characteristics. We apply the method to several biological datasets. The analysis of peroxisomal targeting signals uncovers several novel motifs that occur immediately upstream of the dominant peroxisomal targeting signal-1 signal. The analysis of proline-tyrosine nuclear localization signals uncovers multiple novel motifs that overlap with C2H2 zinc finger domains. We also evaluate the method on classical nuclear localization signals and endoplasmic reticulum retention signals and find that DLocalMotif successfully recovers biologically relevant sequence properties. AVAILABILITY: http://bioinf.scmb.uq.edu.au/dlocalmotif/

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23142965      PMCID: PMC6636396          DOI: 10.1093/bioinformatics/bts654

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data.

Authors:  Hongbo Zhang; Lin Zhu; De-Shuang Huang
Journal:  Sci Rep       Date:  2017-06-12       Impact factor: 4.379

2.  Probabilistic variable-length segmentation of protein sequences for discriminative motif discovery (DiMotif) and sequence embedding (ProtVecX).

Authors:  Ehsaneddin Asgari; Alice C McHardy; Mohammad R K Mofrad
Journal:  Sci Rep       Date:  2019-03-05       Impact factor: 4.379

3.  TET2 missense variants in human neoplasia. A proposal of structural and functional classification.

Authors:  Elena Bussaglia; Rosa Antón; Josep F Nomdedéu; Pablo Fuentes-Prior
Journal:  Mol Genet Genomic Med       Date:  2019-06-11       Impact factor: 2.183

4.  GLTM: A Global-Local Attention LSTM Model to Locate Dimer Motif of Single-Pass Membrane Proteins.

Authors:  Quanchao Ma; Kai Zou; Zhihai Zhang; Fan Yang
Journal:  Front Genet       Date:  2022-03-15       Impact factor: 4.599

5.  Discriminative motif discovery via simulated evolution and random under-sampling.

Authors:  Tao Song; Hong Gu
Journal:  PLoS One       Date:  2014-02-13       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.