A normalized Levenshtein distance metric.

Li Yujian, Liu Bo.

Abstract

Although a number of normalized edit distances presented so far may offer good performance in some applications, none of them can be regarded as a genuine metric between strings because they do not satisfy the triangle inequality. Given two strings X and Y over a finite alphabet, this paper defines a new normalized edit distance between X and Y as a simple function of their lengths (|X| and |Y|) and the Generalized Levenshtein Distance (GLD) between them. The new distance can be easily computed through GLD with a complexity of O(|X|·|Y|) and it is a metric valued in [0, 1] under the condition that the weight function is a metric over the set of elementary edit operations with all costs of insertions/deletions having the same weight. Experiments using the AESA algorithm in handwritten digit recognition show that the new distance can generally provide similar results to some other normalized edit distances and may perform slightly better if the triangle inequality is violated in a particular data set.
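
The abstract does not spell out the "simple function" of the lengths and the GLD. As a minimal sketch, the Python below assumes the form commonly attributed to this paper, 2·GLD(X, Y) / (α·(|X| + |Y|) + GLD(X, Y)), specialized to unit edit costs (α = 1), so that GLD reduces to the ordinary Levenshtein distance. The function names `levenshtein` and `normalized_levenshtein` are illustrative and do not come from the source.

```python
def levenshtein(x: str, y: str) -> int:
    """Unit-cost Levenshtein distance via the standard O(|X|*|Y|) dynamic program."""
    prev = list(range(len(y) + 1))  # distances from the empty prefix of x
    for i, cx in enumerate(x, start=1):
        curr = [i]  # turning x[:i] into the empty string takes i deletions
        for j, cy in enumerate(y, start=1):
            curr.append(min(
                prev[j] + 1,               # delete cx
                curr[j - 1] + 1,           # insert cy
                prev[j - 1] + (cx != cy),  # substitute, or match at zero cost
            ))
        prev = curr
    return prev[-1]


def normalized_levenshtein(x: str, y: str) -> float:
    """Assumed normalization: 2*d / (|X| + |Y| + d), with unit costs (alpha = 1)."""
    if not x and not y:
        return 0.0  # two empty strings are identical; avoids 0/0
    d = levenshtein(x, y)
    return 2.0 * d / (len(x) + len(y) + d)


if __name__ == "__main__":
    print(normalized_levenshtein("kitten", "sitting"))
```

Under these assumptions the value stays in [0, 1]: it is 0 for identical strings and reaches 1 when every symbol must be inserted or deleted, for example when one string is empty and the other is not.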

Year:  2007        PMID: 17431306     DOI: 10.1109/TPAMI.2007.1078

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  24 in total

1.  Biome representational in silico karyotyping.

Authors:  Valliammai Muthappan; Aaron Y Lee; Tamara L Lamprecht; Lakshmi Akileswaran; Suzanne M Dintzis; Choli Lee; Vincent Magrini; Elaine R Mardis; Jay Shendure; Russell N Van Gelder
Journal:  Genome Res       Date:  2011-02-10       Impact factor: 9.043

2.  Alignment free identification of clones in B cell receptor repertoires.

Authors:  Ofir Lindenbaum; Nima Nouri; Yuval Kluger; Steven H Kleinstein
Journal:  Nucleic Acids Res       Date:  2021-02-26       Impact factor: 16.971

3.  Identifying digenic disease genes via machine learning in the Undiagnosed Diseases Network.

Authors:  Souhrid Mukherjee; Joy D Cogan; John H Newman; John A Phillips; Rizwan Hamid; Jens Meiler; John A Capra
Journal:  Am J Hum Genet       Date:  2021-09-15       Impact factor: 11.025

4.  Applying Image Recognition and Tracking Methods for Fish Physiology Detection Based on a Visual Sensor.

Authors:  Jia-Ming Liang; Shashank Mishra; Yu-Lin Cheng
Journal:  Sensors (Basel)       Date:  2022-07-25       Impact factor: 3.847

5.  Clustering-based identification of clonally-related immunoglobulin gene sequence sets.

Authors:  Zhiliang Chen; Andrew M Collins; Yan Wang; Bruno A Gaëta
Journal:  Immunome Res       Date:  2010-09-27

6.  Online transcranial Doppler ultrasonographic control of an onscreen keyboard.

Authors:  Jie Lu; Khondaker A Mamun; Tom Chau
Journal:  Front Hum Neurosci       Date:  2014-04-22       Impact factor: 3.169

7.  Nominal ISOMERs (Incorrect Spellings Of Medicines Eluding Researchers)-variants in the spellings of drug names in PubMed: a database review. (Review)

Authors:  Robin E Ferner; Jeffrey K Aronson
Journal:  BMJ       Date:  2016-12-14

8.  Using String Metrics to Identify Patient Journeys through Care Pathways.

Authors:  Richard Williams; Iain E Buchan; Mattia Prosperi; John Ainsworth
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

9.  Matching health information seekers' queries to medical terms.

Authors:  Lina F Soualmia; Elise Prieur-Gaston; Zied Moalla; Thierry Lecroq; Stéfan J Darmoni
Journal:  BMC Bioinformatics       Date:  2012-09-07       Impact factor: 3.169

10.  DeepARG: a deep learning approach for predicting antibiotic resistance genes from metagenomic data.

Authors:  Gustavo Arango-Argoty; Emily Garner; Amy Pruden; Lenwood S Heath; Peter Vikesland; Liqing Zhang
Journal:  Microbiome       Date:  2018-02-01       Impact factor: 14.650
