Literature DB >> 10887166

Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language.

C Lovis1, R H Baud.   

Abstract

OBJECTIVE: The authors consider the problem of exact string pattern matching using algorithms that do not require any preprocessing. To choose the most appropriate algorithm, distinctive features of the medical language must be taken into account. The characteristics of medical language are emphasized in this regard, the best algorithm of those reviewed is proposed, and detailed evaluations of time complexity for processing medical texts are provided.
DESIGN: The authors first illustrate and discuss the techniques of various string pattern-matching algorithms. Next, the source code and the behavior of representative exact string pattern-matching algorithms are presented in a comprehensive manner to promote their implementation. Detailed explanations of the use of various techniques to improve performance are given. MEASUREMENTS: Real-time measures of time complexity with English medical texts are presented. They lead to results distinct from those found in the computer science literature, which are typically computed with normally distributed texts.
RESULTS: The Boyer-Moore-Horspool algorithm achieves the best overall results when used with medical texts. This algorithm usually performs at least twice as fast as the other algorithms tested.
CONCLUSION: The time performance of exact string pattern matching can be greatly improved if an efficient algorithm is used. Considering the growing amount of text handled in the electronic patient record, it is worth implementing this efficient algorithm.

Entities:  

Mesh:

Year:  2000        PMID: 10887166      PMCID: PMC61442          DOI: 10.1136/jamia.2000.0070378

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  13 in total

1.  Power of expression in the electronic patient record: structured data or narrative text?

Authors:  C Lovis; R H Baud; P Planche
Journal:  Int J Med Inform       Date:  2000-09       Impact factor: 4.046

2.  Morpho-semantic analysis and translation of medical compound terms.

Authors:  P Dujols; P Aubas; C Baylon; F Grémy
Journal:  Methods Inf Med       Date:  1991       Impact factor: 2.176

3.  Internet integrated in the daily medical practice within an electronic patient record.

Authors:  C Lovis; R H Baud; J R Scherrer
Journal:  Comput Biol Med       Date:  1998-09       Impact factor: 4.589

4.  Medical dictionaries for patient encoding systems: a methodology.

Authors:  C Lovis; R Baud; A M Rassinoux; P A Michel; J R Scherrer
Journal:  Artif Intell Med       Date:  1998 Sep-Oct       Impact factor: 5.326

5.  COFFEE: an objective function for multiple sequence alignments.

Authors:  C Notredame; L Holm; D G Higgins
Journal:  Bioinformatics       Date:  1998-06       Impact factor: 6.937

6.  Morpho-semantic parsing of medical expressions.

Authors:  R H Baud; C Lovis; A M Rassinoux; J R Scherrer
Journal:  Proc AMIA Symp       Date:  1998

7.  Genetic algorithms: principles of natural selection applied to computation.

Authors:  S Forrest
Journal:  Science       Date:  1993-08-13       Impact factor: 47.728

8.  Morphosemantic analysis of compound word forms denoting surgical procedures.

Authors:  L M Norton; M G Pacak
Journal:  Methods Inf Med       Date:  1983-01       Impact factor: 2.176

9.  The use of morphosemantic regularities in the medical vocabulary for automatic lexical coding.

Authors:  S Wolff
Journal:  Methods Inf Med       Date:  1984-10       Impact factor: 2.176

10.  Morphosemantic analysis of -ITIS forms in medical language.

Authors:  M G Pacak; L M Norton; G S Dunham
Journal:  Methods Inf Med       Date:  1980-04       Impact factor: 2.176

View more
  3 in total

1.  The contribution of morphological knowledge to French MeSH mapping for information retrieval.

Authors:  P Zweigenbaum; S J Darmoni; N Grabar
Journal:  Proc AMIA Symp       Date:  2001

2.  A de-identifier for medical discharge summaries.

Authors:  Ozlem Uzuner; Tawanda C Sibanda; Yuan Luo; Peter Szolovits
Journal:  Artif Intell Med       Date:  2007-11-28       Impact factor: 5.326

3.  Identifying patient smoking status from medical discharge records.

Authors:  Ozlem Uzuner; Ira Goldstein; Yuan Luo; Isaac Kohane
Journal:  J Am Med Inform Assoc       Date:  2007-10-18       Impact factor: 4.497

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.