Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Strategies for searching medical natural language text. Distribution of words in the anatomic diagnoses of 7000 autopsy subjects.

Literature DB >> 6546837

Strategies for searching medical natural language text. Distribution of words in the anatomic diagnoses of 7000 autopsy subjects.

Abstract

Computerized indexing and retrieval of medical records is increasingly important; but the use of natural language versus coded languages (SNOP, SNOMED) for this purpose remains controversial. In an effort to develop search strategies for natural language text, the authors examined the anatomic diagnosis reports by computer for 7000 consecutive autopsy subjects spanning a 13-year period at The Johns Hopkins Hospital. There were 923,657 words, 11,642 of them distinct. The authors observed an average of 1052 keystrokes, 28 lines, and 131 words per autopsy report, with an average 4.6 words per line and 7.0 letters per word. The entire text file represented 921 hours of secretarial effort. Words ranged in frequency from 33,959 occurrences of "and" to one occurrence for each of 3398 different words. Searches for rare diseases with unique names or for representative examples of common diseases were most readily performed with the use of computer-printed key word in context (KWIC) books. For uncommon diseases designated by commonly used terms (such as "cystic fibrosis"), needs were best served by a computerized search for logical combinations of key words. In an unbalanced word distribution, each conjunction (logical and) search should be performed in ascending order of word frequency; but each alternation (logical inclusive or) search should be performed in descending order of word frequency. Natural language text searches will assume a larger role in medical records analysis as the labor-intensive procedure of translation into a coded language becomes more costly, compared with the computer-intensive procedure of text searching.

Entities: Disease

Mesh：

Year: 1984 PMID： 6546837 PMCID： PMC1900346

Source DB: PubMed Journal: Am J Pathol ISSN： 0002-9440 Impact factor: 4.307

32 in total

1. Natural language storage and retrieval of medical diagnostic information. Experience at the UCLA hospital and clinics over a 10-year period.

Authors: R S Okubo; W S Russell; B Dimsdale; B G Lamson
Journal: Comput Programs Biomed Date: 1975-12

6. Mathematical models for the diagnosis of liver disease. Problems arising in the use of conditional probability theory.

Authors: P M Fraser; D A Franklin
Journal: Q J Med Date: 1974-01

Strategies for searching medical natural language text. Distribution of words in the anatomic diagnoses of 7000 autopsy subjects.

1. Natural language storage and retrieval of medical diagnostic information. Experience at the UCLA hospital and clinics over a 10-year period.

2. Computer learning of the differential diagnosis of goiters.

3. General purpose information handling techniques for pathological data.

4. Towards the simulation of clinical cognition. Taking a present illness by computer.

5. Theory development in medical decision-making.

6. Mathematical models for the diagnosis of liver disease. Problems arising in the use of conditional probability theory.

7. Diagnostic errors discovered at autopsy.

8. POLARS: a Pathology On-line Logging And Reporting System.

9. A computer-based system for autopsy diagnosis storage and retrieval without numerical coding.

10. Computer-aided diagnosis of cardiovascular disorders.

1. Medical Subject Headings and medical terminology: an analysis of terminology used in hospital charts.

2. A new paradigm for hypothesis testing in medicine, with examination of the Neyman Pearson condition.