| Literature DB >> 23505427 |
Fernanda C Dórea1, C Anne Muckle, David Kelton, J T McClure, Beverly J McEwen, W Bruce McNab, Javier Sanchez, Crawford W Revie.
Abstract
BACKGROUND: Recent focus on earlier detection of pathogen introduction in human and animal populations has led to the development of surveillance systems based on automated monitoring of health data. Real- or near real-time monitoring of pre-diagnostic data requires automated classification of records into syndromes--syndromic surveillance--using algorithms that incorporate medical knowledge in a reliable and efficient way, while remaining comprehensible to end users.Entities:
Mesh:
Year: 2013 PMID: 23505427 PMCID: PMC3591392 DOI: 10.1371/journal.pone.0057334
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sample of the data available, restricted to the fields relevant for syndrome classification.
| Date | Sample ID | Client Sample ID | Sample Type | Diagnostic test code | Diagnostic test description |
| 2010-01-04 | 10-####-0001 | Tulip | Milk | Beta-Lactamase_Test |
|
| 2010-01-04 | 10-####-0002 | Plum |
| Culture_Bact | Bacterial_culture |
| 2010-01-04 | 10-$$$$-0005 | A517_SMALL |
| Culture_Bact | Bacterial_culture |
| 2010-01-04 | 10-$$$$-0009 | B516 | Tissue_Pooled | RLA |
|
| 2010-01-04 | 10-$$$$-0010 | #517,_#516 | Tissue_-_Fixed | Histopathology | Histopathology |
| 2010-01-07 | 10-####-0002 | 139_W-H-1_-_ | Fluid | Culture_Bact | Bacterial_culture |
| 2010-01-07 | 10-####-0004 | 139_W-H-1_-_ | Tissue | Culture_Bact | Bacterial_culture |
| 2010-01-05 | 10-$$$$-0001 | Webb/None_Given | Tissue_-_Fixed | IHC_-_Bov_Corona |
|
| 2010-01-05 | 10-$$$$-0002 | Webb/None_Given | Ear_-_Notch | BVDV_Antigen_ELISA |
|
| 2010-01-05 | 10-####-0001 | 11675_BOOSTER_110004 |
| Culture_Bact | Bacterial_culture |
| 2010-01-27 | 10-$$$$-0031 | Black_Face_w_white_spot | Blood_-_Serum | N._caninum_ELISA |
|
| 2010-01-27 | 10-####-0002 |
| Tissue | Culture_Bact | Bacterial_culture |
| 2010-01-27 | 10-####-0003 | LuLiKiSpThTy | Tissue_Pooled | Cell_Cult_Isolation | Virus_isolation_in_cell_culture |
| 2010-01-27 | 10-####-0005 |
| Tissue | Culture_Bact | Bacterial_culture |
| 2010-01-27 | 10-####-0006 |
| Tissue | Culture_Bact | Bacterial_culture |
The field containing Submission ID was removed to ensure confidentiality, and omitted in the Sample ID shown.
Samples from the same case are represented in the table with consecutive rows of the same shading. Keywords and test names relevant for classification are shown in bold.
Syndromic groups, defined based on an evaluation of three years of diagnostic test requests.
| Syndromic group | Criteria for syndromic group creation | Number of test requests | Number of cases |
| Abortion | Clinical sign | 559 | 225 |
| Circulatory | Organ systems | 57 | 50 |
| Eyes and ears | 37 | 20 | |
| GIT | 8,733 | 2,564 | |
| Haematopoietic | 231 | 199 | |
| Hepatic | 135 | 119 | |
| Mastitis | 49,246 | 6,766 | |
| Musculoskeletal | 233 | 149 | |
| Nervous | 150 | 129 | |
| Reproductive | 857 | 192 | |
| Respiratory | 8,501 | 1,452 | |
| Skin and Tegument | 14 | 7 | |
| Systemic | 3,328 | 700 | |
| Urinary | 501 | 146 | |
| BSE | Individual diseases with high number of testrequests | 5,306 | 158 |
| BLV | 34,468 | 3,321 | |
| BVD | 12,689 | 2,354 | |
| Johnes disease | 11,123 | 2,040 | |
| Neosporosis | 6,198 | 1,467 | |
| Clinical Pathology (hematology/biochemistry) | Other types of tests | 61,059 | 4,282 |
| Environmental samples | 655 | 58 | |
| Antimicrobial susceptibility | 140 | 33 | |
| Toxicology | 6,866 | 955 | |
| Nonspecific samples | Samples whose syndromic group could not be determined | 7,708 | 3,374 |
| Total | 218,795 | 30,760 |
GIT = Gastro-intestinal tract; BSE = Bovine Spongiform Encephalopathy; BLV = Bovine Leukemia Virus; BVD = Bovine Viral Diarrhea.
BSE test requests are large compared to counts of other test submissions that can be classified as “Nervous”.
The number of cases after classification is higher than the initial number of cases because multiple syndromes can be identified within a single submission.
Figure 1Number of syndromes identified in each case using information from individual test requests.
Figure 2Percentage of test requests classified by direct mapping and automated classification.
Instances and syndromic groups in the unmapped subset of the data.
| Syndromicgroup | Instances | Percentageof total | Cumulativepercentage |
| Mastitis | 38,934 | 73.26% | 73.26% |
| Nonspecific | 7,667 | 14.43% | 87.68% |
| GIT | 2,857 | 5.38% | 93.06% |
| Respiratory | 1,309 | 2.46% | 95.52% |
| Reproductive | 732 | 1.38% | 96.90% |
| Abortion | 553 | 1.04% | 97.94% |
| Musculoskeletal | 232 | 0.44% | 98.38% |
| Haematopoietic | 231 | 0.43% | 98.81% |
| Hepatic | 129 | 0.24% | 99.06% |
| Urinary | 125 | 0.24% | 99.29% |
| Envir. samples | 109 | 0.21% | 99.50% |
| Systemic | 98 | 0.18% | 99.68% |
| Nervous | 67 | 0.13% | 99.81% |
| Circulatory | 57 | 0.11% | 99.91% |
| Eyes and ears | 38 | 0.07% | 99.98% |
| Skin and Tegument | 8 | 0.02% | 100.00% |
| Total | 53,146 |
Performance measures for the algorithms implemented.
| Class average (Macro) | Weighted average (micro) | |||||
| Algorithm | recall | precision | F-score | recall | precision | F-score |
| Manually modified rules | .994 | 1.000 | .997 | 1.000 | 1.000 | 1.000 |
| Rule Induction | .626 | .793 | .677 | .991 | .981 | .979 |
| Naïve Bayes | .983 | .939 | .955 | .994 | .996 | .994 |
| Decision Trees | .290 | .416 | .311 | .936 | .937 | .923 |
The total number of groups in the training data was 16, and the total number of instances 53,146.
The Rule Induction algorithms failed to learn 3 classes, and the Decision Tree 11 classes.
Figure 3Daily counts of cases allocated to Bovine Viral Diarrhea (top) and Mastitis (bottom) syndromes.