| Literature DB >> 22166371 |
Abstract
BACKGROUND: Online news reports are increasingly becoming a source for event-based early warning systems that detect natural disasters. Harnessing the massive volume of information available from multilingual newswire presents as many challanges as opportunities due to the patterns of reporting complex spatio-temporal events.Entities:
Year: 2011 PMID: 22166371 PMCID: PMC3239300 DOI: 10.1186/2041-1480-2-S5-S10
Source DB: PubMed Journal: J Biomed Semantics
Figure 1Epidemics in the silver standard. Epidemics covered by the silver standard and numbers of events found by BioCaster in English and all languages. PM: Number of ProMED-mail postings, i.e. the number of true positives for each outbreak data set over the trial period of 134 days; RVF = Rift Valley Fever; FMD = Foot-and-mouth disease; HFMD = Hand, foot and mouth disease; AI = Avian Influenza. e-numbers are event series labels.
Aggregated results for global events
| English only news (Worldwide) | |||||||
|---|---|---|---|---|---|---|---|
| model | Se | Sp | PPV | NPV | Alarms | Days | F1 |
| C3 | 0.52 | 0.95 | 0.54 | 0.91 | 7.6 | 3.2 | 0.53 |
| (0.44,0.59) | (0.94,0.96) | (0.46,0.61) | (0.93,0.96) | ||||
| C2 | 0.42 | 0.97 | 0.57 | 0.92 | 5.1 | 3.1 | 0.48 |
| (0.34,0.50) | (0.96,0.98) | (0.48,0.66) | (0.93,0.96) | ||||
| W2 | 0.42 | 0.97 | 0.59 | 0.92 | 5.2 | 3.1 | 0.49 |
| (0.34,0.50) | (0.96,0.98) | (0.49,0.68) | (0.93,0.95) | ||||
| F-stat | 0.67 | 0.88 | 0.45 | 0.85 | 16.2 | 4.0 | 0.54 |
| (0.61,0.73) | (0.86,0.89) | (0.40,0.51) | (0.93,0.96) | ||||
| EWMA | 0.44 | 0.95 | 0.51 | 0.90 | 6.5 | 3.0 | 0.47 |
| (0.37,0.52) | (0.94,0.96) | (0.42,0.59) | (0.92,0.95) | ||||
| All language news (Worldwide) | |||||||
| model | Se | Sp | PPV | NPV | Alarms | Days | F1 |
| C3 | 0.67 | 0.91 | 0.48 | 0.89 | 12.0 | 4 | 0.56 |
| (0.59,0.73) | (0.90,0.93) | (0.41,0.54) | (0.95,0.97) | ||||
| C2 | 0.54 | 0.95 | 0.49 | 0.91 | 7.1 | 3.7 | 0.51 |
| (0.46,061) | (0.94,0.96) | (0.42,0.56) | (0.95,0.97) | ||||
| W2 | 0.55 | 0.95 | 0.52 | 0.91 | 10.6 | 3.7 | 0.54 |
| (0.47,0.63) | (0.94,0.96) | (0.44,0.60) | (0.94,0.97) | ||||
| F-stat | 0.87 | 0.80 | 0.45 | 0.80 | 26.6 | 5.3 | 0.60 |
| (0.83,0.91) | (0.77,0.81) | (0.41,0.50) | (0.96,0.98) | ||||
| EWMA | 0.48 | 0.93 | 0.44 | 0.89 | 7.8 | 3.7 | 0.46 |
| (0.40,0.56) | (0.92,0.94) | (0.36,0.52) | (0.93,0.95) | ||||
Aggregated evaluation metrics for data sets e1 to e16 stratified by source language. The mean number of ProMED-mail alerts per 100 days was 7.4. Model alarms per 100 days; Mean number of days that alerts were given before ProMED-mail reports. Figures in parentheses show 95% CI.
Aggregated results for SE Asian events
| English only news (SE Asia) | |||||||
|---|---|---|---|---|---|---|---|
| model | Se | Sp | PPV | NPV | Alarms | Days | F1 |
| C3 | 0.62 | 0.94 | 0.53 | 0.9 | 9.7 | 4.0 | 0.57 |
| (0.49,0.72) | (0.92,0.96) | (0.42,0.64) | (0.93,0.97) | ||||
| C2 | 0.53 | 0.96 | 0.61 | 0.92 | 6.6 | 3.9 | 0.57 |
| (0.41,0.66) | (0.95,0.98) | (0.47,0.73) | (0.93,0.97) | ||||
| W2 | 0.50 | 0.97 | 0.62 | 0.92 | 6.5 | 3.8 | 0.55 |
| (0.38,062) | (0.95,0.98) | (0.48,0.74) | (0.92,0.96) | ||||
| F-stat | 0.76 | 0.83 | 0.42 | 0.82 | 20.9 | 5.0 | 0.54 |
| (0.67,0.84) | (0.80,0.86) | (0.35,0.50) | (0.94,0.97) | ||||
| EWMA | 0.55 | 0.95 | 0.6 | 0.91 | 7.8 | 3.9 | 0.57 |
| (0.43,0.66) | (0.93,0.97) | (0.47,0.71) | (0.92,0.96) | ||||
| All language news8 (SE Asia) | |||||||
| model | Se | Sp | PPV | NPV | Alarms | Days | F1 |
| C3 | 0.71 | 0.91 | 0.50 | 0.89 | 13.4 | 4.9 | 0.59 |
| (0.60,0.80) | (0.88,0.93) | (0.41,0.59) | (0.94,0.97) | ||||
| C2 | 0.62 | 0.94 | 0.50 | 0.91 | 8.3 | 4.3 | 0.55 |
| (0.48,0.74) | (0.92,0.96) | (0.38,0.62) | (0.94,0.98) | ||||
| W2 | 0.61 | 0.94 | 0.53 | 0.91 | 17.1 | 4.6 | 0.57 |
| (0.49,0.73) | (0.92,0.96) | (0.41,0.65) | (0.94,0.97) | ||||
| F-stat | 0.90 | 0.77 | 0.47 | 0.79 | 30.7 | 5.8 | 0.62 |
| (0.84,0.94) | (0.73,0.80) | (0.40,0.53) | (0.95,0.98) | ||||
| EWMA | 0.53 | 0.94 | 0.48 | 0.89 | 8.1 | 3.9 | 0.50 |
| (0.40,0.65) | (0.91,0.96) | (0.36,0.61) | (0.92,0.96) | ||||
Aggregated evaluation metrics for data sets e1 to e6 stratified by source language. The mean number of ProMED-mail alerts per 100 days was 8.1. Model alarms per 100 days; Mean number of days that alerts were given before ProMED-mail reports. Figures in parentheses show 95% CI.