OBJECTIVE: The US Vaccine Adverse Event Reporting System (VAERS) collects spontaneous reports of adverse events following vaccination. Medical officers review the reports and often apply standardized case definitions, such as those developed by the Brighton Collaboration. Our objective was to demonstrate a multi-level text mining approach for automated text classification of VAERS reports that could potentially reduce human workload. DESIGN: We selected 6034 VAERS reports for H1N1 vaccine that were classified by medical officers as potentially positive (N(pos)=237) or negative for anaphylaxis. We created a categorized corpus of text files that included the class label and the symptom text field of each report. A validation set of 1100 labeled text files was also used. Text mining techniques were applied to extract three feature sets for important keywords, low- and high-level patterns. A rule-based classifier processed the high-level feature representation, while several machine learning classifiers were trained for the remaining two feature representations. MEASUREMENTS: Classifiers' performance was evaluated by macro-averaging recall, precision, and F-measure, and Friedman's test; misclassification error rate analysis was also performed. RESULTS: Rule-based classifier, boosted trees, and weighted support vector machines performed well in terms of macro-recall, however at the expense of a higher mean misclassification error rate. The rule-based classifier performed very well in terms of average sensitivity and specificity (79.05% and 94.80%, respectively). CONCLUSION: Our validated results showed the possibility of developing effective medical text classifiers for VAERS reports by combining text mining with informative feature selection; this strategy has the potential to reduce reviewer workload considerably.
OBJECTIVE: The US Vaccine Adverse Event Reporting System (VAERS) collects spontaneous reports of adverse events following vaccination. Medical officers review the reports and often apply standardized case definitions, such as those developed by the Brighton Collaboration. Our objective was to demonstrate a multi-level text mining approach for automated text classification of VAERS reports that could potentially reduce human workload. DESIGN: We selected 6034 VAERS reports for H1N1 vaccine that were classified by medical officers as potentially positive (N(pos)=237) or negative for anaphylaxis. We created a categorized corpus of text files that included the class label and the symptom text field of each report. A validation set of 1100 labeled text files was also used. Text mining techniques were applied to extract three feature sets for important keywords, low- and high-level patterns. A rule-based classifier processed the high-level feature representation, while several machine learning classifiers were trained for the remaining two feature representations. MEASUREMENTS: Classifiers' performance was evaluated by macro-averaging recall, precision, and F-measure, and Friedman's test; misclassification error rate analysis was also performed. RESULTS: Rule-based classifier, boosted trees, and weighted support vector machines performed well in terms of macro-recall, however at the expense of a higher mean misclassification error rate. The rule-based classifier performed very well in terms of average sensitivity and specificity (79.05% and 94.80%, respectively). CONCLUSION: Our validated results showed the possibility of developing effective medical text classifiers for VAERS reports by combining text mining with informative feature selection; this strategy has the potential to reduce reviewer workload considerably.
Authors: Jeffrey A Linder; Jennifer S Haas; Aarthi Iyer; Michael A Labuzetta; Michael Ibara; Michael Celeste; George Getty; David W Bates Journal: Pharmacoepidemiol Drug Saf Date: 2010-12 Impact factor: 2.890
Authors: Jens U Rüggeberg; Michael S Gold; José-Maria Bayas; Michael D Blum; Jan Bonhoeffer; Sheila Friedlander; Glacus de Souza Brito; Ulrich Heininger; Babatunde Imoukhuede; Ali Khamesipour; Michel Erlewyn-Lajeunesse; Susana Martin; Mika Mäkelä; Patricia Nell; Vitali Pool; Nick Simpson Journal: Vaccine Date: 2007-03-12 Impact factor: 3.641
Authors: Jan Bonhoeffer; Katrin Kohl; Robert Chen; Philippe Duclos; Harald Heijbel; Ulrich Heininger; Tom Jefferson; Elisabeth Loupi Journal: Vaccine Date: 2002-12-13 Impact factor: 3.641
Authors: Rave Harpaz; Alison Callahan; Suzanne Tamang; Yen Low; David Odgers; Sam Finlayson; Kenneth Jung; Paea LePendu; Nigam H Shah Journal: Drug Saf Date: 2014-10 Impact factor: 5.606
Authors: Taxiarchis Botsis; Thomas Buttolph; Michael D Nguyen; Scott Winiecki; Emily Jane Woo; Robert Ball Journal: J Am Med Inform Assoc Date: 2012-08-25 Impact factor: 4.497
Authors: James A McCart; Donald J Berndt; Jay Jarman; Dezon K Finch; Stephen L Luther Journal: J Am Med Inform Assoc Date: 2012-12-15 Impact factor: 4.497