Identifying Potential Adverse Effects Using the Web: A New Approach to Medical Hypothesis Generation

Loading...
Thumbnail Image
Penn collection
Operations, Information and Decisions Papers
Degree type
Discipline
Subject
data mining
information extraction; medical message board
drug adverse effect
Health Services Research
Other Medical Sciences
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Benton, Adrian
Ungar, Lyle
Hill, Shawndra
Hennessy, Sean
Mao, Jun
Chung, Annie
Leonard, Charles E
Holmes, John H
Contributor
Abstract

Medical message boards are online resources where users with a particular condition exchange information, some of which they might not otherwise share with medical providers. Many of these boards contain a large number of posts and contain patient opinions and experiences that would be potentially useful to clinicians and researchers. We present an approach that is able to collect a corpus of medical message board posts, de-identify the corpus, and extract information on potential adverse drug effects discussed by users. Using a corpus of posts to breast cancer message boards, we identified drug event pairs using co-occurrence statistics. We then compared the identified drug event pairs with adverse effects listed on the package labels of tamoxifen, anastrozole, exemestane, and letrozole. Of the pairs identified by our system, 75–80% were documented on the drug labels. Some of the undocumented pairs may represent previously unidentified adverse drug effects.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2011-12-01
Journal title
Journal of Biomedical Informatics
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Recommended citation
Collection