Operations, Information and Decisions Papers

Document Type

Journal Article

Date of this Version

12-2011

Publication Source

Journal of Biomedical Informatics

Volume

44

Issue

6

Start Page

989

Last Page

996

DOI

10.1016/j.jbi.2011.07.005

Abstract

Medical message boards are online resources where users with a particular condition exchange information, some of which they might not otherwise share with medical providers. Many of these boards contain a large number of posts and contain patient opinions and experiences that would be potentially useful to clinicians and researchers. We present an approach that is able to collect a corpus of medical message board posts, de-identify the corpus, and extract information on potential adverse drug effects discussed by users. Using a corpus of posts to breast cancer message boards, we identified drug event pairs using co-occurrence statistics. We then compared the identified drug event pairs with adverse effects listed on the package labels of tamoxifen, anastrozole, exemestane, and letrozole. Of the pairs identified by our system, 75–80% were documented on the drug labels. Some of the undocumented pairs may represent previously unidentified adverse drug effects.

Copyright/Permission Statement

© . This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/

Keywords

data mining, information extraction; medical message board, drug adverse effect

Share

COinS
 

Date Posted: 27 November 2017

This document has been peer reviewed.