TY - GEN
T1 - Enabling real-Time drug abuse detection in tweets
AU - Phan, Nhathai
AU - Bhole, Manasi
AU - Ae Chun, Soon
AU - Geller, James
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/5/16
Y1 - 2017/5/16
N2 - Prescription drug abuse is one of the fastest growing public health problems in the USA. To address this epidemic, a near real-Time monitoring strategy, instead of one resorting to a retrospective health records, may improve detecting the prevalence and patterns of abuse of both illegal drugs and prescription medications. In this paper, our primary goals are to demonstrate the possibility of utilizing social media, e.g., Twitter, for automatic monitoring of illegal drug and prescription medication abuse. We use machine learning methods for an automatic classification that can identify tweets that are indicative of drug abuse. We collected tweets associated with well-known illegal and prescription drugs. We manually annotated 300 tweets that are likely to be related to drug abuse. Our experiment compares a set of classification algorithms, and a decision tree classifier J48, and the SVM outperform others for determining whether tweets contain signals of drug abuse. This automatic supervised classification study results illustrate the utility of Twitter in examining patterns of abuse, and show the feasibility of building the drug abuse detection system that can process large volume data from social media sources in a near real-Time.
AB - Prescription drug abuse is one of the fastest growing public health problems in the USA. To address this epidemic, a near real-Time monitoring strategy, instead of one resorting to a retrospective health records, may improve detecting the prevalence and patterns of abuse of both illegal drugs and prescription medications. In this paper, our primary goals are to demonstrate the possibility of utilizing social media, e.g., Twitter, for automatic monitoring of illegal drug and prescription medication abuse. We use machine learning methods for an automatic classification that can identify tweets that are indicative of drug abuse. We collected tweets associated with well-known illegal and prescription drugs. We manually annotated 300 tweets that are likely to be related to drug abuse. Our experiment compares a set of classification algorithms, and a decision tree classifier J48, and the SVM outperform others for determining whether tweets contain signals of drug abuse. This automatic supervised classification study results illustrate the utility of Twitter in examining patterns of abuse, and show the feasibility of building the drug abuse detection system that can process large volume data from social media sources in a near real-Time.
KW - Drug abuse
KW - Illegal drug
KW - Online social network
KW - Prescription drug
KW - Social media
UR - http://www.scopus.com/inward/record.url?scp=85021254031&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85021254031&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2017.221
DO - 10.1109/ICDE.2017.221
M3 - Conference contribution
AN - SCOPUS:85021254031
T3 - Proceedings - International Conference on Data Engineering
SP - 1510
EP - 1514
BT - Proceedings - 2017 IEEE 33rd International Conference on Data Engineering, ICDE 2017
PB - IEEE Computer Society
T2 - 33rd IEEE International Conference on Data Engineering, ICDE 2017
Y2 - 19 April 2017 through 22 April 2017
ER -