TY - GEN
T1 - Epidemic outbreak and spread detection system based on twitter data
AU - Ji, Xiang
AU - Chun, Soon Ae
AU - Geller, James
PY - 2012
Y1 - 2012
N2 - Social Network systems, such as Twitter, can serve as important data sources to provide collective intelligence and awareness of health problems in real time. The challenges of utilizing social media data include that the volume of data is large but distributed and of a highly unstructured form. Appropriate data gathering, scrubbing and aggregating efforts for these data are required to transform them for meaningful use. In this paper, we discuss such a social media data ETL (Extract-Transform-Load) method, to provide a user-friendly, dynamic method for visualizing outbreaks and the spread of developing epidemics in space and time. We have developed the Epidemics Outbreak and Spread Detection System (EOSDS) as a prototype that makes use of the rich information retrievable in real time from Twitter. EOSDS provides three different visualization methods of spreading epidemics, static map, distribution map, and filter map, to investigate public health threats in the space and time dimensions. The results of these visualizations in our experiments correlate well with relevant CDC official reports, a gold standard used by health informatics scientists. In our experiments, the EOSDS also detected an unusual situation not shown in the CDC reports, but confirmed by online news media.
AB - Social Network systems, such as Twitter, can serve as important data sources to provide collective intelligence and awareness of health problems in real time. The challenges of utilizing social media data include that the volume of data is large but distributed and of a highly unstructured form. Appropriate data gathering, scrubbing and aggregating efforts for these data are required to transform them for meaningful use. In this paper, we discuss such a social media data ETL (Extract-Transform-Load) method, to provide a user-friendly, dynamic method for visualizing outbreaks and the spread of developing epidemics in space and time. We have developed the Epidemics Outbreak and Spread Detection System (EOSDS) as a prototype that makes use of the rich information retrievable in real time from Twitter. EOSDS provides three different visualization methods of spreading epidemics, static map, distribution map, and filter map, to investigate public health threats in the space and time dimensions. The results of these visualizations in our experiments correlate well with relevant CDC official reports, a gold standard used by health informatics scientists. In our experiments, the EOSDS also detected an unusual situation not shown in the CDC reports, but confirmed by online news media.
KW - Epidemics Detection
KW - Epidemics Distribution
KW - Epidemics Spread
KW - Health Information Visualization
KW - Social Network
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=84859565382&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84859565382&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-29361-0_19
DO - 10.1007/978-3-642-29361-0_19
M3 - Conference contribution
AN - SCOPUS:84859565382
SN - 9783642293603
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 152
EP - 163
BT - Health Information Science - First International Conference, HIS 2012, Proceedings
T2 - 1st International Conference on Health Information Science, HIS 2012
Y2 - 8 April 2012 through 10 April 2012
ER -