TY - GEN
T1 - Detecting political bias trolls in Twitter data
AU - Chun, Soon Ae
AU - Holowczak, Richard
AU - Dharan, Kannan Neten
AU - Wang, Ruoyu
AU - Basu, Soumaydeep
AU - Geller, James
PY - 2019/1/1
Y1 - 2019/1/1
N2 - Ever since Russian trolls have been brought to light, their interference in the 2016 US Presidential elections has been monitored and studied. These Russian trolls employ fake accounts registered on several major social media sites to influence public opinion in other countries. Our work involves discovering patterns in these tweets and classifying them by training different machine learning models such as Support Vector Machines, Word2vec, Google BERT, and neural network models, and then applying them to several large Twitter datasets to compare the effectiveness of the different models. Two classification tasks are utilized for this purpose. The first one is used to classify any given tweet as either troll or non-troll tweet. The second model classifies specific tweets as coming from left trolls or right trolls, based on apparent extreme political orientations. On the given data sets, Google BERT provides the best results, with an accuracy of 89.4% for the left/right troll detector and 99% for the troll/non-troll detector. Temporal, geographic, and sentiment analyses were also performed and results were visualized.
AB - Ever since Russian trolls have been brought to light, their interference in the 2016 US Presidential elections has been monitored and studied. These Russian trolls employ fake accounts registered on several major social media sites to influence public opinion in other countries. Our work involves discovering patterns in these tweets and classifying them by training different machine learning models such as Support Vector Machines, Word2vec, Google BERT, and neural network models, and then applying them to several large Twitter datasets to compare the effectiveness of the different models. Two classification tasks are utilized for this purpose. The first one is used to classify any given tweet as either troll or non-troll tweet. The second model classifies specific tweets as coming from left trolls or right trolls, based on apparent extreme political orientations. On the given data sets, Google BERT provides the best results, with an accuracy of 89.4% for the left/right troll detector and 99% for the troll/non-troll detector. Temporal, geographic, and sentiment analyses were also performed and results were visualized.
KW - Alt-right tweets
KW - Election manipulation
KW - Political biases
KW - Social network mining
KW - Troll detection
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=85074248570&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074248570&partnerID=8YFLogxK
M3 - Conference contribution
T3 - WEBIST 2019 - Proceedings of the 15th International Conference on Web Information Systems and Technologies
SP - 334
EP - 342
BT - WEBIST 2019 - Proceedings of the 15th International Conference on Web Information Systems and Technologies
A2 - Bozzon, Alessandro
A2 - Mayo, Francisco Jose Dominguez
A2 - Filipe, Joaquim
PB - SciTePress
T2 - 15th International Conference on Web Information Systems and Technologies, WEBIST 2019
Y2 - 18 September 2019 through 20 September 2019
ER -