TY - JOUR
T1 - Using machine learning to assess rape reports
T2 - “Signaling” words about victims' credibility that predict investigative and prosecutorial outcomes
AU - Lovell, Rachel E.
AU - Klingenstein, Joanna
AU - Du, Jiaxin
AU - Overman, Laura
AU - Sabo, Danielle
AU - Ye, Xinyue
AU - Flannery, Daniel J.
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/9/1
Y1 - 2023/9/1
N2 - Purpose: The second of two articles from a larger study whose aim was to teach a computer to detect innuendo (or signaling) about a victim's credibility in incident reports of rape. This study explored if the words expressed or not expressed, intentionally or not, influenced case progression and outcomes. Methods: We employed machine learning, specifically text classification, to identify predictive phrases. Sample consisted of 5638 incident reports of rape with a sexual assault kit from a U.S., urban Midwestern jurisdiction. Results: As hypothesized, predictive phrases were different in cases that stalled earlier. Cases not recommended for prosecution lacked detail and more heavily mentioned: (in)actions of victims, actions that stall cases, and procedural words. Reports where victims were not believed or unfounded were similarly vague, procedural, and terse. Cases recommended for prosecution predictively mentioned suspects and the rape statute. Conclusions: We taught a computer to detect signaling via words that were predictive of case progression and outcomes. Negative signals about a victim's credibility often presented as unqualified statements of “fact” or observations or procedural words, indicating a focus on the process vs. victim or suspect. Implications and recommendations are provided, including how unqualified doubts about victims' credibility have substantial public safety consequences.
AB - Purpose: The second of two articles from a larger study whose aim was to teach a computer to detect innuendo (or signaling) about a victim's credibility in incident reports of rape. This study explored if the words expressed or not expressed, intentionally or not, influenced case progression and outcomes. Methods: We employed machine learning, specifically text classification, to identify predictive phrases. Sample consisted of 5638 incident reports of rape with a sexual assault kit from a U.S., urban Midwestern jurisdiction. Results: As hypothesized, predictive phrases were different in cases that stalled earlier. Cases not recommended for prosecution lacked detail and more heavily mentioned: (in)actions of victims, actions that stall cases, and procedural words. Reports where victims were not believed or unfounded were similarly vague, procedural, and terse. Cases recommended for prosecution predictively mentioned suspects and the rape statute. Conclusions: We taught a computer to detect signaling via words that were predictive of case progression and outcomes. Negative signals about a victim's credibility often presented as unqualified statements of “fact” or observations or procedural words, indicating a focus on the process vs. victim or suspect. Implications and recommendations are provided, including how unqualified doubts about victims' credibility have substantial public safety consequences.
KW - Machine learning
KW - Sexual assault
KW - Signaling
KW - Text classification
KW - Victim credibility
UR - http://www.scopus.com/inward/record.url?scp=85169798114&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85169798114&partnerID=8YFLogxK
U2 - 10.1016/j.jcrimjus.2023.102107
DO - 10.1016/j.jcrimjus.2023.102107
M3 - Article
AN - SCOPUS:85169798114
SN - 0047-2352
VL - 88
JO - Journal of Criminal Justice
JF - Journal of Criminal Justice
M1 - 102107
ER -