TY - JOUR
T1 - Using machine learning to assess rape reports
T2 - “Signaling” words about victims' credibility that predict investigative and prosecutorial outcomes
AU - Lovell, Rachel E.
AU - Klingenstein, Joanna
AU - Du, Jiaxin
AU - Overman, Laura
AU - Sabo, Danielle
AU - Ye, Xinyue
AU - Flannery, Daniel J.
N1 - Funding Information:
We want to acknowledge the survivors. These are not data points but individuals who endured intense, intimate trauma—survivors. To these survivors: We hold your stories with the reverence and respect they deserve. Thank you for inadvertently sharing them with us. We aim to leverage your stories to inform and improve our response to rape. We would like to acknowledge and thank the Cleveland Division of Police and Cuyahoga County Prosecutor's Office for their support and the researchers who worked on this project but are not co-authors, including Misty Luminais, Margaret McGuire, Grayson Holt, Julia Zalewski, and Madeline Myers.
Funding Information:
This project was supported by Grant No. 2018-VA-CX-0002, awarded by the National Institute of Justice (NIJ). NIJ is the research, development and evaluation agency of the United States Department of Justice, Office of Justice Programs (OJP). Points of view or opinions in this document are those of the authors and do not necessarily represent the official position or policies of the US Department of Justice.
Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/9/1
Y1 - 2023/9/1
N2 - Purpose: The second of two articles from a larger study whose aim was to teach a computer to detect innuendo (or signaling) about a victim's credibility in incident reports of rape. This study explored if the words expressed or not expressed, intentionally or not, influenced case progression and outcomes. Methods: We employed machine learning, specifically text classification, to identify predictive phrases. Sample consisted of 5638 incident reports of rape with a sexual assault kit from a U.S., urban Midwestern jurisdiction. Results: As hypothesized, predictive phrases were different in cases that stalled earlier. Cases not recommended for prosecution lacked detail and more heavily mentioned: (in)actions of victims, actions that stall cases, and procedural words. Reports where victims were not believed or unfounded were similarly vague, procedural, and terse. Cases recommended for prosecution predictively mentioned suspects and the rape statute. Conclusions: We taught a computer to detect signaling via words that were predictive of case progression and outcomes. Negative signals about a victim's credibility often presented as unqualified statements of “fact” or observations or procedural words, indicating a focus on the process vs. victim or suspect. Implications and recommendations are provided, including how unqualified doubts about victims' credibility have substantial public safety consequences.
AB - Purpose: The second of two articles from a larger study whose aim was to teach a computer to detect innuendo (or signaling) about a victim's credibility in incident reports of rape. This study explored if the words expressed or not expressed, intentionally or not, influenced case progression and outcomes. Methods: We employed machine learning, specifically text classification, to identify predictive phrases. Sample consisted of 5638 incident reports of rape with a sexual assault kit from a U.S., urban Midwestern jurisdiction. Results: As hypothesized, predictive phrases were different in cases that stalled earlier. Cases not recommended for prosecution lacked detail and more heavily mentioned: (in)actions of victims, actions that stall cases, and procedural words. Reports where victims were not believed or unfounded were similarly vague, procedural, and terse. Cases recommended for prosecution predictively mentioned suspects and the rape statute. Conclusions: We taught a computer to detect signaling via words that were predictive of case progression and outcomes. Negative signals about a victim's credibility often presented as unqualified statements of “fact” or observations or procedural words, indicating a focus on the process vs. victim or suspect. Implications and recommendations are provided, including how unqualified doubts about victims' credibility have substantial public safety consequences.
KW - Machine learning
KW - Sexual assault
KW - Signaling
KW - Text classification
KW - Victim credibility
UR - http://www.scopus.com/inward/record.url?scp=85169798114&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85169798114&partnerID=8YFLogxK
U2 - 10.1016/j.jcrimjus.2023.102107
DO - 10.1016/j.jcrimjus.2023.102107
M3 - Article
AN - SCOPUS:85169798114
SN - 0047-2352
VL - 88
JO - Journal of Criminal Justice
JF - Journal of Criminal Justice
M1 - 102107
ER -