Detecting Hate Crimes Through Machine Learning and Natural Language Processing

In Event: Novel Uses and Approaches to Administrative Data

Thu, Nov 14, 8:00 to 9:20am, Salon 4 - Lower B2 Level

Abstract

Misidentification and misreporting of hate crimes by victims and law enforcement are significant barriers to the accurate collection of hate crime data, and consequently to the study and prevention of these crimes. Machine learning can improve the accuracy and speed with which reported incidents with bias elements are identified. This study develops a machine learning classifier that categorizes police reports as either events with bias elements or events with no bias elements. We use incident/offense reports from the Seattle Police Department to train a Natural Language Processing classification algorithm, with report narratives, location data, and victim and suspect demographics as features. We evaluate the performance of logistic regression, random forest, and XGBoost algorithms, as well as several text embedding techniques. Despite substantial class imbalance, our model achieves a macro F1-score of 0.79, demonstrating the benefits of applied machine learning in accurately detecting and reporting hate crimes.
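
The macro F1-score mentioned above averages the per-class F1 values with equal weight, which is one reason it is often preferred to plain accuracy when classes are imbalanced. The following is a minimal sketch of that calculation in Python, not the study's evaluation code. The function names `per_class_f1` and `macro_f1` and the toy labels are hypothetical and chosen only for illustration; the labels are not data from the study.

```python
def per_class_f1(y_true, y_pred, label):
    """F1 for one class, treating `label` as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    denom = 2 * tp + fp + fn
    # Convention assumed here: F1 is 0 when the class is never
    # predicted and never present.
    return 2 * tp / denom if denom else 0.0


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels that appear."""
    labels = sorted(set(y_true) | set(y_pred))
    return sum(per_class_f1(y_true, y_pred, l) for l in labels) / len(labels)


# Hypothetical toy labels: 1 = bias elements, 0 = no bias elements.
y_true = [0] * 18 + [1] * 2

# A baseline that always predicts the majority class.
y_majority = [0] * 20

accuracy = sum(t == p for t, p in zip(y_true, y_majority)) / len(y_true)
score = macro_f1(y_true, y_majority)

print(f"accuracy = {accuracy:.2f}")   # high, despite missing every positive
print(f"macro F1 = {score:.2f}")      # penalized for the missed minority class
```

In this toy case the majority-class baseline looks strong on accuracy but scores below 0.5 on macro F1, because the minority class contributes an F1 of zero to the average.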

Author

Ana Ortiz Salazar, Seattle Police Department