Individual Submission Summary
Share...

Direct link:

Classifying Physical Integrity Rights Allegations using Machine Learning Methods

Fri, September 16, 10:00 to 11:30am, TBA

Abstract

We present and evaluate an automated coding process that produces categorical information about physical integrity rights violations described in annual human rights reports. The coding process builds on an existing corpus of human rights allegations. Allegations are sentences taken from country-year human rights reports published annually by Amnesty International, Human Rights Watch, and the US State Department. Each allegation sentence presents information about a specific type of physical integrity violation (disappearances, torture, killing and political imprisonment) and may also contain information about the date, actor type, location, scope, intensity, range, victim type, and whether or not the violator of the right was held accountable. The information content of the allegations varies, and each allegation only contains information about a subset of these items. Therefore, the information for each allegation is based on a probability model that relates the text contained in each allegation to human coded categorical data of each item. We use this training data to code each repression allegation for all countries in the world from 1999-2016. We present support vector machine, naïve Bayes, logistic regression-based probabilities and use cross validation to assess model performance.

Authors