Abstract:

Researchers have traditionally run regressions on numerical and categorical data to detect police bias and inform decisions about criminal justice. This approach can only control for a limited set of simple features, leaving significant unexplained variation and raising concerns of omitted variable bias. Using a novel dataset of text from more than a million police stops, we propose a new method applying large language models (LLMs) to incorporate textual data into regression analysis of stop outcomes. Our LLM-boosted approach has considerably more explanatory power than traditional methods and substantially changes inferences about police bias on characteristics like gender, race, and ethnicity. It also allows us to investigate what features of police reports best predict stops and how officers differ in their conduct of stops. Incorporating textual data ultimately permits more accurate and more detailed inferences on criminal justice data.
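The abstract does not specify the implementation, but the core idea — augmenting a regression on simple covariates with features derived from stop narratives — can be sketched as follows. Everything here is an assumption for illustration: the embeddings are simulated stand-ins for LLM text embeddings, the covariates and outcome are synthetic, and ordinary least squares stands in for whatever estimator the paper actually uses.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d_embed = 500, 16

# Hypothetical tabular covariates of the kind traditional regressions use
# (e.g. binary indicators for demographic characteristics). Simulated data.
X_tab = rng.integers(0, 2, size=(n, 3)).astype(float)

# Stand-in for LLM embeddings of the police-report text. In practice these
# would come from an embedding model applied to each narrative.
X_text = rng.normal(size=(n, d_embed))

# Synthetic stop outcome, driven partly by information that only appears
# in the narrative text, so the tabular-only model leaves variation unexplained.
beta_tab = np.array([0.5, -0.3, 0.2])
beta_text = rng.normal(scale=0.4, size=d_embed)
y = X_tab @ beta_tab + X_text @ beta_text + rng.normal(scale=0.5, size=n)

def r_squared(X, y):
    """R^2 of an OLS fit with an intercept."""
    X1 = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ coef
    tss = (y - y.mean()) @ (y - y.mean())
    return 1 - (resid @ resid) / tss

r2_tab = r_squared(X_tab, y)                                   # traditional
r2_boosted = r_squared(np.column_stack([X_tab, X_text]), y)    # text-augmented
print(f"tabular-only R^2: {r2_tab:.3f}")
print(f"text-augmented R^2: {r2_boosted:.3f}")
```

Because the simulated outcome depends on the text features, the augmented regression attains a higher R^2, mirroring the abstract's claim that incorporating textual data adds explanatory power; the coefficients on the tabular covariates can also shift once the text features are included, which is the mechanism by which inferences about bias change.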