Predictive policing

Alex Reinhart – Updated January 16, 2019 notebooks · refsmmat.com

See also Policing, Predicting recidivism.

Hotspots

Crime tends to concentrate at places, so we find the places and direct policing. A very straightforward intervention-oriented approach.

Crime concentration

Finding hotspots

Usually clustering methods or kernel densities: pick the areas with clusters or the highest crime density. There are conflicting results on what works best, but I don’t like the metrics anyway; the PAI and RRI don’t seem to measure useful quantities, particularly when you arbitrarily choose your threshold for defining “hotspot” and don’t compare across a range of thresholds, ROC-style.

For evaluation metrics:

Explaining hotspots

Experimental trials

Various experiments have tested whether directing patrols to hotspots reduces crime, to generally positive results.

But trying to solve community problems may be better than just saturation patrol:

One curious trial finds increases in crime when hotspot patrols are predictable: [To read] Ariel, B., & Partridge, H. (2016). Predictable Policing: Measuring the Crime Control Benefits of Hotspots Policing at Bus Stops. Journal of Quantitative Criminology, 1–25. doi:10.1007/s10940-016-9312-y

Risk Terrain Modeling

A spatial technique to identify spatial features which lead to crime. Works by identifying risk factors (bars, foreclosures, schools, etc.), mapping these, and then seeing how well they predict crime.

The initial iteration just added up the number of risk factors, then used a logistic regression to predict presence or absence of crime: Kennedy, L. W., Caplan, J. M., & Piza, E. L. (2010). Risk Clusters, Hotspots, and Spatial Intelligence: Risk Terrain Modeling as an Algorithm for Police Resource Allocation Strategies. Journal of Quantitative Criminology, 27(3), 339–362. doi:10.1007/s10940-010-9126-2

Model selection was just “which logistic regression has the biggest slope”, which naturally biases it to the models with fewer risk factors, since their risk values have a smaller range (as just a count of present factors) and hence must have a larger slope. Variable selection used a bunch of univariate chi-squareds, and I’m dubious about using p values to decide which variable predicts best.

Then came an update which uses elastic net penalized regression to fit a Poisson model, picking the best penalty via cross-validation, then further reducing the model with stepwise regression and BIC. (Why not just adjust the penalty parameter for more sparsity?) Features were included as three binary variables for proximity (within 426, 852, or 1278 feet) and three different kernel densities (with those three bandwidths), for reasons I do not understand: Kennedy, L. W., Caplan, J. M., Piza, E. L., & Buccine-Schraeder, H. (2016). Vulnerability and Exposure to Crime: Applying Risk Terrain Modeling to the Study of Assault in Chicago. Applied Spatial Analysis and Policy, 9(4), 529–548. doi:10.1007/s12061-015-9165-z

Other spatial methods

Near repeats

Crimes tend to be followed by nearby crimes, e.g. from a burglar returning to an area to try a new target.

Counting repeats

A bunch of papers use the Knox test, a permutation test that compares the number of crimes nearby in space and time with the permutation null. Requires discrete choice of cutoffs for “nearby”, so claims of distances of effects are really claims about the power of the test. (If significance is only found within 200m, would it be found at 300m if we had more data?) Implemented in the Near Repeat Calculator, widely used.

Another approach models choice of houses to burgle with a multinomial logit, where the outcome is the choice of house: Ratcliffe, J. H., & Rengert, G. F. (2008). Near-Repeat Patterns in Philadelphia Shootings. Security Journal, 21(1-2), 58–76. doi:10.1057/palgrave.sj.8350068

K functions

Ripley’s K function provides a continuous analog of the Knox test statistic. It’s a normalized count of the average number of points within a given distance of an arbitrary event, so it’s function of distance instead of having an arbitrary cutoff; a natural space-time generalization counts the average number within a given distance and a given time. Plotting these gives a sense of the scale and decay of near-repeat effects.

Used to compare before and after stop-and-frisk events: Wooditch, A., & Weisburd, D. (2016). Using Space-Time Analysis to Evaluate Criminal Justice Programs: An Application to Stop-Question-Frisk Practices. Journal of Quantitative Criminology, 32(2), 191–213. doi:10.1007/s10940-015-9259-4

Heterogeneity vs. state dependence

Burglaries are the most common crime studied, presumably because the theory is clear: burglars like returning to areas they’re familiar with. But this is easily confounded with spatial heterogeneity: some places are better to burgle than others, regardless of whether they were recently burgled. This seems connected to the state dependence vs. heterogeneity problem, Heckman, J. J. (1991). Identifying the hand of past: Distinguishing state dependence from heterogeneity. The American Economic Review, 81(2), 75–79. http://www.jstor.org/stable/2006829

Interventions

Self-exciting point process models

See also Self-exciting point processes.

It’d be useful to combine hotspot models and near-repeat effects. As Gorr has pointed out, hotspots can be either chronic (like the methods above try to find) or temporary, caused by, say, a new burglar hitting several houses in an area. Gorr, W. L., & Lee, Y. (2015). Early Warning System for Temporary Crime Hot Spots. Journal of Quantitative Criminology, 31(1), 25–47. doi:10.1007/s10940-014-9223-8

Mohler and colleagues have a series of papers on self-exciting models for crime, which allow both chronic hotspots and self-exciting temporary clusters:

Their methods have been adapted by others. (See also the Epidemic/endemic models section of Self-exciting point processes for application to epidemiology.)

There are also modeling approaches that aren’t self-exciting:

Other prediction methods

Weather

Crime is, naturally, affected by the weather.

Predictive policing and the law

A series of papers on how predictive policing interacts with the Fourth Amendment:

First, it’s surprising to see that courts already have recognized an implied Fourth Amendment exception for “high-crime areas”, which contribute to finding reasonable suspicion for a stop and search: Ferguson, A. G. (2011). Crime Mapping and the Fourth Amendment: Redrawing "High-Crime Areas". Hastings Law Journal, 63(1), 179–232. http://www.hastingslawjournal.org/2014/04/03/crime-mapping-and-the-fourth-amendment-redrawing-high-crime-areas/

Next, more on the concerns caused by data and predictive policing being used to justify searches: