Thursday, 28 February 2013

Which method predicts recidivism best?: a comparison of statistical, machine learning and data mining predictive models

an article by N. Tollenaar (Ministry of Security and Justice, The Hague, The Netherlands) and P. G. M. van der Heijden (University of Utrecht, The Netherlands) published in Journal of the Royal Statistical Society: Series A (Statistics in Society) Volume 176 Issue 2 (February 2013)


Using the conviction histories of recent offenders drawn from the criminal population, prediction models are developed for three types of criminal recidivism: general recidivism, violent recidivism and sexual recidivism.

The research question is whether prediction techniques from modern statistics, data mining and machine learning provide an improvement in predictive performance over classical statistical methods, namely logistic regression and linear discriminant analysis. These models are compared on a large selection of performance measures.
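As a rough illustration of what comparing models on a performance measure can look like, the sketch below computes the area under the ROC curve (AUC) for two sets of predicted scores. The abstract does not name the measures used, so AUC is an assumption here, chosen because it is a common measure of discrimination. The labels, scores and model names are invented toy values, not data or results from the article.

```python
# Illustrative only: toy labels and scores, not data from the article.

def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive case
    (here: a recidivist) receives a higher score than a randomly
    chosen negative case, counting ties as one half.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical outcomes (1 = reoffended) and scores from two hypothetical models.
labels = [1, 0, 1, 0, 0, 1]
scores_model_a = [0.9, 0.2, 0.7, 0.4, 0.1, 0.6]
scores_model_b = [0.8, 0.5, 0.4, 0.3, 0.2, 0.7]

auc_a = auc(labels, scores_model_a)
auc_b = auc(labels, scores_model_b)
print(f"model A AUC = {auc_a:.3f}, model B AUC = {auc_b:.3f}")
```

The pairwise form is quadratic in the number of cases, which is fine for a toy example; on a large offender data set a rank-based computation would be the more practical choice.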

Results indicate that classical methods perform as well as or better than their modern counterparts.

The predictive performance of the various techniques differs only slightly for general and violent recidivism, whereas differences are larger for sexual recidivism.

For the general and violent recidivism data we present the results of logistic regression and, for the sexual recidivism data, those of linear discriminant analysis.
