scispace - formally typeset
Open AccessJournal ArticleDOI

Which method predicts recidivism best?: a comparison of statistical, machine learning and data mining predictive models

TLDR
In this article, prediction mod els are developed that predict three types of criminal recidivism: general recidivisitc, violent re-conviction, and sexual recidivasitc.
Abstract
Using criminal population conviction histories of recent offenders, prediction mod els are developed that predict three types of criminal recidivism: general recidivism, violent recidivism and sexual recidivism. The research question is whether prediction techniques from modern statistics, data mining and machine learning provide an improvement in predictive performance over classical statistical methods, namely logistic regression and linear discrim inant analysis. These models are compared on a large selection of performance measures. Results indicate that classical methods do equally well as or better than their modern counterparts. The predictive performance of the various techniques differs only slightly for general and violent recidivism, whereas differences are larger for sexual recidivism. For the general and violent recidivism data we present the results of logistic regression and for sexual recidivism of linear discriminant analysis. Language: en

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

TL;DR: This Perspective clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications whereinterpretable models could potentially replace black box models in criminal justice, healthcare and computer vision.
Posted Content

Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead

TL;DR: In this article, the chasm between explaining black box models and using inherently interpretable models is identified, and several key reasons why explainable models should be avoided in high-stakes decisions.
Posted Content

Please Stop Explaining Black Box Models for High Stakes Decisions.

Cynthia Rudin
TL;DR: There is a way forward – it is to design models that are inherently interpretable, rather than trying to explain black box models, which is likely to perpetuate bad practices and can potentially cause catastrophic harm to society.
Journal ArticleDOI

Interpretable Classification Models for Recidivism Prediction

TL;DR: A recent method called supersparse linear integer models is used to produce accurate, transparent and interpretable scoring systems along the full ROC curve, which can be used for decision making for many different use cases.
Journal ArticleDOI

Modeling the Spread of COVID-19 Infection Using a Multilayer Perceptron.

TL;DR: This dataset, intended to be a time-series dataset, is transformed into a regression dataset and used in training a multilayer perceptron (MLP) artificial neural network (ANN) to achieve a worldwide model of the maximal number of patients across all locations in each time unit.
References
More filters
Journal ArticleDOI

Support-Vector Networks

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Journal ArticleDOI

A Coefficient of agreement for nominal Scales

TL;DR: In this article, the authors present a procedure for having two or more judges independently categorize a sample of units and determine the degree, significance, and significance of the units. But they do not discuss the extent to which these judgments are reproducible, i.e., reliable.
Journal ArticleDOI

The meaning and use of the area under a receiver operating characteristic (ROC) curve.

James A. Hanley, +1 more
- 01 Apr 1982 - 
TL;DR: A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented and it is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a random chosen non-diseased subject.
Journal ArticleDOI

Multivariate Adaptive Regression Splines

TL;DR: In this article, a new method is presented for flexible regression modeling of high dimensional data, which takes the form of an expansion in product spline basis functions, where the number of basis functions as well as the parameters associated with each one (product degree and knot locations) are automatically determined by the data.
Related Papers (5)