Friday 10 January 2020

Benford's Law as an Indicator of Survey Reliability: Can we trust our data?

an article by Micha Kaiser (University of Hohenheim and Center for Consumer, Markets, and Politics (CCMP)) published in Journal of Economic Surveys Volume 33 Issue 5 (December 2019)

Abstract

This paper analyses how closely different income measures conform to Benford's law, a mathematical predictor of probable first digit distribution across many sets of numbers.

Because Benford's law can be used to test data set reliability, we use a Benford analysis to assess the quality of six widely used survey data sets.

Our findings indicate that although income generally obeys Benford's law, almost all the data sets show substantial discrepancies from it, which we interpret as a strong indicator of reliability issues in the survey data. This result is confirmed by a simulation, which demonstrates that household level income data do not manifest the same poor performance as individual level data.

This finding implies that researchers should focus on household level characteristics whenever possible to reduce observation errors.

JEL Classification: C18, C15, C46, C55, C81, I100

Full text (PDF 17pp)

NOTE: You will probably need a higher or more recent qualification than my O-Level in Statistics to follow all the arguments in this paper.

Labels:
Benford's_law, data_quality, fraud_detection, measurement_error, survey_quality,


No comments: