### Campanelli, L (2022)

#### Testing Benford's Law: from small to very large data sets

Preprint submitted to Spanish Journal of Statistics.

**ISSN/ISBN:** Not available at this time.
**DOI:** 10.13140/RG.2.2.19884.95363

There are no links available at this time.

**Abstract:** We discuss some limitations of the use of generic tests, such as the Pearson’s χ2, for testing Benford’s law. Statistics with known distribution and constructed under the specific null hypothesis that Benford’s law holds, such as the Euclidean distance, are more appropriate when assessing the goodness-of-fit to Benford’s law, and should be preferred over generic tests in quantitative analyses. The rule of thumb proposed by Goodman for compliance checking to Benford’s law, instead, is shown to be statistically unfounded. For very large sample sizes (N > 1000), all existing statistical tests are inappropriate for testing Benford’s law due to its empirical nature. We propose a new statistic whose sample values are asymptotically independent on the sample size making it a natural candidate for testing Benford’s law in very large data sets.

**Bibtex:**

```
@misc{,
author = {Leonardo Campanelli},
title = {Testing Benford’s Law: from small to very large data sets},
year = {2022},
doi = {10.13140/RG.2.2.19884.95363},
}
```

**Reference Type:** Preprint

**Subject Area(s):** Statistics