
Maurus, S and Plant, C (2017)

Let's See Your Digits: Anomalous-State Detection using Benford's Law

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2017, pp. 977–986.

ISSN/ISBN: 9781450348874. DOI: 10.1145/3097983.3098101



Abstract: Benford's Law explains a curious phenomenon in which the leading digits of "naturally-occurring" numerical data are distributed in a precise fashion. In this paper we begin by showing that system metrics generated by many modern information systems like Twitter, Wikipedia, YouTube and GitHub obey this law. We then propose a novel unsupervised approach called BenFound that exploits this property to detect anomalous system events. BenFound tracks the "Benfordness" of key system metrics, like the follower counts of tweeting Twitter users or the change deltas in Wikipedia page edits. It then applies a novel Benford-conformity test in real-time to identify "non-Benford events". We investigate a variety of such events, showing that they correspond to unnatural and often undesirable system interactions like spamming, hashtag-hijacking and denial-of-service attacks. The result is a technically-uncomplicated and effective "red flagging" technique that can be used to complement existing anomaly-detection approaches. Although not without its limitations, it is highly efficient and requires neither obscure parameters, nor text streams, nor natural-language processing.
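Illustrative note (not part of the paper): the "Benfordness" idea in the abstract can be sketched by comparing observed first-digit frequencies with the Benford probabilities P(d) = log10(1 + 1/d). The sketch below is a minimal illustration under stated assumptions. It is not the BenFound conformity test, whose details are not given in this record. The function names and the use of mean absolute deviation as the distance measure are assumptions chosen for simplicity, and the input is assumed to be nonzero integer counts such as follower counts or edit deltas.

```python
import math
from collections import Counter

# Benford's first-digit probabilities: P(d) = log10(1 + 1/d), d = 1..9
BENFORD = {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def leading_digit(n: int) -> int:
    """Return the first decimal digit of a nonzero integer (sign ignored)."""
    return int(str(abs(n))[0])


def first_digit_frequencies(values):
    """Relative frequency of each leading digit 1..9; zeros are skipped."""
    digits = [leading_digit(v) for v in values if v != 0]
    if not digits:
        raise ValueError("need at least one nonzero value")
    counts = Counter(digits)
    return {d: counts.get(d, 0) / len(digits) for d in range(1, 10)}


def benford_deviation(values) -> float:
    """Mean absolute deviation from Benford (hypothetical stand-in measure).

    Smaller values mean the sample looks more Benford-like.
    """
    freqs = first_digit_frequencies(values)
    return sum(abs(freqs[d] - BENFORD[d]) for d in range(1, 10)) / 9


# Powers of two are a classic Benford-conforming sequence; a plain run of
# consecutive integers is not, so its deviation should be clearly larger.
benford_like = [2 ** k for k in range(1, 1001)]
uniform_like = list(range(1, 1000))
print(benford_deviation(benford_like))
print(benford_deviation(uniform_like))
```

In a monitoring setting, a score like this could be recomputed over a sliding window of a metric, with unusually high values flagged for inspection. Any threshold would have to be chosen empirically.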


Bibtex:
@inproceedings{,
  author = {Maurus, Samuel and Plant, Claudia},
  title = {Let's See Your Digits: Anomalous-State Detection Using Benford's Law},
  year = {2017},
  isbn = {9781450348874},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  url = {https://dl.acm.org/doi/10.1145/3097983.3098101},
  doi = {10.1145/3097983.3098101},
  booktitle = {Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining},
  pages = {977--986},
}


Reference Type: Conference Paper

Subject Area(s): Computer Science, General Interest