A bright shiny object: Misapplication of Benford’s law published in health economics journal

Benford’s Law is fun; it also is the source of endless confusion. Math is hard!

This came up recently when Kevin Lewis pointed to this paper, which states:

We use Benford’s law to examine the non-random elements of health care costs. We find that as health care expenditures increase, the conformity to the expected distribution of naturally occurring numbers worsens, indicating a tendency towards inefficient treatment. Government insurers follow Benford’s law better than private insurers indicating more efficient treatment. . . .

This sounded interesting, but I’m pretty sure they’re doing it wrong, because they try to evaluate the fit to Benford’s law within each “price bucket” ($100-999, $1000-9999, $10,000-99,999, and $100,000-999,999). Based on my understanding of the processes underlying Benford-like behavior, you wouldn’t necessarily expect the pattern to occur within each bin in that way.

Here’s an example of how things go wrong. The authors write:

We also follow Drake and Nigrini (2000) by calculating the mean of absolute deviations (MAD) to use as a way to assess conformity to the expected distribution. . . . We find that at the MAD for the first bucket of charges (0.010) shows a marginally acceptable conformity to Benford’s law. However, for the second (0.023), third (0.049), and fourth (0.092) buckets the MAD is greater than 0.012 indicating nonconformity. As expected, the MAD increases with the level of total-charges. . . . An additional possible explanation for this finding is hospital pricing strategies . . .

OK, here’s the problem. Here are the data:

This should really be a graph, but let’s not worry about that right here.

Let’s focus on the fourth bucket, because that’s where the discrepancy is highest. You see what’s happening, right? In that fourth bucket, we’re up there in the tail of the distribution, the tail is dropping fast, so, yeah, there are very charges over $200,000. That doesn’t mean anyone’s cheating in their billing; it’s just what you’d expect to see in the tail of the distribution. Benford’s law applies when the underlying numbers come from a distribution with have a wide dynamic range, and by binning in this way you’re destroying that.

I’d say I’m surprised this got published in a legitimate journal, but, you know, the problem with peer review is the peers. Everyone’s doing the best they can, Benford’s law is a bright shiny object, it gets misused just like linear regression gets misused, just like logistic regression gets used, just like hypothesis testing gets misused and misused and misused. The Benford example is just a little bit more interesting because the math confusion is something a bit less familiar than the usual statistical mistakes we see every day. Hence why I bothered with this post.

5 thoughts on “A bright shiny object: Misapplication of Benford’s law published in health economics journal

  1. Benford’s law, as fascinating as it is, focuses on the leading digit. My attention is on the trailing digits at the end which indicates the triumph of the number, “9”. Everything these days ends in a “9”, often in a bunch of them. I am no more concerned about conspiracies than the next man–assuming the next man is Richard M. Nixon–but this seems to be a relatively new phenomenon.

  2. The key aspect of our research that is in common with these papers is detecting non-random behaviour in an attempt to protect the public from being harmed by inefficient health care delivery.

    Why is “non-random” getting equated with inefficient and harmful? According to this, 30% of bills between $100k – $999,999k should be less than $200k. They observed 70% were in the lowest part of the range and concluded it was harming patients.

    Hello Ma’am, we are pleased to inform you a mistake has been made in your favor during billing. Your odds of a $900k- $1M bill should have been 4.5%, but we gave you only 0.25% chance to get the largest bill. Your odds have been recalculated and applied to the new bill. We apologize for the harm caused by the earlier error.

    Like previous economics stuff I’ve seen on here (eg, hinging theories on the assumption insider trading is rare???) I am just going to refuse to believe this.

  3. Not sure I fully understand the points being made by the paper (are costs really random?), but the huge variation in sizes of the bins strikes me as bizarre. Maybe based on quantities of an extremely right-skewed distribution?

  4. I buy the first observation by Anoneuoid.
    But first of all, Benford law applies to variables that span different orders of magnitude. So, grouping observations by the same order of magnitude invalidates the main Benford law assumption.

Leave a Reply

Your email address will not be published. Required fields are marked *