Why is this double-y-axis graph not so bad?

Posted on September 12, 2015 9:52 AM by Andrew

Usually I (and other statisticians who think a lot about graphics) can’t stand this sort of graph that overloads the y-axis:

journal.pone.0112042.g001

But this example from Isabel Scott and Nicholas Pound actually isn’t so bad at all! The left axis should have a lower bound at 0—it’s not possible for conception risk to be negative—but, other than that, the graph works well.

What’s usually the problem, then? I think the usual problem with double-y-axis graphs is that attention is drawn to the point at which the lines cross.

Here’s an example. I was searching the blog for double-y-axis graphs but couldn’t easily find any, so I googled and came across this:

Forget the context and the details—I just picked it out to have a quick example. The point is, when the y-axes are different, the lines could cross anywhere—or they don’t need to cross at all. Also you can make the graph look like whatever you want by scaling the axes.

The top graph above works because the message is that conception risk varies during the monthly cycle while political conservatism doesn’t. It’s still a bit of a cheat—the scale for conception risk just covers the data while for conservatism they use the full 1-6 scale—but, overall, they still get their message across.

10 thoughts on “Why is this double-y-axis graph not so bad?”

Philip Cohen on September 12, 2015 10:16 AM at 10:16 am said:

I have used them when the point is the direction of the trends rather than their scale. Of course it could be abused (like pairing tiny modulations with major trends), but when it’s reasonable I think it’s reasonable. Here’s a post with two of them: https://familyinequality.wordpress.com/2012/11/26/single-moms-cant-be-scapegoated-for-the-murder-rate-anymore/

Reply ↓
- Andrew on September 12, 2015 10:31 AM at 10:31 am said:
  
  Philip:
  
  I think in your examples I’d prefer the connected-dots plot, where you graph one variable on the x-axis and the other on the y-axis, and you connect the points from one year to the next so the time trends are clear. Or I’d just prefer 2 time series, side by side, on separate graphs.
  
  Reply ↓
  - Philip Cohen on September 12, 2015 2:32 PM at 2:32 pm said:
    
    That’s very reasonable. Two graphs side by side with the same units on the x-axis would probably be best. Thanks.
    
    Reply ↓
Rahul on September 12, 2015 10:57 AM at 10:57 am said:

One place double axes plots are useful is multiple units. Say, a heat transfer coefficient in J/m2 C & also in Btu/ft2 F

Reply ↓
Elin on September 12, 2015 3:38 PM at 3:38 pm said:

I think it kind of works because one is an abstract, smoothed distribution and the other is data points .. but it also works because it highlights the discrepancy between the effect size predicted by the “theory” and the observed data. I actually find it helpful for readability to have some space between the axis and the smoothed curve especially since there are no 0 risk days at the aggregate level (there is some woman who is in the fertility window on every number of days since last first day of menses). I think the space is bigger than needed though. Still it gets the information about the lack of variation in the political data across.

Reply ↓
Dale on September 13, 2015 12:47 PM at 12:47 pm said:

Actually, I think the image can be improved upon:
http://myweb.loras.edu/dl526303/RevisedFigure.html

I did not have the data so I had to guesstimate it. I indexed the two scales to the maximum value. This eliminated the fact that the variability of the risk was much larger than the variability of the conservatism measure. What I think is better about this display is that it clearly shows that the only feature common to both is the peak. Outside of that, there is little correlation between the two series and they are often in conflict.

Reply ↓
- Martha on September 13, 2015 6:43 PM at 6:43 pm said:
  
  The scales on your y-axes look off.
  
  Reply ↓
Dale on September 14, 2015 8:07 AM at 8:07 am said:

Both series have been scaled to the maximum value = 1 for that series. I think the scales are correct. The point of re-scaling them to the same scale is to overcome the disparity in the variability of the two measures. The original chart suggests the lack of correlation but, for me, the variability difference overwhelms the rest of the picture. With the same scaling, I think the correlation (or lack thereof) becomes more apparent.

Reply ↓
MIke S on September 15, 2015 1:51 AM at 1:51 am said:

Just for fun, I found a few more with duckduckgo, if you’d care to comment on any them:

http://www.tushar-mehta.com/excel/charts/plot_magnitude_differences.htm

http://www.tushar-mehta.com/excel/charts/0204-single%20graph%20dual%20axis.htm

http://www.mathworks.com/help/matlab/creating_plots/plotting-with-two-y-axes.html

http://www.mathworks.com/help/matlab/creating_plots/graph-with-multiple-x-axes-and-y-axes.html

http://www.statmethods.net/advgraphs/images/axis.png

http://www.ellenfinkelstein.com/pptblog/create-a-powerpoint-chartgraph-with-2-y-axes-and-2-chart-types/

http://www.graphpad.com/guides/prism/6/user-guide/index.htm?graphs_with_two_y_axes.htm

Reply ↓
Jan on July 2, 2017 12:37 PM at 12:37 pm said:

The point is, when the y-axes are different, the lines could cross anywhere—or they don’t need to cross at all.

Unless they are parallel lines, in a 2D world those lines will intersect.

Reply ↓

Statistical Modeling, Causal Inference, and Social Science

Why is this double-y-axis graph not so bad?

10 thoughts on “Why is this double-y-axis graph not so bad?”

Leave a Reply Cancel reply