Comments on: Happy birthday

By: Modeling statewide presidential election votes through 2028 - Statistical Modeling, Causal Inference, and Social Science

Fri, 04 Nov 2016 14:47:08 +0000

[…] decided to fit a Gaussian process model (following the lead of Aki in the birthday problem) with separate time series for each state and each region, with the country partitioned […]

By: Cross-validation, LOO and WAIC for time series - Statistical Modeling, Causal Inference, and Social Science Statistical Modeling, Causal Inference, and Social Science

Fri, 16 Jan 2015 13:39:55 +0000

[…] in the observed time series. For example, in the birthday example (BDA3 p. 505 and here), we can say that we have learned about the structure if we can predict any single date with […]

By: Spring forward, fall back, drop dead? « Statistical Modeling, Causal Inference, and Social Science Statistical Modeling, Causal Inference, and Social Science

Tue, 10 Jun 2014 13:42:37 +0000

[…] Valentine’s day and fewer on Halloween, I think the right way to go is to do an analysis of all the days of the year rather than picking just one or two […]

By: LOL! Birthdays not preferred on holidays? | 'Enjoying the Hi-5s of Autism'

LOL! Birthdays not preferred on holidays? | 'Enjoying the Hi-5s of Autism' — Sat, 28 Dec 2013 17:53:36 +0000

[…] more of his charts here. Happy Birthday Statistical Modeling, Causal Inference, and Social Science (19 December […]

By: Somewhere else, part 104 | Freakonometrics

Somewhere else, part 104 | Freakonometrics — Fri, 27 Dec 2013 04:16:06 +0000

[…] “Happy birthday” (via http://statmodeling.stat.columbia.edu/2013/12/1 …) […]

By: Rahul

Rahul — Thu, 26 Dec 2013 09:06:23 +0000

Can you see any blips for 9/11 or that big North East Blackout of 2003 etc.? I wonder.

By: Andrew

Andrew — Thu, 26 Dec 2013 08:32:48 +0000

In reply to Tehpet. y

By: Tehpet

Tehpet — Thu, 26 Dec 2013 05:46:18 +0000

Are the numbers for leap year day births normalized for the infrequency of that day?

By: stringph

stringph — Wed, 25 Dec 2013 18:34:11 +0000

What, no error bars or bands?

.. and no line connecting Sunday with Monday — the biggest difference out of all adjacent days?

By: Art

Art — Tue, 24 Dec 2013 06:55:59 +0000

I find it interesting that the actual number of births is highest in late September. Does that imply that while there may be fewer mothers giving birth on Christmas Day, prospective parents are busier conceiving on (or around) Christmas Day? :)

By: Take a Number: To Be Born on a Christmas Morn : One Caribbean Radio | The Global Mix

Tue, 24 Dec 2013 01:46:41 +0000

[…] See more of his charts here. […]

By: To Be Born on a Christmas Morn | Preezly

To Be Born on a Christmas Morn | Preezly — Tue, 24 Dec 2013 00:01:51 +0000

[…] See more of his charts here. […]

By: Andrew

Andrew — Fri, 20 Dec 2013 13:31:25 +0000

In reply to Aki Vehtari.

Aki:

But wouldn’t the log scale help when considering the long term trend (which moves about 20% from min to max)? Put it this way: suppose there is a fixed multiplicative effect of day of year or day of week or whatever. In the additive model, this will show up as a larger effect in 1976 (when the total #births is lowest). And, indeed, if you look at the day-of-week effects, the curve for 1976 is pretty high. It’s not the highest—1988 is the highest, presumably because there were real changes during this period with more scheduled births—but it’s up there, perhaps an artifact of the additive model for what fundamentally is a multiplicative process.

Regarding the Gaussian approximation, I wonder if there would be a way to do a multiplicative model by fitting an additive model on the log of the raw data and just adjusting the data variance accordingly. So the computation would be just as easy, it’s just that instead of approximating the binomial density with a Gaussian, we’d be applying the Gaussian approx to the density of the log of a binomially-distributed random variable.

By: Aki Vehtari

Aki Vehtari — Fri, 20 Dec 2013 08:36:32 +0000

D.O: Bumps before and after Labor day and Thanksgiving can be seen if we plot each year separately. In that plot you could also see that the effect for Labor Day and Thanksgiving is about the same size as for Independece day. Andrew preferred this plot where we show the effect for fixed yearday, and so the effect of fixed weekday is spread in this plot.

Rahul: Scale is not arbitrary. I first used absolute scale, but since I was interested in comparing the sizes of the different effects, it required extra mental effort to calculate whether the relative changes are big or not. I used % scale, because it looks prettier than having decimals (0.8 0.9 1.0 1.1). This scale has also benefit that when I made similar figure for Finland, I could immediately see that the size of the relative effects were similar. During these years on a average Friday there were about 10,000 births. We could have the absolute scale on the right.

Andrew: the data is count data, but with so high mean counts it can approximated very well with a Gaussian model. Log scale is not needed to ensure positivity and would transform the distribution away from Gaussian.

By: Rahul

Rahul — Fri, 20 Dec 2013 06:45:23 +0000

In reply to Andrew. Or exploit the unused right hand y-axis. You could relabel that in #births?

By: Andrew

Andrew — Fri, 20 Dec 2013 06:42:19 +0000

In reply to Rahul. Rahul: Good point. I think it would make sense to put the top graph (trends) on an absolute scale (perhaps #births per day, as you suggest) and the others on relative scales. Also, looking at the description in the book, it appears that we fit an additive model directly on the data, but now I'm thinking it would make more sense to work on the log scale.

By: Andrew

Andrew — Fri, 20 Dec 2013 06:38:25 +0000

In reply to D.O.. D.O.: Yes, as we discuss in the book, the model could be improved by replacing the daily spikes by little functions with "ringing" so that a dip on a particular day corresponds to smaller increases on the days right before and after. In the above graphs, I think that some of the daily effects have been inappropriately absorbed into the seasonal effect.

By: Rahul

Rahul — Fri, 20 Dec 2013 05:17:41 +0000

One minor question: Why normalize to an arbitrary 100 scale? Wouldn’t the graph be a tad more informative if you kept actual “num. of birth units”.

e.g. How many actual births do happen on a average Friday?

By: D.O.

D.O. — Fri, 20 Dec 2013 01:47:19 +0000

It’s a bit surprising that significant dips on Labor day and Thanksgivings do not have bumps before or after. Probably they are eaten up by the smoothing procedure; maybe because both LD and Thnxgiving are on the fixed days of the week.