
## Statistics is the *least* important part of data science

This came up already, but I’m afraid the point got lost in the middle of our long discussion of Rachel and Cathy’s book. So I’ll say it again: so much of what goes on with data is about computing, not statistics. I do think it would be fair to consider statistics (which includes sampling, […]