In an article entitled, “Our Students Can’t Write. We Have Ourselves to Blame,” college professor Robert Zaretsky writes: I, for one, spend my semesters picking through the salads tossed and served up as papers by my students. Consider the opening paragraph from a paper I received this semester. The student, who chose to write on […]

**Teaching** category.

## A fun activity for your statistics class: One group of students comes up with a stochastic model for a decision process and simulates fake data from this model; another group of students takes this simulated dataset and tries to learn about the underlying process.

Benjamin Jarvis writes: I’m (re-)developing a course about discrete choice analysis, and I would like to build on data examples you use in your book with Jennifer Hill. In particular, I would like to include lessons about the conditional logistic regression model, aka McFadden’s multinomial logit. I was hoping students could extend the Bangladesh well-switching […]

## “Causal Inference: The Mixtape”

A few years ago we reviewed “Mostly Harmless Econometrics,” by Josh Angrist and Jörn-Steffen Pischke. And now we have another friendly introduction to causal inference by an economist, presented as a readable paperback book with a fun title. I’m speaking of “Causal Inference: The Mixtape,” by Scott Cunningham. I like the book—all the blurbs on […]

## Jordan Ellenberg’s new book, “Shape”

The full title is “Shape: The Hidden Geometry of Information, Biology, Strategy, Democracy, and Everything Else,” and I wasn’t looking forward to it. Yes, I’m a fan of Jordan Ellenberg, a practicing mathematician who’s also written general-interest books, but I have unpleasant memories of the math olympiad program where they were always trying to shove […]

## Probability problem involving multiple coronavirus tests in the same household

Mark Tuttle writes: Here is a potential homework problem for your students. The following is a true story. Mid-December, we have a household with five people. My wife and myself, and three who arrived from elsewhere. Subsequently, various diverse symptoms ensue – nothing too serious, but everyone is concerned, obviously. Video conference for all five […]

## When can we challenge authority with authority?

Michael Nelson writes: I want to thank you for posting your last decade of publications in a single space and organized by topic. But I also wanted to share a critique of your argument style as exemplified in your Annals of Surgery correspondence [here and here]. While I think it’s important and valuable that you […]

## More background on our research on constructing an informative prior from a corpus of comparable studies

Erik van Zwet writes: The post (“The Shrinkage Trilogy: How to be Bayesian when analyzing simple experiments”) didn’t get as many comments as I’d hoped, so I wrote a short explainer and a reading guide to help people understand what we’re up to. All three papers have the same very simple model. We abstract a […]

## “The 100 Worst Ed-Tech Debacles of the Decade”

This is a list from Audrey Watters (link from Palko). 100! Wow—that’s a long list. But it is for a whole decade. I doubt this’ll make it on to Bill Gates’s must-reads of the year, but I liked it. Just to give you a sense, I’ll share the first and last items on Watters’s list: […]

## Regression discontinuity analysis is often a disaster. So what should you do instead? Here’s my recommendation:

Summary: If you have an observational study with outcome y, treatment variable z, and pre-treatment predictors X, and treatment assignment depends only on X, then you can estimate the average causal effect by regressing y on z and X and looking at the coefficient of z. If there is lack of complete overlap in X […]
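To make the regression recipe in the summary concrete, here is a minimal simulation sketch. Everything in it is an assumption for illustration and not taken from the post: the function name `simulate_and_estimate`, the two simulated predictors, the logistic assignment rule, the true effect of 2.0, and the premise that the outcome really is linear in z and X with good overlap. Under those assumptions, the coefficient of z from the regression should land near the true effect, while the raw difference in means should not.

```python
import numpy as np


def simulate_and_estimate(n=5000, true_effect=2.0, seed=0):
    """Simulate fake data and compare adjusted vs. unadjusted estimates.

    All numeric settings here are hypothetical choices for illustration.
    """
    rng = np.random.default_rng(seed)

    # Pre-treatment predictors X (two columns, standard normal).
    X = rng.normal(size=(n, 2))

    # Treatment assignment depends only on X (assumed logistic rule).
    p_treat = 1.0 / (1.0 + np.exp(-(0.8 * X[:, 0] - 0.5 * X[:, 1])))
    z = rng.binomial(1, p_treat)

    # Outcome is linear in z and X, plus noise (assumed correct model).
    y = 1.0 + true_effect * z + 1.5 * X[:, 0] - 1.0 * X[:, 1] + rng.normal(size=n)

    # Regress y on an intercept, z, and X; keep the coefficient of z.
    design = np.column_stack([np.ones(n), z, X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    adjusted = coef[1]

    # Unadjusted comparison: difference in mean outcome between groups.
    naive = y[z == 1].mean() - y[z == 0].mean()
    return adjusted, naive


if __name__ == "__main__":
    adjusted, naive = simulate_and_estimate()
    print(f"coefficient of z (adjusting for X): {adjusted:.2f}")
    print(f"raw difference in means:            {naive:.2f}")
```

The sketch only covers the easy case. It says nothing about what to do when the linear model is wrong or when overlap in X is incomplete.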

## It was a year ago today . . .

We posted the following item: “We taught a class using Zoom yesterday. Here’s what we learned.” I was full of earnest thoughts. If you’d asked me whether I’d still be teaching on Zoom a year later, what would I have said? I’m not sure. The most relevant piece of information I can share with you […]

## Statistical fallacies as they arise in political science (from Bob Jervis)

Bob Jervis sends along this fun document he gives to the students in his classes. Enjoy. Theories of International Relations Assume that all the facts and assertions in these paragraphs are correct. Why do the conclusions not follow? (This does not mean that the conclusions are actually false.) What are the alternative explanations for the […]

## Toronto Data Workshop on Reproducibility

I (Lauren not Andrew writing) will be speaking at an upcoming online workshop on reproducibility (free and open). More details here. Looking at the talk outlines, I’m really looking forward to it. I think we can generally agree that reproducibility is a good thing, and something we want to strive for, but in practice there’s […]

## Kill the math in the intro stat course?

David Kane writes: Our introductory classes in statistics and data science use too much mathematics. The key causal effect which our students want our classes to have is to improve their future performance and opportunities. The more professional their computing skills (in the context of data analysis), the greater their likely success. Introductory courses should […]

## Summer training in statistical sampling at University of Michigan

Yajuan points us to this summer program:

## The textbook paradox: “Textbooks more than a very few years old cannot even be given away, but new textbooks are mostly made by copying from former ones”

The above remark, from Alan Dunne, applies to mature fields more than to new fields. For example, I guess the textbooks on deep learning are pretty recent, so anything a few years old really would be out of date. Even in subfields that have been around for a while, it can take a while for textbook […]

## New textbook, “Statistics for Health Data Science,” by Etzioni, Mandel, and Gulati

Ruth Etzioni, Micha Mandel, and Roman Gulati wrote a new book that I really like. Here are the chapters: 1 Statistics and Health Data 1.1 Introduction 1.2 Statistics and Organic Statistics 1.3 Statistical Methods and Models 1.4 Health Care Data 1.5 Outline of the Text 1.6 Software and Data 2 Key Statistical Concepts 2.1 Samples and […]

## Thanks, commenters!

The person who sent me this question (“You’re a data scientist at a local hospital and you’ve been asked to present to the physicians on communicating statistical information to patients. What should you say?”) the other day read the comment thread and responded: Thank you so much for putting the question to your readership. Their […]

## You’re a data scientist at a local hospital and you’ve been asked to present to the physicians on communicating statistical information to patients. What should you say?

Someone who wishes to remain anonymous writes: I just read your post reflecting on crappy talks . . . I’m reaching out because I’m a data scientist at a local hospital in the US and I’ve been asked to present to our physicians about communicating statistical information to patients (e.g., how to interpret the results […]

## Reflections on a talk gone wrong

The first talk I ever gave was at a conference in 1988. (This isn’t the one that went wrong.) I spoke on Constrained maximum entropy methods in an image reconstruction problem. The conference was in England, and I learned about it from a wall poster. They had travel funding for students. I sent in my […]

## Sketching the distribution of data vs. sketching the imagined distribution of data

Elliot Marsden writes: I was reading the recently published UK review of food and eating habits. The above figure caught my eye as it looked like the distribution of weight had radically changed, beyond just its mean shifting, over past decades. This would really change my beliefs! But in fact the distributional data wasn’t available […]