The person who sent me this question (“You’re a data scientist at a local hospital and you’ve been asked to present to the physicians on communicating statistical information to patients. What should you say?”) the other day read the comment thread and responded: Thank you so much for putting the question to your readership. Their […]

**Teaching**category.

## You’re a data scientist at a local hospital and you’ve been asked to present to the physicians on communicating statistical information to patients. What should you say?

Someone who wishes to remain anonymous writes: I just read your post reflecting on crappy talks . . . I’m reaching out because I’m a data scientist at a local hospital in the US and I’ve been asked to present to our physicians about communicating statistical information to patients (e.g., how to interpret the results […]

## Reflections on a talk gone wrong

The first talk I ever gave was at a conference in 1988. (This isn’t the one that went wrong.) I spoke on Constrained maximum entropy methods in an image reconstruction problem. The conference was in England, and I learned about it from a wall poster. They had travel funding for students. I sent in my […]

## Sketching the distribution of data vs. sketching the imagined distribution of data

Elliot Marsden writes: I was reading the recently published UK review of food and eating habits. The above figure caught my eye as it looked like the distribution of weight had radically changed, beyond just its mean shifting, over past decades. This would really change my beliefs! But in fact the distributional data wasn’t available […]

## Weakliem on air rage and himmicanes

Weakliem writes: I think I see where the [air rage] analysis went wrong. The dependent variable was whether or not an “air rage” incident happened on the flight. Two important influences on the chance of an incident are the number of passengers and how long the flight was (their data apparently don’t include the number […]

## Debate involving a bad analysis of GRE scores

This is one of these academic ping-pong stories of a general opinion, an article that challenges the general opinion, a rebuttal to that article, a rebuttal to the rebuttal, etc. I’ll label the positions as A1, B1, A2, B2, and so forth: A1: The starting point is that Ph.D. programs in the United States typically […]

## What are the most important statistical ideas of the past 50 years?

Aki and I wrote this article, doing our best to present a broad perspective. We argue that the most important statistical ideas of the past half century are: counterfactual causal inference, bootstrapping and simulation-based inference, overparameterized models and regularization, multilevel models, generic computation algorithms, adaptive decision analysis, robust inference, and exploratory data analysis. These eight […]

## Basbøll’s Audenesque paragraph on science writing, followed by a resurrection of a 10-year-old debate on Gladwell

I pointed Thomas Basbøll to my recent post, “Science is science writing; science writing is science,” and he in turn pointed me to his post from a few years ago, “Scientific Writing and ‘Science Writing,’” which stirringly begins: For me, 2015 will be the year that I [Basbøll] finally lost all respect for “science writing”. […]

## Is causality as explicit in fake data simulation as it should be?

Sander Greenland recently published a paper with a very clear and thoughtful exposition on why causality, logic and context need full consideration in any statistical analysis, even strictly descriptive or predictive analysis. For instance, in the concluding section – “Statistical science (as opposed to mathematical statistics) involves far more than data – it requires realistic […]

## Nonparametric Bayes webinar

This post is by Eric. A few months ago we started running monthly webinars focusing on Bayes and uncertainty. Next week, we will be hosting Arman Oganisian, a 5th-year biostatistics PhD candidate at the University of Pennsylvania and Associate Fellow at the Leonard Davis Institute for Health Economics. His research focuses on developing Bayesian nonparametric […]

## “In the world of educational technology, the future actually is what it used to be”

Following up on this post from Audrey Watters, Mark Palko writes: I [Palko] have been arguing for a while that the broad outlines of our concept of the future were mostly established in the late 19th/early 20th Centuries and put in its current form in the Postwar Period. Here are a few more data points […]

## Lying with statistics

As Deb Nolan and I wrote in our book, Teaching Statistics: A Bag of Tricks, the most basic form of lying with statistics is simply to make up a number. We gave the example of Senator McCarthy’s proclaimed (but nonexistent) list of 205 Communists, but we have a more recent example: One of the supposed […]

## My scheduled talks this week

Department of Biostatistics, Harvard University: Today, Tues 10 Nov 2020, 1pm Department of Marketing, Arison School of Business, Israel: Thurs 12 Nov 2020, 10am (US eastern time) St. Louis Chapter of the American Statistical Association: Thurs 5pm 2020, 5pm (US eastern time) The listed topic for the first two events is election forecasting and for […]

## Why is this graph actually ok? It’s the journey, not just the destination.

Josh Miller was in my office and started flipping through Kieran Healy’s book on data visualization, a book that I like a lot—I even use it in my class, replacing Cleveland’s Elements of Graphing Data which is wonderful but things have changed in 35 years so time for a new book. Josh noticed Figure 8.17 […]

## Reverse-engineering the problematic tail behavior of the Fivethirtyeight presidential election forecast

We’ve been writing a bit about some odd tail behavior in the Fivethirtyeight election forecast, for example that it was giving Joe Biden a 3% chance of winning Alabama (which seemed high), it was displaying Trump winning California as in “the range of scenarios our model thinks is possible” (which didn’t seem right), and it […]

## We are stat professors with the American Statistical Association, and we’re thrilled to talk to you about the statistics behind voting. Ask us anything!

It’s happening at 11am today on Reddit. It’s a real privilege to do this with Mary Gray, who was so nice to me back when I took a class at American University several decades ago.

## Misrepresenting data from a published source . . . it happens all the time!

Following up on yesterday’s post on an example of misrepresentation of data from a graph, I wanted to share a much more extreme example that I wrote about awhile ago, about some data misrepresentation in an old statistics textbook: About fifteen years ago, when preparing to teach an introductory statistics class, I recalled an enthusiastic […]

## Some wrong lessons people will learn from the president’s illness, hospitalization, and expected recovery

Jonathan Falk writes about the president’s illness: I [Falk] would think it provides a focused opportunity to make a few salient statistical education points. First, a 6 percent mortality rate (among old people with comorbidities) is really bad, but any single selected person is really quite unlikely to die, or even be really sick. Same […]

## It’s kinda like phrenology but worse. Not so good for the “Nature” brand name, huh? Measurement, baby, measurement.

Federico Mattiello writes: I thought you might find this thread interesting, it’s about a machine learning paper building a “trustworthiness score” from faces databases and historical (mainly British) portraits. It checks many bias boxes I believe, but my biggest complaint (I know it shouldn’t be) is the linear regression of basically spherical clouds of points: […]

## A question of experimental design (more precisely, design of data collection)

An economist colleague writes in with a question: What is your instinct on the following. Consider at each time t, 1999 through 2019, there is a probability P_t for some event (e.g., it rains on a given day that year). Assume that P_t = P_1999 + (t-1999)A. So P_t has a linear time trend. What […]