New Course: Prediction for (Individualized) Decision-making

This is Jessica. This winter I’m teaching a new graduate seminar on prediction for decision-making intended primarily for Computer Science Ph.D. students. The goal of the new course is to consider various perspectives on what it means to predict for the purpose of decision-making. We’ll look at this question in the context of predictive modeling for automated decisions or to inform expert decisions and causal estimation to inform policy. I’m trying to include a mix of theoretical and applied papers, with an emphasis on philosophical and ethical challenges to evaluating decision-making and applying formal methods in practice, especially in contexts where human experts currently make decisions and/or the decisions involve people. Technically the course title is Prediction for Decision-making. But one of the motivations is that we have yet to adequately address the gap between conventional machine learning, where we optimize loss over aggregates, and the needs of human decision-makers in practice, where we often care about doing right by individual cases. Hence the reference to “individualized.” 

Suggestions welcome if this is your cup of tea and you think I missed something important. A few of the listed papers are already coming from pointers I’ve gotten from readers here. I’m especially interested in papers that help illustrate the gaps in current methods when it comes to good individual decisions. 

Course Schedule

Week 1 – Introduction and background on statistical decision rules

     Background: Statistical decision theory, randomized controlled trials

  • Berger, J. O. (2013). Statistical decision theory and Bayesian analysis. Springer Science & Business Media. Chapter 1.
  • Hernan, Miguel A., & Robins, James, M. (2023). Causal inference: what if. CRC PRESS. Chapters 1, 2

    Examples

Week 2 – Prediction versus decision-making

     Optional

Week 3 – Human versus statistical judgment

     Optional

Week 4 – Evaluating (individual) predictions and decisions

Optional

Week 5 – Data shifts and causality

     Optional

Week 6 – Personalization and fairness

      Optional

Week 7 – Calibration for decision-making

     Optional

Week 8 – Communicating prediction uncertainty

    Optional

Week 9 – Designing human-AI workflows 

     Optional

25 thoughts on “New Course: Prediction for (Individualized) Decision-making

  1. What an excellent list of references – most of which I was not familiar with but which are directly relevant to things I think about a lot. I will offer one observation of many of these sources – they are unnecessarily laden with technical jargon, in my opinion. There are a number of examples of what I consider academic writing styles that adhere to our academic conventions but make the substantive issues harder to understand. I think this applies especially to the calibration readings (which have been hard for me to digest on this blog as well). There is a lot on this list! Too much for the time I have! But, thanks for putting it together and providing much reading for the coming months.

  2. This looks like fun and there is always more to add. My main concern would be that’s a lot of technical material per week—if these aren’t stats or very dedicated ML Ph.D. students, this is going to be rough.

    I would want to talk about Pareto optimality as it relates to decision making for groups and individuals. And then something like the trolley problem, which clearly highlights that these problems don’t have a “solution”. Then I’d want to talk about Arrow’s impossibility theorem on voting for decision making. But just this topic is the basis of a whole semester-long class, so I don’t know how you’re going to balance all this material—good luck!

    • I’d be interested to know any references about Arrow’s impossibility theorem in the context of decision making. I am only vaguely familiar with the theorem from reading several of Donald Saari’s works about a decade ago. I came away with the impression that it was more molehill than mountain, but I don’t remember any specifics.

    • It should also be mentioned that Arrow’s theorem requires that you vote based on ranks. It doesn’t apply if you’re voting based on cardinality/utility estimates. Of course other “perverse/strategic voting theorems” apply perhaps. Everything has some kind of strategic voting possibility. But Arrow’s theorem is a molehill for sure. Unfortunately we can’t seem to get people to understand that score voting is the one true path ;-)

      • Daniel,

        Yes, everything has some kind of strategic voting possibility, also the rules of the election can affect who decides to run and how many serious candidates there will be. So naive analyses of voting paradoxes based on a static set of candidates can go wrong.

    • You’re right that it will be a challenge given the material. I usually assign groups of 3 or so students to cover each week, and it’s a big chunk of their grade, so the presentations are expected to be technically correct and well prepared. This year I plan to do practice sessions with them to make sure the presentations hit all the important points. In the past in courses like this I have sometimes taken on some of the technical material (and the important points from the optional material) to help out. I’m ok with students not grasping everything if they are motivated to try to keep up and sincerely interested in the material. I prefer to have some diversity of perspectives in a course like this.

      On Arrow’s impossibility theorem – not including anything on computational social choice is probably an oversight, particularly given the interest in ML right now around “pluralistic alignment”/social choice for RLHF.

      Your mention of Pareto optimality reminds me of a fairness paper I may add: https://dl.acm.org/doi/pdf/10.1145/3490486.3538237

      • Jessica,

        If you want to talk about fairness, I kinda like my 2002 paper, Voting, fairness, and political representation (with discussion). Chance 15 (3), 22-26.

        I like this particular paper because it’s simple and it demystifies some fairness issues that I sometimes think are presented in too technical a manner.

        (I keep referring to my own work not because it’s the best but because I’m most familiar with it, so when any topic comes up, some article of mine will come to mind.)

  3. – For week 1 on decision analysis, I recommend chapter 9 of BDA3. Or, for a softer read, this 2015 article with Phil Price, Centralized analysis of local data, with dollars and lives on the line: Lessons from the home radon experience. In Data Science for Politics, Policy and Government}, ed. R. Michael Alvarez. Cambridge University Press. I prefer these real examples to the abstract discussions of mean squared error etc. in Berger’s book.

    – For week 1 on causal inference, I recommend my 2011 article, Causality and statistical learning. American Journal of Sociology 117, 955-966, not because it offers any deep theory (it doesn’t!) but because it discusses several different perspectives on causal inference.

    – For week 5 on generalization, I recommend chapter 19 of Regression and Other Stories and this 2021 article with Lauren Kennedy, Know your population and know your model: Using model-based regression and poststratification to generalize findings beyond the observed sample. Psychological Methods 26, 547-558.

    – For week 8 on communicating predictive uncertainty, I recommend our article! from 2020, Information, incentives, and goals in election forecasts. Judgment and Decision Making 15, 863-880.

  4. This list already looks great! Here is some relevant writing that might be interesting.

    For a history (of science) perspective on how formalized decision policies are put to work in social practice, I really enjoyed Lorraine Daston’s book “Rules: A Short History of What We Live By”. It focuses on how we have historically approached the tasks of first generalizing from particulars (cases) to the universal (the rule/policy), and second applying the universal (rule/policy) to the particular (individual decision) across a range of domains. The book really gets at how the application of formal procedures inevitably runs into improvisation (or what we sometimes call judgment and intuition). Streel-Level Algorithms (https://hci.stanford.edu/publications/2019/streetlevelalgorithms/streetlevelalgorithms-chi2019.pdf) also talks about the same improvisation in application. Lorraine’s work highlights how for a long time improvisation was thought to be necessary (rules or policies work were thought to work best when *someone* is tasked with applying them rather than automated application) but over time the concepts of “judgement” and “discernment” have shifted from being desirable (by accommodating for the uniqueness of each decision) to undesirable (introducing human biases) in decision-making processes. Street-level algorithm’s talks about how the improvisation can be accounted for (e.g. by provisioning recourse).

    For philosophical perspectives on individual decisions, I like Ruth Chang’s work on the limits of practical reason (such expected utility calculus or cost/benefits analysis). This is an example https://philarchive.org/archive/CHAHC-8

    Finally self-plug on decisions about people, where the decision-maker cares about individual outcomes (and how current methods are limited): https://dl.acm.org/doi/10.1145/3613904.3642685 . We study what kinds of “decision support” managers wanted when making team decisions. Similar to Anna’s work on child welfare decisions in that it deals with decisions about people but the key difference is that we looks at decisions that warrant explanation (e.g. where convincing stakeholders about a course of action and choosing the course of action are concomitant). So, shaping collective beliefs is part of the decision-making process and as a result, also constrains how decisions proceed. This is also explored to some extent in https://hal.science/hal-03199715/document

      • The Dalston book is great. Kenneth R. Hammond’s (1996) “Human Judgement and Social Policy” could be relevant especially to the topics in week 3, but I’m not sure I would recommend it even as background material for this reading list since it’s a rather long book. Many thanks for sharing this list of references, it looks fantastic and I’m looking forward to reading!

  5. Hello, Jessica! On “individualized”, I happen to be watching some old YouTube videos where Michael Jordan was talking about computational thinking, inferential/statistical thinking, and big data (around 2015), e.g.,
    https://www.youtube.com/watch?v=j1PS6T6yAoY
    https://dl.acm.org/doi/10.1145/2745754.2745782 (there is a video under Supplementary Material)
    The latter is linked to:
    Michael I. Jordan. 2015. Computational Thinking, Inferential Thinking and “Big Data”. In Proceedings of the 34th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (PODS ’15). Association for Computing Machinery, New York, NY, USA, 1. https://doi.org/10.1145/2745754.2745782.
    I’m not sure if there are any decision making contents in the paper, but just in case.

  6. This is a really solid list. Some work I haven’t come across before but I’ll come back to this post to find some new stuff, I think.

    As an economist by background I have also been meaning to start delving into some behavioural economics reading as the entire discipline is more or less about decision-making under uncertainty. If anyone has anywhere they would suggest to get started, I’d appreciate it!

  7. Darn, what a list! Would definitely enjoy working with graduates from this class. I mentioned it recently in another thread and will highlight it here again as an excellent dissection of individualized inference that would fit well in such a course:

    Liu K & Meng XL (2016). There Is Individualized Treatment. Why Not Individualized Inference? Annual Review of Statistics and Its Application, Vol. 3, Issue 1, pp. 79-111: https://www.annualreviews.org/content/journals/10.1146/annurev-statistics-010814-020310

  8. I don’t know if this really fits in your existing course but take a look at Fleder, Daniel M. and Hosanagar, Kartik, Blockbuster Culture’s Next Rise or Fall: The Impact of Recommender Systems on Sales Diversity (September 1, 2007). Management Science, Vol. 55, No. 5, pp. 697-712, May 2009, Available at SSRN: https://ssrn.com/abstract=955984 or https://dx.doi.org/10.2139/ssrn.955984. This paper is looking at how interactions with a recommender system affect the sales of various products over the course of a sequence of decisions, and that doesn’t necessarily line up with what you might expect from just looking at a single period.

  9. Jessica: Have you considered including a section on legal issues for AI systems that will in all likelihood be used to violate the 4th Amendment along with state and federal legislation related to stalking and harassment?

      • My thought is that you will likely be covering AI algorithmic employment decisions under your section on fairness and what might be a good exercise is to identify a specific technology and generate the potential for abuse and constitutional or illegal use of such a technology. One technology that has been around for a long time, but is just becoming commercially available is hypersonic speaker and parabolic microphone technology. You could assign students to research the technology and come with how it will likely be misused given AI technology such as Voice Generators.

Leave a Reply

Your email address will not be published. Required fields are marked *