Nick Fisch writes:
After reading your paper “Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC”, I am curious as to whether the criteria WAIC or PSIS-LOO can be used to compare models that are fit using different likelihoods. I work in fisheries assessment, where we are frequently fitting highly parameterized nonlinear models to multiple data sources using MCMC (generally termed “integrated fisheries assessments”). If I build two models that differ solely in the likelihood specified for a specific data source (one Dirichlet, the other multinomial), would WAIC or loo be able to distinguish these, or must I use some other method to compare the models (such as goodness of fit, sensitivity, etc.)? I should note that the posterior distribution will be the unnormalized posterior distribution in these cases.
My response: for discrete data I think you’d just want to work with the log probability of the observed outcome (log p), and it would be fine if the families of models are different. I wasn’t sure what was the best solution with continuous variables, so I forwarded the question to Aki, who wrote:
This question is answered in my [Aki’s] cross validation FAQ:
12 Can cross-validation be used to compare different observation models / response distributions / likelihoods?
First, to make the terms clearer: p(y∣θ) as a function of y is an observation model, and p(y∣θ) as a function of θ is a likelihood. It is better to ask, “Can cross-validation be used to compare different observation models?”
– You can compare models given different discrete observation models, and it’s also allowed to have different transformations of y as long as the mapping is bijective (the probabilities will then stay the same).
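To illustrate that bijectivity point, here is a minimal sketch (my own toy example, not from the FAQ, with made-up Poisson data): if z is a one-to-one relabeling of a discrete outcome y, the model for z induced by the model for y assigns exactly the same probabilities, so log-probability comparisons are unaffected.

```python
# Sketch: a bijective transform of discrete data leaves probabilities unchanged.
# Hypothetical data and rate parameter, chosen only for illustration.
import numpy as np
from scipy.stats import poisson

y = np.array([0, 1, 2, 5])  # observed counts
z = 2 * y + 1               # a bijective relabeling of the outcomes

# The model for z induced by a Poisson model for y satisfies
# P(Z = z) = P(Y = (z - 1) / 2), so the log probabilities are identical.
lp_y = poisson.logpmf(y, mu=2.0)
lp_z = poisson.logpmf((z - 1) // 2, mu=2.0)
```

Because the probabilities match pointwise, any sum of log predictive probabilities (the quantity LOO works with) is identical for the two parameterizations.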
– You can’t compare densities and probabilities directly. Thus you can’t compare models given continuous and discrete observation models, unless you compute probabilities in intervals from the continuous model (also known as discretising the continuous model).
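A minimal sketch of that discretisation idea (my own toy example with made-up parameter values, assuming scipy): to compare a continuous model against a discrete one on count data, replace each density value with the probability of an interval around the observed count; those interval probabilities are on the same scale as the discrete model’s probabilities.

```python
# Sketch: comparing a continuous and a discrete observation model on counts
# by discretising the continuous one. Parameters are hypothetical.
import numpy as np
from scipy.stats import norm, poisson

y = np.array([3, 5, 4, 7, 2])  # observed counts

# Discrete model: Poisson log *probabilities*
lp_discrete = poisson.logpmf(y, mu=4.0)

# Continuous model: normal log *densities* -- not comparable to probabilities
lp_density = norm.logpdf(y, loc=4.0, scale=2.0)

# Discretised continuous model: probability of the unit interval around each y,
# P(y - 0.5 < Y <= y + 0.5), which IS comparable to the Poisson log-probabilities
lp_discretised = np.log(norm.cdf(y + 0.5, 4.0, 2.0) - norm.cdf(y - 0.5, 4.0, 2.0))
```

The interval half-width of 0.5 is the natural choice for integer counts; the point is only that both models end up assigning probabilities to the same discrete events.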
– You can compare models given different continuous observation models, but you must have exactly the same y (loo functions in rstanarm and brms check that the hash of y is the same). If y is transformed, then the Jacobian of that transformation needs to be included. There is an example of this in the mesquite case study.

It is better to use cross-validation than WAIC, as the computational approximation in WAIC fails more easily and it’s more difficult to diagnose when it fails.
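The Jacobian point can be sketched like this (a toy example with made-up data, not the mesquite case study itself): a normal model fit to log(y) can be compared with a model fit to raw y only after adding the log absolute Jacobian of the transformation, log|d log(y)/dy| = −log(y), to the transformed model’s log density. With the Jacobian included, a normal-on-log(y) model gives exactly the lognormal density of y.

```python
# Sketch: Jacobian adjustment when one model is fit to z = log(y).
# Data and parameters are hypothetical, for illustration only.
import numpy as np
from scipy.stats import norm, lognorm

y = np.array([1.5, 2.0, 3.2])
mu, sigma = 0.5, 0.4

# Model A: lognormal density evaluated on the raw scale of y
lp_raw = lognorm.logpdf(y, s=sigma, scale=np.exp(mu))

# Model B: normal density for z = log(y); to compare on the scale of y,
# add the log absolute Jacobian |dz/dy| = 1/y, i.e. subtract log(y)
lp_transformed = norm.logpdf(np.log(y), mu, sigma) - np.log(y)
```

Here the two models are secretly the same distribution, so the adjusted log densities match exactly; for genuinely different models, the same adjustment is what puts them on a common scale.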
P.S. Nick Fisch is a Graduate Research Fellow in Fisheries and Aquatic Sciences at the University of Florida. How cool is that? I’m expecting to hear very soon from Nick Beef at the University of Nebraska.
Here’s the link to Andrew’s paper:
https://arxiv.org/pdf/1507.04544.pdf
Could you, Andrew, provide a brief description of what types of models and problems LOO and WAIC should be used for? (The abstract describes how the algorithms work but doesn’t describe some real-world cases in which they could be used.)
Sam:
Aki’s convinced me to always use LOO, never WAIC.
Uh oh, Andrew, you are sailing into the ‘Dentists named Dennis’ waters ;)
Chris:
Tell it to Nick Beef.
I’m pretty sure it’s Patrick Beef, from Jamaica
ftw
With Nick Fisch in Florida, it seems it could be Russ L. Cowherd in Nebraska or Steel Head studying anadromous trout on the west coast.