Another physically motivated sampler: Microcanonical HMC

This time it’s astrophysicists rather than quantum physicists getting in on the sampling action.

Microcanonical HMC

Uroš Seljak, an astrophysics professor at UC Berkeley, and collaborators developed a form of Hamiltonian Monte Carlo (HMC) sampling with an alternative treatment of energy, related to underdamped Langevin dynamics. Here’s the link to the arXiv paper.

Uroš presented a preliminary version last month when he visited the Flatiron Institute for our workshop on measure transport, diffusion processes, and sampling (a topic that was way more popular than my co-organizer Michael Albergo and I anticipated).

Meaningful evaluation vs. NUTS

I like that the microcanonical HMC paper demonstrates an auto-tuning scheme with impressive improvements over Stan’s no-U-turn sampler, rather than using vanilla Hamiltonian Monte Carlo as a baseline. Vanilla HMC is very difficult to tune to work at all, and even harder to tune for efficiency, especially without jittering the integration time (step size or number of steps).
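To make the jittering point concrete, here’s a minimal sketch (my own illustration, not the paper’s method and not Stan’s implementation) of vanilla HMC on a standard normal where the number of leapfrog steps is drawn uniformly each iteration. Randomizing the trajectory length this way is a standard guard against the resonance pathologies of a fixed integration time.

```python
import numpy as np

def hmc_jittered(logp_grad, x0, step_size, max_steps, n_samples, rng):
    """Vanilla HMC with the number of leapfrog steps jittered
    uniformly on {1, ..., max_steps} each iteration.  A sketch for
    illustration only; all names here are made up for this example."""
    x = np.asarray(x0, dtype=float)
    samples = []
    for _ in range(n_samples):
        p = rng.standard_normal(x.shape)        # resample momentum
        x_new, p_new = x.copy(), p.copy()
        n_steps = int(rng.integers(1, max_steps + 1))  # jittered length
        lp, g = logp_grad(x_new)                # log density and gradient
        lp_new = lp
        for _ in range(n_steps):                # leapfrog integrator
            p_new = p_new + 0.5 * step_size * g
            x_new = x_new + step_size * p_new
            lp_new, g = logp_grad(x_new)
            p_new = p_new + 0.5 * step_size * g
        # Metropolis accept/reject on the joint (position, momentum) energy
        h_old = -lp + 0.5 * p @ p
        h_new = -lp_new + 0.5 * p_new @ p_new
        if np.log(rng.uniform()) < h_old - h_new:
            x = x_new
        samples.append(x.copy())
    return np.array(samples)

def std_normal_logp_grad(x):
    """Target: standard normal log density and its gradient."""
    return -0.5 * x @ x, -x
```

On a well-conditioned target like this, the jittering costs little; its value shows up on targets where a fixed trajectory length happens to return near the starting point.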

Langevin diffusions everywhere

I’m seeing Langevin diffusions everywhere these days. Chirag Modi, a joint astro and math postdoc here at Flatiron Institute who did his Ph.D. with Uros, Alex Barnett, Edward Roualdes and I are working on mashing up our recent work on delayed rejection for HMC for multiscale distributions with Radford Neal’s latest partial momentum refresh Langevin sampler, with a dash of parallel auto-adaptation courtesy of Matt Hoffman and Pavel Sountsov’s latest sampler, MEADS. This is the project that motivated Edward to start BridgeStan.

The field is heating up

The field of NUTS competitors is continuing to heat up (physical systems pun purely coincidental). Stay tuned for the results of BridgeStan implementations with more extensive evaluations based on posteriordb.

13 thoughts on “Another physically motivated sampler: Microcanonical HMC”

  1. Question. For the comparison of MCHMC and NUTS on Neal’s funnel, the paper says (these lines come from two different places):

    > Compared to NUTS (with warm-up) the improvement of MCHMC is a factor of 11
    > Both MCHMC and NUTS are capable of accurately sampling the funnel target

    Is there a reason to think we can get away from needing to non-center everything? Or does MCHMC have some advantage here that leads to this factor of 11? Or is this just MCHMC making the best of a tough situation?

    • Non-centered parametrization would still be useful: Neal’s funnel becomes a standard normal, and the ESS drastically increases to 0.25.
      Here we do not do that, and as you say, MCHMC just makes the best of a tough situation.

    • I hope we can get away from needing to manually work out the combinatorial centered/non-centered parameterizations. I would much rather have a sampler that works for both parameterizations than one requiring the user to manually optimize.

      • It is too much to expect any sampler to work equally well in all parametrizations.
        This is where preconditioning comes in. For example, you could combine MCHMC with a normalizing-flow preconditioner (or something cheaper).

        • Do you have a sketch of how that might work (how to know a non-centering can be applied, how to detect it, and what to do)?

        • @Ben and @Tamas:

          The current standard reference for adaptive non-centering is probably Maria Gorinova’s “Automatic Reparameterisation of Probabilistic Programs”. She also writes a bit about automatically detecting possibilities for non-centering (and also other reparametrizations) based on analyzing the probabilistic program’s code/structure, but this isn’t what we would be considering at all. We would want to put it into brms, because brms already has all of the structure about hierarchies between parameters available. To tune the adaptive non-centeredness, we’d also be doing something slightly different (but related) to what Maria Gorinova proposed. We actually haven’t started working on bringing it to brms, and Paul Bürkner also doesn’t know about our plans yet, so “end of this year” is probably quite optimistic. In principle, implementation should be easy, but in practice, well, you never know what happens there…
