They missed a great opportunity to call this BloTorch (Bayesian Learning and Optimisation?), but I'm very excited to see such methods gaining more traction!
We built (internally) an engineering optimization framework on top of Botorch. Bayesian optimization is a great tool to have in the toolbox for mapping a design space and finding the limiting cases with fewer calls to the expensive physics-based solvers. The computational budget often winds up being 5-10x smaller than it would have been using traditional design-of-experiments sampling methods.
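For anyone curious what that looks like in practice, here's a minimal sketch of the kind of loop involved. Everything here is a toy placeholder: `expensive_solver` stands in for the real FEA call, and the budget/settings are illustrative, not our actual framework.

```python
import torch
from botorch.models import SingleTaskGP
from botorch.fit import fit_gpytorch_mll
from botorch.acquisition import ExpectedImprovement
from botorch.optim import optimize_acqf
from gpytorch.mlls import ExactMarginalLogLikelihood

def expensive_solver(x: torch.Tensor) -> torch.Tensor:
    # Stand-in for a physics-based solver; returns one scalar per design point.
    return -((x - 0.3) ** 2).sum(dim=-1, keepdim=True)

bounds = torch.tensor([[0.0, 0.0], [1.0, 1.0]])  # two design parameters
train_X = torch.rand(5, 2)                        # small initial DoE
train_Y = expensive_solver(train_X)

for _ in range(20):  # each iteration spends exactly one solver call
    gp = SingleTaskGP(train_X, train_Y)
    fit_gpytorch_mll(ExactMarginalLogLikelihood(gp.likelihood, gp))
    acqf = ExpectedImprovement(gp, best_f=train_Y.max())
    candidate, _ = optimize_acqf(
        acqf, bounds=bounds, q=1, num_restarts=5, raw_samples=64
    )
    train_X = torch.cat([train_X, candidate])
    train_Y = torch.cat([train_Y, expensive_solver(candidate)])
```

The savings over space-filling DoE come from spending each solver call where the surrogate expects it to be most informative, rather than on a fixed grid.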
There are also recent Bayesian optimal experimental design methods that let you design experiments directly via gradient ascent. Not sure how they compare with BayesOpt on your problem, though.
How flexible was Botorch for your task, and was it difficult to map the problem to a Botorch-specific format? And what kind of understanding did you have of the underlying cost/objective function, and what kind of surrogate model did you use? Just a basic GP (Gaussian process)?
It wasn’t terribly difficult once we got a feel for the underlying API. GP surrogates work out of the box, but you can plug in other kinds as well. Same goes for acquisition functions.
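To give a feel for it, swapping the acquisition function is basically a one-line change. This sketch assumes a GP `gp` already fitted to `train_X`/`train_Y`, as in the loop sketch above:

```python
from botorch.acquisition import ExpectedImprovement, UpperConfidenceBound

# Either of these drops into the same optimize_acqf call:
acqf = ExpectedImprovement(gp, best_f=train_Y.max())  # classic EI
acqf = UpperConfidenceBound(gp, beta=2.0)             # more exploratory
```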
As far as the objective, usually we’re calling an external physics-based solver (e.g. finite elements) and post-processing the solution to get the quantity we’re trying to optimize. There’s almost never any gradient information, so Bayesian optimization winds up being the method of choice.
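Concretely, the objective ends up being a thin wrapper around the solver invocation, something like this. The solver CLI and file formats here are entirely hypothetical:

```python
import subprocess
from pathlib import Path
import torch

def objective(X: torch.Tensor) -> torch.Tensor:
    """One external solver run per design point; no gradient information."""
    values = []
    for params in X:
        # Hypothetical input/output conventions for the solver.
        Path("case.inp").write_text(" ".join(f"{p:.6f}" for p in params.tolist()))
        subprocess.run(["my_solver", "case.inp", "-o", "case.out"], check=True)
        values.append(float(Path("case.out").read_text()))  # post-processed scalar
    return torch.tensor(values).unsqueeze(-1)
```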
If you’re not from a traditional engineering background, some of my terminology may have been confusing. Here’s a less jargon-heavy version:
- We use Bayesian optimization to find the optimum (or worst-case) configuration of real manufactured objects and systems.
- Bayesian optimization lets us arrive at that design configuration faster than explicit, physics-based simulation of many samples within the space of all possible configurations.
- We built the framework to do that using Botorch.
- It’s not an uncommon practice by any means, but the availability of tools like Botorch now makes it a lot easier to implement Bayesian optimization in-house, vs relying on a vendor-based engineering tool.
Are the tolerances of the optimum important in your case? (i.e., how sensitive it is to errors in the design parameters) If so, did you use any method to incorporate this information into the optimization?
It depends on the needs of the specific application. What typically happens is that you’d use BO to globally converge to within some tolerance and use the resulting surrogate to get a map of where the interesting regions are. You can then more densely sample these candidate regions or switch to a gradient-based method (via finite difference). For uncertainty information, we usually add this as a noise parameter, either on the input samples, or as part of the GP kernel.
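For the observation-noise part, a minimal sketch in BoTorch terms, assuming you can estimate a per-sample variance (e.g. from repeated runs). In recent BoTorch versions this goes straight into SingleTaskGP; older versions used a separate FixedNoiseGP model:

```python
import torch
from botorch.models import SingleTaskGP

train_X = torch.rand(10, 2)
train_Y = torch.randn(10, 1)                 # toy observations
train_Yvar = torch.full_like(train_Y, 1e-2)  # placeholder variance estimates

gp = SingleTaskGP(train_X, train_Y, train_Yvar=train_Yvar)
```

Noise on the input samples themselves (i.e., tolerances on the design parameters) is more involved and gets handled at the kernel or problem-formulation level.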
I've never used the library directly, but it was quite useful as a method for hyperparameter search in Optuna (for choosing machine learning tuning parameters).
Same here, that's how I first came across it :) In my tests so far, it performs a lot better than other samplers and needs a lot fewer trials to find good hyperparameters.
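For reference, it's just a sampler swap. Here's a sketch with a toy objective; note that in newer Optuna releases BoTorchSampler lives in the separate optuna-integration package:

```python
import optuna
from optuna.integration import BoTorchSampler

def objective(trial: optuna.Trial) -> float:
    lr = trial.suggest_float("lr", 1e-5, 1e-1, log=True)
    dropout = trial.suggest_float("dropout", 0.0, 0.5)
    # Stand-in for training a model and returning the validation loss.
    return (lr - 1e-3) ** 2 + dropout

study = optuna.create_study(direction="minimize", sampler=BoTorchSampler())
study.optimize(objective, n_trials=30)
```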
It's also very useful for simulation-based optimization. As an example, we use it extensively for the design of particle accelerators, where the simulations are typically expensive and need to run on supercomputers. We have built our own library[0] for enabling this, which in the end uses BoTorch (through Ax[1]) under the hood.
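For anyone who wants to try the Ax route directly, the service API keeps the loop pretty small. A sketch, with a toy quadratic standing in for the accelerator simulation; note the create_experiment signature has shifted a bit across Ax versions:

```python
from ax.service.ax_client import AxClient

ax_client = AxClient()
ax_client.create_experiment(
    name="toy_design",
    parameters=[
        {"name": "x1", "type": "range", "bounds": [0.0, 1.0]},
        {"name": "x2", "type": "range", "bounds": [0.0, 1.0]},
    ],
    objective_name="loss",
    minimize=True,
)

for _ in range(20):
    params, trial_index = ax_client.get_next_trial()
    # Stand-in for submitting the expensive simulation and reading the result.
    loss = (params["x1"] - 0.5) ** 2 + (params["x2"] - 0.2) ** 2
    ax_client.complete_trial(trial_index=trial_index, raw_data=loss)
```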