Updated Bayesian Optimization Workflow by jaeminy00 · Pull Request #14 · mcgalcode/rxn-ca

jaeminy00 · 2026-05-19T22:50:10Z

Revamped the Bayesian Optimization workflows under /workflow/flows and /workflow/jobs to support precursor, temperature, and time-based optimization of reaction recipes.

…standard ReactCA simulation and Bayesian Optimization simulation.

Updated docstring to clarify jobflow usage.

Copied original jobs.py file to here

Added example usage for creating a simulation flow in the docstring.

…etting temperatures; rxn-network uses string instead of float/int so the decimal point caused KeyErrors.

…ole json

…orrectly, while making sure the reaction library path is exported in BO workflow when it is first generated to prevent mongoDB memory errors.

Update campaign handling to write JSON to a file for next trial.

…hpc job

Added functionality to optimize individual precursor amounts using ratio parameters.

workflow to build 1 common reaction library and share

Ray auto-detects full node CPU count (e.g. 256) instead of the SLURM- allocated CPUs, causing it to spawn task workers that hang on startup. Initialize Ray with SLURM_NTASKS * SLURM_CPUS_PER_TASK before run_enumerators / get_scored_rxns to keep tasks within the allocated pool.

mcgalcode

This looks good to me! Not sure I can understand completely enough to be really critical, but nothing stood out to me.

mcgalcode · 2026-05-27T19:54:15Z

+
+        Discrete rather than continuous: BayBE enumerates all candidate values
+        and selects via Thompson Sampling, avoiding the L-BFGS-B boundary-hang
+        that occurs when a continuous parameter's optimum sits at its lower bound


Just out of interest: was that boundary hang thing a problem? Did you resolve it somehow by using this Thompson Sampling?

I think I need to do more testing on this; making precursor ratio as a discrete variable was a semi-temporary fix and I was planning on looking at it more, but after talking to KP she said it'd be okay to not touch precursor ratios at all for now.

mcgalcode · 2026-05-27T19:55:18Z

-import multiprocessing as mp

-_scoring_globals = {}
+def _pool_initializer(data: dict):


Thanks, this is a lot better than the naked global declaration lol

mcgalcode · 2026-05-27T19:55:54Z

+#!/usr/bin/env python3
+"""Quick local test for the BOFlowMaker jobflow.
+
+Runs 2 initial + 2 BO trials on a tiny 5x5 grid with 1 realization.


Did you mean to include this file? It looks like a debug script.

Oh shoot yes it's not supposed to be here my bad–can you get rid of it from your side or do I have to do it?

You can just delete it, make a commit, and push again.

Okie I did that and I also made some changes on my fork, I'm assuming they automatically get added here? Or should I PR again?

mcgalcode · 2026-05-27T20:05:53Z

+
+    search_space = SearchSpace.from_dict(search_space_config)
+
+    stub_objective = MockObjectiveFunction(


This seems out of place - shouldn't this be a real objective function?

I added this because BayesianOptimizer needed to initialize the campaign requires objective argument, but _build_campaign doesn't require it. So it was like a placeholder, but I definitely think this can be written better.

mcgalcode · 2026-05-27T20:13:55Z

+            "to prevent this (see search_space.add_precursor_ratio)."
+        )
+        from baybe.recommenders.pure.nonpredictive.sampling import RandomRecommender
+        recommendation = RandomRecommender().recommend(


Again out of curiosity, do you know what the different recommender options are?

I used the random recommender as a starting point, but there's ~10 recommenders that come with BayBE–this is future room for improvement of this BO workflow. Perhaps it would be better if we allow users to choose which recommender to use, instead of fixing it to ReandomRecommender?

Yeah, maybe making this a parameter would be good? RandomRecommender could be the default.

…to improve speed.

This was added previously but it's nolonger needed and is causing errors. Therefore reverting it back.

jaeminy00 · 2026-06-02T22:56:07Z

Note that the force-push from 77f533c to 74bcf32 was done to change the github account and email that was used to commit (HPC configuration wasn't done properly, so a different, automatically generated github account was used for the commits; overwrote the authorship)

jaeminy00 and others added 26 commits April 9, 2026 14:23

added kwargs for jobflow library building

06d6ccf

fixed typo

3018d25

added entry_kwargs to the @job decorated setup_reaction_library as well

4d7fb76

Bayesian workflow jobs

9b849e8

added bayesian optimizer JobFlow flow file

40c8982

Created jobs and flows folders, and separated job and flow files for …

f80e728

…standard ReactCA simulation and Bayesian Optimization simulation.

Revise docstring for core jobflow jobs

3e82226

Updated docstring to clarify jobflow usage.

Refactor imports and enhance reaction library setup

b4c7955

Copied original jobs.py file to here

Enhance docstring with example for create_simulation_flow

b6d9f39

Added example usage for creating a simulation flow in the docstring.

fixed kwargs not correctly being added due to wrong branching

9969eac

fixed typo

a4017c8

more kwargs error!!!

aca9692

fixed KeyError issue arising from using floats instead of ints when s…

8aa0d69

…etting temperatures; rxn-network uses string instead of float/int so the decimal point caused KeyErrors.

fixed mongoDB max memory issue, now passes the path instead of the wh…

e479e7a

…ole json

fixed flows/bayesian.py as well

843add8

made changes to kwargs logic to make sure everything is passed down c…

10ca6bf

…orrectly, while making sure the reaction library path is exported in BO workflow when it is first generated to prevent mongoDB memory errors.

fixed a bug where json from 1 trial to another was not writing correctly

db00955

deleted the weird json file

575339e

Change campaign_json to file path for next trial

9c4f68e

Update campaign handling to write JSON to a file for next trial.

cherry picked max's edits on precursor selection

7db9951

accounting for when job walltime hits and need to re-launch on a new …

99488a8

…hpc job

Enable optimization of precursor amounts with ratios

8b07d00

Added functionality to optimize individual precursor amounts using ratio parameters.

fixed fireworks saving launcher files to wrong directories

36955a6

changes to make sure the correct job is picked up by fireworks; modified

7b24565

workflow to build 1 common reaction library and share

jobflow job naming convention fixes

e3e7120

mcgalcode reviewed May 27, 2026

View reviewed changes

jaeminy00 added 3 commits May 27, 2026 14:26

changed default compress_freq to 500, instead of 1, for BO workflows …

cefba4e

…to improve speed.

changed default metastability cutoff from 0.1 to 0.03

15bc4c3

Removed the Ray initialization

6cca609

This was added previously but it's nolonger needed and is causing errors. Therefore reverting it back.

jaeminy00 added 3 commits June 2, 2026 15:25

Removed Ray Initialization line as it's not functional

b7237e1

Merge remote-tracking branch 'refs/remotes/origin/master'

e06c6db

test

74bcf32

jaeminy00 force-pushed the master branch from 77f533c to 74bcf32 Compare June 2, 2026 22:50


		search_space = SearchSpace.from_dict(search_space_config)

		stub_objective = MockObjectiveFunction(

Conversation

jaeminy00 commented May 19, 2026

Uh oh!

mcgalcode left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jaeminy00 commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants