
Write files only from root process when distributed #1310

Open
bernstei wants to merge 2 commits into develop from write_files_on_root_only


@bernstei
Collaborator

In particular, this fixes an issue where distributed mace_run_train processes overwrite each other's pretrained and combined data files. The same kind of error may exist in other places, but I tried to fix it everywhere I could find it.

closes #1309
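
For readers unfamiliar with the pattern, here is a minimal sketch of a root-only write, not the code from this PR. The helper name `write_on_root_only` and its parameters are hypothetical. It assumes the rank and a barrier callable are passed in by the caller; in a `torch.distributed` job these could be `torch.distributed.get_rank()` and `torch.distributed.barrier`. Using a barrier to let the other ranks see the completed file is an assumption about one reasonable approach.

```python
from typing import Callable, Optional


def write_on_root_only(
    path: str,
    write_fn: Callable[[str], None],
    rank: int,
    barrier: Optional[Callable[[], None]] = None,
) -> None:
    """Call write_fn(path) on rank 0 only, then synchronize all ranks.

    Hypothetical helper for illustration; not part of MACE.
    """
    if rank == 0:
        # Only the root process touches the file, so ranks cannot
        # overwrite each other's output.
        write_fn(path)
    if barrier is not None:
        # Every rank waits here, so non-root ranks continue only after
        # the root process has returned from write_fn.
        barrier()
```

Every rank must call the helper, including the non-root ones, because a barrier only releases once all ranks have reached it. Skipping the call on non-root ranks would leave the root process waiting indefinitely.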

@bernstei
Collaborator Author

Includes #1311, sort of accidentally.

@bernstei bernstei changed the title from "Write files from root process only when distributed" to "Write files only from root process when distributed" Dec 18, 2025
@bernstei bernstei force-pushed the write_files_on_root_only branch from 52d9410 to 47ea1a7 Compare March 6, 2026 17:08
Noam Bernstein added 2 commits March 6, 2026 12:08
In particular, fixes issue where distributed mace_run_train processes
overwrite each other's pretrained and combined data files
…ce that other tasks see the completed write
@bernstei bernstei changed the base branch from main to develop March 6, 2026 17:12

Development

Successfully merging this pull request may close these issues.

mace_run_train distributed processes overwrite each others' pretrained data files
