Minimal runner wrapper for: https://github.com/Petlja/PLCT-AI-Data-Unifier
This repository is intentionally small. It uses the package above as a dependency and only provides a local config and one batch entry point for Windows users.
- Reads repositories from plct-ai-data-unifier-config.yaml.
- Uses uv to install and lock dependencies from pyproject.toml.
- Runs the PLCT AI Data Unifier command flow.
- Keeps generated output on disk after the run is finished.
Edit plct-ai-data-unifier-config.yaml and keep your repository list under:
repos:
- url: https://github.com/Petlja/example-repoYou can include as many repos as needed.
- Install uv.
- Edit plct-ai-data-unifier-config.yaml.
- Run run-au.bat.
Default run is full bootstrap and produces reusable output folders.
run-au.bat [config-file] [mode] [base-dir] [output-dir] [jobs]
Supported modes:
- bootstrap
- git-sync
- prepare-dataset
Examples:
-
Full run (default values) run-au.bat
-
Full run with explicit config and serial conversion run-au.bat plct-ai-data-unifier-config.yaml bootstrap repos dataset 1
-
Only sync repositories run-au.bat plct-ai-data-unifier-config.yaml git-sync repos
-
Only build dataset from already synced repositories run-au.bat plct-ai-data-unifier-config.yaml prepare-dataset repos dataset 4
Equivalent direct uv commands (same as package README, without batch wrapper):
uv sync
uv run plct-ai-data-unifier bootstrap --config plct-ai-data-unifier-config.yaml --base-dir repos --output-dir dataset --jobs 1
# or step by step
uv run plct-ai-data-unifier git-sync --config plct-ai-data-unifier-config.yaml --base-dir repos
uv run plct-ai-data-unifier prepare-dataset --base-dir repos --output-dir dataset --jobs 1After a successful run, folders are kept so users can continue to work with the result without rerunning everything:
- repos
- dataset
No cleanup is performed by run-au.bat.