feat: Add low-VRAM training framework for 4GB GPUs #136
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Low-VRAM Training Framework for GO-1
Description
This PR adds a memory-efficient training framework that enables GO-1 model fine-tuning on consumer GPUs with as little as 4GB VRAM (tested on NVIDIA GTX 970).
Problem
The current training setup requires ~70GB VRAM, making it inaccessible to most researchers and developers. This limits community contributions and experimentation.
Solution
A modular low-VRAM training framework built with Clean Architecture and SOLID principles:
Key Features
Architecture
Files Changed
go1/tools/low_vram/tests/low_vram/Testing
All unit tests pass:
python -m unittest tests.low_vram.test_components -v # 18 tests passedTested on:
Usage Example
Breaking Changes
None - this is a new module that doesn't modify existing code.
Checklist