Add DCP compatibility for FSDP2-TP sharding in TransformerEngine. #15448
Annotations
3 errors and 2 notices
Core: Process completed with exit code 1.
All: The hosted runner lost communication with the server. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
PyTorch: The hosted runner lost communication with the server. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
sccache stats: 100% - 152 hits, 0 misses, 0 errors
sccache stats: 100% - 197 hits, 0 misses, 0 errors