Can I use ByteTransformer to train TransFormer models on GPUs , currently supports which models ?
Can I use ByteTransformer to train TransFormer models on GPUs , currently supports which models ?