-
Notifications
You must be signed in to change notification settings - Fork 57
Open
Labels
questionFurther information is requestedFurther information is requested
Description
Hello @LeiWang1999
I am trying to use the BitNet modeling in an other project to use bitblas kernels, when I load the model, and try to replace linear layers, with BitBlas Linear layers, the _get_or_create_bitblas_operator function takes a lot of time to execute and compile kernels based on the weight shape, for a model with 32 layers, with a hidden size of 4096 and intermediate size of 14336 it takes ~8 min. Is this an intended behaviour ? Thank you for your help
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested