[Common, PyTorch] Improve mHC to match DeepSeek's implementation#2953
Draft
kainzhong wants to merge 10 commits intoNVIDIA:mainfrom
Draft
[Common, PyTorch] Improve mHC to match DeepSeek's implementation#2953kainzhong wants to merge 10 commits intoNVIDIA:mainfrom
kainzhong wants to merge 10 commits intoNVIDIA:mainfrom