feat/fix(kernel): integrate multi kernel and fix some bugs#19
Open
flashzxi wants to merge 1 commit into
Open
Conversation
* [算子添加-clip] 支持clip(torch.slamp), 不支持min max广播(infiniteCore限制) * [算子添加-conv] 支持Conv (torch.conv1d, torch.conv2d, torch.conv3d) * [算子添加-LayerNorm] 支持LayerNorm 输入x的rank至少为3(infiniCore限制) * [算子添加-LogSoftmax] 支持 LogSoftmax, 不支持dim参数 * 大量原生bug修复 * [算子添加-Softmax] 支持 Softmax, 仅gpu * [算子添加-LpNorm] 支持 LpNorm, 仅gpu * [算子添加-RmsNorm] 支持 RmsNorm, cpu and gpu * [算子添加-Unary] 支持 relu、sigmoid、silu、gelu、softplus、tanh * cpu gpu 测试通过 * format all
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
算子添加
bug修复
发现多个影响测试的bug,已经修复,细节见Bugs.md
另外InfiniCore的问题没有修复
比如:
logsoftmax kernel申请了太多线程导致无法启动。故类似测试中没有比较计算结果是否正确
cuda 测试通过截图
test
test-front