Commit 8477d3d
Enable fused RMSNorm dLN + add through CUDNN (NVIDIA#2778)
* add cudnn dln+add
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
* try fixing cudnn build issue
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
* guard against cudnn version
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* change itype to wtype for add in rmsnorm_bwd
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
* remove dead code
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
* remove dangling todo
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
---------
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>1 parent d2625e5 commit 8477d3d
3 files changed
Lines changed: 38 additions & 13 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
395 | 395 | | |
396 | 396 | | |
397 | 397 | | |
| 398 | + | |
| 399 | + | |
| 400 | + | |
| 401 | + | |
| 402 | + | |
| 403 | + | |
| 404 | + | |
| 405 | + | |
| 406 | + | |
| 407 | + | |
| 408 | + | |
| 409 | + | |
| 410 | + | |
| 411 | + | |
| 412 | + | |
| 413 | + | |
| 414 | + | |
398 | 415 | | |
399 | 416 | | |
400 | 417 | | |
| |||
467 | 484 | | |
468 | 485 | | |
469 | 486 | | |
470 | | - | |
471 | | - | |
472 | | - | |
473 | 487 | | |
474 | 488 | | |
475 | 489 | | |
476 | 490 | | |
| 491 | + | |
| 492 | + | |
| 493 | + | |
| 494 | + | |
| 495 | + | |
| 496 | + | |
477 | 497 | | |
478 | 498 | | |
479 | 499 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
294 | 294 | | |
295 | 295 | | |
296 | 296 | | |
297 | | - | |
| 297 | + | |
298 | 298 | | |
299 | 299 | | |
300 | 300 | | |
| |||
Lines changed: 14 additions & 9 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
206 | 206 | | |
207 | 207 | | |
208 | 208 | | |
209 | | - | |
210 | | - | |
211 | | - | |
212 | | - | |
213 | | - | |
214 | | - | |
215 | | - | |
216 | | - | |
217 | | - | |
| 209 | + | |
| 210 | + | |
218 | 211 | | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
219 | 224 | | |
220 | 225 | | |
221 | 226 | | |
| |||
0 commit comments