deep-learning-optimization

Here are 2 public repositories matching this topic...

mslawsky / learning-rate-optimization

This repository documents the process of finding the optimal learning rate for deep neural networks

time-series-forecasting ml-engineering training-dynamics neural-network-training tensorflow-optimization learning-rate-optimization deep-learning-optimization

Updated Jun 3, 2025

pauliano22 / triton-gpu-kernels

Star

High-performance Triton kernels for NVIDIA H100. Implements fused FP8 LayerNorm, tiled FlashAttention, and SRAM-optimized memory primitives for Hopper architecture.

parallel-computing cuda triton gpu-kernels fp8 h100 deep-learning-optimization llm-infrastructure

Updated Apr 3, 2026
Python

Improve this page

Add a description, image, and links to the deep-learning-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deep-learning-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly