Pinned
- flash-attention (Python; forked from ROCm/flash-attention): fast and memory-efficient exact attention (usage sketch below).
- vllm (Python; forked from vllm-project/vllm): a high-throughput and memory-efficient inference and serving engine for LLMs (usage sketch below).
-
pytorch/pytorch
pytorch/pytorch PublicTensors and Dynamic neural networks in Python with strong GPU acceleration
-
vllm-project/vllm
vllm-project/vllm PublicA high-throughput and memory-efficient inference and serving engine for LLMs
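
A minimal flash-attention usage sketch, assuming the upstream `flash_attn` Python package: its `flash_attn_func` entry point computes exact (not approximate) attention without materializing the full seqlen-by-seqlen score matrix, and expects fp16/bf16 tensors on a CUDA device. Shapes and dtypes here are illustrative.

```python
import torch
from flash_attn import flash_attn_func

# Tensors are (batch, seqlen, num_heads, head_dim); flash-attention
# requires fp16 or bf16 inputs resident on a CUDA device.
q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")

# Exact attention with causal masking; output has the same shape as q.
out = flash_attn_func(q, k, v, causal=True)
```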
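A minimal offline-inference sketch against vLLM's `LLM`/`SamplingParams` API (these entry points are vLLM's own; the model name below is just a small placeholder, not a model pinned on this profile):

```python
from vllm import LLM, SamplingParams

# Load the model once; vLLM manages KV-cache memory and batches
# requests internally for high throughput.
llm = LLM(model="facebook/opt-125m")  # placeholder model

params = SamplingParams(temperature=0.8, max_tokens=64)
outputs = llm.generate(["The capital of France is"], params)
print(outputs[0].outputs[0].text)
```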
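And a small PyTorch sketch of the "tensors and dynamic neural networks with GPU acceleration" description, assuming nothing beyond stock `torch`:

```python
import torch

# Tensors with automatic differentiation, on GPU when available.
device = "cuda" if torch.cuda.is_available() else "cpu"
x = torch.randn(4, 4, device=device, requires_grad=True)

# The autograd graph is built dynamically as operations execute.
loss = (x @ x.T).relu().sum()
loss.backward()
print(x.grad.shape)  # torch.Size([4, 4])
```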



