
Conversation

@JohnQinAMD (Collaborator)

update llama and grok training config

Copilot AI review requested due to automatic review settings on January 6, 2026, 06:28.
@Xiaoming-AMD merged commit 639b793 into main on Jan 6, 2026. 9 checks passed.
Copilot AI (Contributor) left a comment

Pull request overview

This PR updates training configuration parameters for LLaMA 3.1 405B and Grok1 models on MI355X hardware. The changes adjust parallelism and batch size settings that affect training behavior and resource utilization.

  • Reduces tensor parallelism degree for LLaMA 3.1 405B from 8 to 1
  • Increases global batch size for Grok1 (both BF16 and FP8 variants) from 128 to 512

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File / Description:
  • examples/torchtitan/configs/MI355X/llama3.1_405B-pretrain.yaml: Reduces tensor_parallel_degree from 8 to 1, aligning with other large model configs
  • examples/megatron/configs/MI355X/grok1-FP8-pretrain.yaml: Increases global_batch_size from 128 to 512, quadrupling gradient accumulation steps
  • examples/megatron/configs/MI355X/grok1-BF16-pretrain.yaml: Increases global_batch_size from 128 to 512, quadrupling gradient accumulation steps
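As a rough illustration, the edits would look something like the sketch below in the YAML configs. Only the tensor_parallel_degree and global_batch_size values are stated in this PR; the surrounding section names and other keys are assumptions for illustration.

```yaml
# Sketch of examples/torchtitan/configs/MI355X/llama3.1_405B-pretrain.yaml
# ("parallelism" section name is assumed; only the value change is confirmed by this PR)
parallelism:
  tensor_parallel_degree: 1   # previously 8

# Sketch of examples/megatron/configs/MI355X/grok1-BF16-pretrain.yaml and
# grok1-FP8-pretrain.yaml ("training" section and micro_batch_size are assumed)
training:
  micro_batch_size: 1         # assumed unchanged
  global_batch_size: 512      # previously 128
```

Since global_batch_size = micro_batch_size × data_parallel_size × gradient_accumulation_steps, raising the global batch size 4x while the micro batch size and data-parallel layout stay fixed implies 4x as many gradient accumulation steps per optimizer update, which is why the review notes that accumulation steps quadruple.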
