Skip to content

[AutoDeploy]: Perf sweep SuperV3 nvfp4 with large workloads and identify perf gaps #11642

@galagam

Description

@galagam

🚀 The feature, motivation and pitch

Perf sweep with a large POR P0 workloads.

PRs that might be relevant:
#11515
#11624

Done criteria:

  • Pareto curve comparing manual + autodeploy perf
  • nsys traces
  • Trace comparison report (AI generated, human validated)

See also: #11529

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.

Metadata

Metadata

Labels

AutoDeploy<NV> AutoDeploy Backendfeature requestNew feature or request. This includes new model, dtype, functionality support

Type

No type

Projects

Status

Ready

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions