[Feature]: Support MXFP8/FP8 Model for ARK #1444

@luoyu-intel

Feature Description

ARK already supports MXFP8/FP8 kernels, but this support has not yet been added to AR's backend.

Motivation and Use Case

Adding MXFP8/FP8 to AR's backend would make more quantization dtypes available for inference.

Alternatives Considered

No response

Definition of Done

No response

Additional Context

No response

Metadata

Labels

enhancement (New feature or request)
