[Feature]: update the TRT-llm bench hard code version to match the TRT-llm version.

### 🚀 The feature, motivation and pitch

https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/bench/benchmark/utils/general.py#L172

is hardcoded as 1.2 so TRT-llm bench says version 1.2 no matter what.
`TensorRT LLM Version:   1.2
Dtype:                  bfloat16
KV Cache Dtype:         FP8
Quantization:           NVFP4`

### Alternatives

match the TRT-llm version 

### Additional context

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and checked the [documentation](https://nvidia.github.io/TensorRT-LLM/) and [examples](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples) for answers to frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: update the TRT-llm bench hard code version to match the TRT-llm version. #11560

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature]: update the TRT-llm bench hard code version to match the TRT-llm version. #11560

Description

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions