adjust building RH image #3883
Conversation
Fix the convert_tokenizer configuration, use only non-restricted test models, and pin optimum-intel to a fixed version.
Pull request overview
This PR updates the build and test configuration for RedHat images to fix tokenizer handling and switch to non-restricted test models. The changes address issue #3844 by fixing the convert_tokenizer configuration, updating to a specific version of optimum-intel, and replacing restricted HuggingFace models with publicly accessible alternatives.
Key changes:
- Switched from `facebook/opt-125m` to `HuggingFaceTB/SmolLM2-360M-Instruct` as the primary LLM test model
- Changed `meta-llama/Llama-3.1-8B-Instruct` to `unsloth/Llama-3.1-8B-Instruct` for tokenizer testing (see the conversion sketch after this list)
- Fixed the openvino_tokenizers installation process in the RedHat Dockerfile to build from source and install the Python bindings correctly
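The convert_tokenizer fix concerns how the tokenizer and detokenizer models are produced for these test models. A minimal sketch of that conversion flow, assuming the openvino_tokenizers Python API, with the output directory and file names chosen purely for illustration (the model ID is the one this PR switches to):

```python
# Minimal sketch: converting a Hugging Face tokenizer to OpenVINO tokenizer/detokenizer models.
# The model ID comes from this PR; the output directory and file names are illustrative.
from pathlib import Path

import openvino as ov
from openvino_tokenizers import convert_tokenizer
from transformers import AutoTokenizer

model_id = "unsloth/Llama-3.1-8B-Instruct"
output_dir = Path("llama3_tokenizer")
output_dir.mkdir(parents=True, exist_ok=True)

# Load the Hugging Face tokenizer and convert it into OpenVINO tokenizer/detokenizer models.
hf_tokenizer = AutoTokenizer.from_pretrained(model_id)
ov_tokenizer, ov_detokenizer = convert_tokenizer(hf_tokenizer, with_detokenizer=True)

# Save both models so the serving runtime can load them from disk.
ov.save_model(ov_tokenizer, str(output_dir / "openvino_tokenizer.xml"))
ov.save_model(ov_detokenizer, str(output_dir / "openvino_detokenizer.xml"))
```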
Reviewed changes
Copilot reviewed 14 out of 15 changed files in this pull request and generated 3 comments.
Summary per file:
| File | Description |
|---|---|
| windows_prepare_llm_models.bat | Added new LLM_MODEL variable and updated LLAMA3_MODEL to use non-restricted model sources |
| src/test/pull_gguf_hf_model_test.cpp | Increased timeout and removed modelscope.cn test case |
| src/test/llm/output_parsers/llama3_output_parser_test.cpp | Updated tokenizer paths to use unsloth Llama model instead of meta-llama |
| src/test/llm/lm_legacy_regular.pbtxt | Changed models_path to use new SmolLM2 model |
| src/test/llm/lm_cb_regular.pbtxt | Changed models_path to use new SmolLM2 model |
| src/test/llm/llmnode_test.cpp | Updated test model paths and adjusted token limits for new model's max length (8192) |
| run_unit_tests.sh | Added proxy configuration support for bazel tests |
| prepare_llm_models.sh | Updated model variables, pinned the optimum-intel version, and added facebook/opt-125m as a separate download |
| docs/deploying_server_baremetal.md | Updated release version references from v2025.4 to v2025.4.1 |
| demos/python_demos/requirements.txt | Pinned optimum-intel to specific commit and updated transformers version constraint |
| demos/common/export_models/requirements.txt | Updated the optimum-intel commit and bumped openvino package versions to 2025.4.1 |
| create_package.sh | Fixed convert_tokenizer script generation and updated version metadata |
| ci/build_test_OnCommit.groovy | Enabled tests for RedHat release image build (RUN_TESTS=1) |
| Dockerfile.redhat | Refactored openvino_tokenizers installation to always build from source with proper Python bindings |
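Several of the files above (prepare_llm_models.sh and the export requirements) revolve around exporting the new non-restricted model through optimum-intel. A minimal sketch of that export step, assuming the optimum-intel Python API and an illustrative output path, might look like:

```python
# Minimal sketch: exporting the non-restricted test model to OpenVINO IR via optimum-intel.
# The model ID is the one introduced by this PR; the output path is illustrative.
from optimum.intel import OVModelForCausalLM
from transformers import AutoTokenizer

model_id = "HuggingFaceTB/SmolLM2-360M-Instruct"
output_dir = "models/SmolLM2-360M-Instruct"

# export=True converts the original checkpoint to OpenVINO IR while loading it.
model = OVModelForCausalLM.from_pretrained(model_id, export=True)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Persist the converted model and tokenizer so the server can serve them from disk.
model.save_pretrained(output_dir)
tokenizer.save_pretrained(output_dir)
```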
// creating prompt that will be tokenized to 8194 tokens when model max length is 8192.
for (int i = 0; i < 8192 - 29; i++) {
Copilot AI commented on Jan 7, 2026:
The comment states '8194 tokens', but the loop creates 8163 tokens (8192 - 29). The comment also implies the prompt should exceed the model max length of 8192, which would require the 8163 loop tokens plus the chat template tokens. Clarify whether the total including chat template tokens equals 8194.
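One way to sanity-check that arithmetic, assuming the transformers chat-template API and treating the prompt construction below as an approximation of the test code rather than its exact contents:

```python
# Minimal sketch: counting tokens produced once the chat template is applied.
# The model ID and the way the prompt is built are illustrative approximations
# of the test setup, not the exact test code.
from transformers import AutoTokenizer

model_id = "HuggingFaceTB/SmolLM2-360M-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Mirror the test loop: repeat a roughly one-token word (8192 - 29) times.
prompt = " ".join(["hello"] * (8192 - 29))
messages = [{"role": "user", "content": prompt}]

# apply_chat_template adds template tokens on top of the raw prompt tokens.
token_ids = tokenizer.apply_chat_template(
    messages, tokenize=True, add_generation_prompt=True
)
# If the in-code comment is right, this should land above the 8192 max length.
print(len(token_ids))
```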
This PR will likely need the same adjustments that were done for #3890.
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
🛠 Summary
Addresses #3844
- Fix convert_tokenizer configuration
- Use only non-restricted test models
- Pin optimum-intel to a fixed version
🧪 Checklist