(base) intel@intel-AXMB-D150-3:~/project/qwen35/Qwen3.5-35B-A3B/INT4$ docker run --user $(id -u):$(id -g) \
--device /dev/dri/renderD128:/dev/dri/renderD128 \
--group-add=$(stat -c "%g" /dev/dri/renderD128) \
--rm -p 8001:8001 \
-v $(pwd):$(pwd):rw \
openvino/model_server:latest-gpu \
--model_path /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4 \
--model_name qwen35-35b \
--target_device GPU \
--rest_port 8001
[2026-04-24 06:24:18.265][1][serving][info][server.cpp:88] OpenVINO Model Server 2026.1.0.72cc06244
[2026-04-24 06:24:18.265][1][serving][info][server.cpp:89] OpenVINO backend 2026.1.0-21367-63e31528c62-releases/2026/1
[2026-04-24 06:24:18.265][1][serving][info][server.cpp:92] OpenVINO GenAI backend 2026.1.0.0-2957-1dabb8c2255
[2026-04-24 06:24:18.265][1][serving][info][pythoninterpretermodule.cpp:37] PythonInterpreterModule starting
[2026-04-24 06:24:18.265][1][serving][info][pythoninterpretermodule.cpp:40] Initializing python interpreter
[2026-04-24 06:24:18.321][1][serving][info][pythoninterpretermodule.cpp:59] PythonInterpreterModule started
[2026-04-24 06:24:18.359][1][modelmanager][info][modelmanager.cpp:156] Available devices for Open VINO: CPU, GPU
[2026-04-24 06:24:18.359][1][serving][info][capimodule.cpp:40] C-APIModule starting
[2026-04-24 06:24:18.359][1][serving][info][capimodule.cpp:42] C-APIModule started
[2026-04-24 06:24:18.359][1][serving][info][grpcservermodule.cpp:110] GRPCServerModule starting
[2026-04-24 06:24:18.359][1][serving][info][grpcservermodule.cpp:114] GRPCServerModule started
[2026-04-24 06:24:18.359][1][serving][info][grpcservermodule.cpp:115] Port was not set. GRPC server will not be started.
[2026-04-24 06:24:18.359][1][serving][info][httpservermodule.cpp:35] HTTPServerModule starting
[2026-04-24 06:24:18.359][1][serving][info][httpservermodule.cpp:39] Will start 16 REST workers
[2026-04-24 06:24:18.360][50][serving][info][drogon_http_server.cpp:155] Binding REST server to address: 0.0.0.0:8001
[2026-04-24 06:24:18.410][1][serving][info][drogon_http_server.cpp:182] REST server listening on port 8001 with 16 unary threads and 16 streaming threads
[2026-04-24 06:24:18.410][1][serving][info][http_server.cpp:248] API key not provided via --api_key_file or API_KEY environment variable. Authentication will be disabled.
[2026-04-24 06:24:18.410][1][serving][info][httpservermodule.cpp:52] HTTPServerModule started
[2026-04-24 06:24:18.410][1][serving][info][httpservermodule.cpp:53] Started REST server at 0.0.0.0:8001
[2026-04-24 06:24:18.410][1][serving][info][servablemanagermodule.cpp:51] ServableManagerModule starting
[2026-04-24 06:24:18.410][1][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:18.410][1][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:18.410][82][modelmanager][info][modelmanager.cpp:1224] Started model manager thread
[2026-04-24 06:24:18.410][1][serving][info][servablemanagermodule.cpp:55] ServableManagerModule started
[2026-04-24 06:24:18.410][83][modelmanager][info][modelmanager.cpp:1243] Started cleaner thread
[2026-04-24 06:24:19.411][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:19.411][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:20.411][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:20.411][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:21.411][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:21.411][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:22.412][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:22.412][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:23.412][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:23.412][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:24.412][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:24.412][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:25.412][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:25.412][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
[2026-04-24 06:24:26.413][82][modelmanager][warning][modelmanager.cpp:1488] Expected version directory name to be in number format. Got: model_cache
[2026-04-24 06:24:26.413][82][modelmanager][warning][modelmanager.cpp:1495] No version found for model in path: /home/intel/project/qwen35/Qwen3.5-35B-A3B/INT4
^C[2026-04-24 06:24:26.817][1][serving][info][grpcservermodule.cpp:201] GRPCServerModule shutting down
[2026-04-24 06:24:26.817][1][serving][info][grpcservermodule.cpp:211] GRPCServerModule shutdown