diff --git a/skills/deepstream/deepstream-dev/.claude-plugin/plugin.json b/skills/deepstream/deepstream-dev/.claude-plugin/plugin.json
deleted file mode 100644
index c20b6595..00000000
--- a/skills/deepstream/deepstream-dev/.claude-plugin/plugin.json
+++ /dev/null
@@ -1,6 +0,0 @@
-{
-  "name": "deepstream-dev",
-  "description": "NVIDIA DeepStream SDK 9.0 development with Python pyservicemaker API. Use when building video analytics pipelines, GStreamer-based video processing, TensorRT inference integration, object detection/tracking, or Kafka/message broker integration.",
-  "author": "NVIDIA CORPORATION",
-  "skills": "./"
-}
diff --git a/skills/deepstream/deepstream-dev/BENCHMARK.md b/skills/deepstream/deepstream-dev/BENCHMARK.md
deleted file mode 100644
index 2627a581..00000000
--- a/skills/deepstream/deepstream-dev/BENCHMARK.md
+++ /dev/null
@@ -1,111 +0,0 @@
-# Evaluation Report
-
-Evaluation of the `deepstream-dev` skill before publication through NVSkills-Eval.
-
-This benchmark summarizes the results of the 3-tier NVSkills-Eval evaluation of the skill. The goal is to document whether the skill is safe, discoverable, effective, and useful for agents before it is published for broader workflow use.
-
-## Evaluation Summary
-
-- Skill: `deepstream-dev`
-- Evaluation date: 2026-05-28
-- NVSkills-Eval profile: `external`
-- Environment: `local`
-- Dataset: 7 evaluation tasks
-- Attempts per task: 2
-- Pass threshold: 50%
-- Overall verdict: FAIL
-
-## Agents Used
-
-- `claude-code`
-- `codex`
-
-## Metrics Used
-
-Reported benchmark dimensions:
-
-- Security: checks whether skill-assisted execution avoids unsafe behavior such as secret leakage, destructive commands, or unauthorized access.
-- Correctness: checks whether the agent follows the expected workflow and produces the correct final output.
-- Discoverability: checks whether the agent loads the skill when relevant and avoids using it when irrelevant.
-- Effectiveness: checks whether the agent performs measurably better with the skill than without it.
-- Efficiency: checks whether the agent uses fewer tokens and avoids redundant work.
-
-Underlying evaluation signals used in this run:
-
-- `skill_execution` (Skill Execution): verifies that the agent loaded the expected skill and workflow.
-- `skill_efficiency` (Efficiency): checks routing quality, decoy avoidance, and redundant tool usage.
-- `accuracy` (Accuracy): grades final-answer correctness against the reference answer.
-- `goal_accuracy` (Goal Accuracy): checks whether the overall user task completed successfully.
-- `behavior_check` (Behavior Check): verifies expected behavior steps, including safety expectations.
-- `token_efficiency` (Token Efficiency): compares token usage with and without the skill.
-
-## Test Tasks
-
-The benchmark dataset contained 7 evaluation tasks:
-
-- Positive tasks: 5 tasks where the skill was expected to activate.
-- Negative tasks: 2 tasks where no skill was expected.
-- Unlabeled tasks: 0 tasks where positive/negative intent could not be inferred.
-
-Task composition is derived from the evaluation dataset when possible. Entries with `expected_skill` set are treated as positive skill-activation cases, while entries with `expected_skill: null` are treated as negative activation cases.
-
-## Results
-
-| Dimension | Num | `claude-code` | `codex` |
-|---|---:|---:|---:|
-| Security | 8 | 74% (+9%) | 57% (-2%) |
-| Correctness | 8 | 94% (+6%) | 88% (+9%) |
-| Discoverability | 8 | 86% (+11%) | 76% (+9%) |
-| Effectiveness | 8 | 81% (+6%) | 78% (+9%) |
-| Efficiency | 8 | 72% (+12%) | 64% (+9%) |
-
-Score values show skill-assisted performance. Values in parentheses show uplift versus the no-skill baseline when baseline data is available.
-
-## Tier 1: Static Validation Summary
-
-Tier 1 validation passed with observations. NVSkills-Eval ran 9 checks and found 34 total findings.
-
-Top findings:
-
-- MEDIUM PII/gps_coordinates: GPS coordinates (location information) (`references/service_maker_api.md:804`)
-- MEDIUM PII/gps_coordinates: GPS coordinates (location information) (`references/service_maker_api.md:827`)
-- MEDIUM PII/gps_coordinates: GPS coordinates (location information) (`references/service_maker_api.md:829`)
-- MEDIUM PII/gps_coordinates: GPS coordinates (location information) (`references/service_maker_api.md:1279`)
-- MEDIUM PII/gps_coordinates: GPS coordinates (location information) (`references/use_cases_pipelines.md:842`)
-
-## Tier 2: Deduplication Summary
-
-Tier 2 validation reported findings. NVSkills-Eval ran 2 checks and found 34 total findings.
-
-Top findings:
-
-- HIGH DUPLICATE/duplicate: Duplicate content found within references/metamux_config.md:
-  "# default pts-tolerance is 60 ms." in references/metamux_config.md (lines 67-72)
-  vs "# default pts-tolerance is 60 ms." in references/metamux_config.md (lines 125-130) (`references/metamux_config.md:67`)
-- HIGH DUPLICATE/duplicate: Duplicate content found across references/buffer_apis.md and references/kafka_messaging.md and references/service_maker_api.md and references/use_cases_pipelines.md and references/utilities_config.md:
-  "### Pattern 3: Selective Frame Capture" in references/buffer_apis.md (lines 1198-1199)
-  vs "### Pattern 5: Frame Analysis and Logging" in references/buffer_apis.md (lines 1339-1340)
-  vs "#### Example 2: Pipeline with Both Kafka and Display (Using Tee)" in references/kafka_messaging.md (lines 167-168)
-  vs "#### Custom Kafka Producer Probe" in references/kafka_messaging.md (lines 581-582)
-  vs "# Enable tensor output in nvinfer" in references/service_maker_api.md (lines 1329-1333)
-  vs "#### Approach 3: Custom Postprocessing with Tensor Metadata" in references/use_cases_pipelines.md (lines 837-841)
-  vs "### Pattern 3: Custom Postprocessing" in references/utilities_config.md (lines 1275-1279) (`references/buffer_apis.md:1198`)
-- HIGH DUPLICATE/duplicate: Duplicate content found across references/buffer_apis.md and references/kafka_messaging.md and references/use_cases_pipelines.md and references/utilities_config.md:
-  "# from multiprocessing import Queue  # Use this for MULTIPROCESSING!" in references/buffer_apis.md (lines 1059-1063)
-  vs "### Pattern 3: Selective Frame Capture" in references/buffer_apis.md (lines 1195-1197)
-  vs "### Pattern 5: Frame Analysis and Logging" in references/buffer_apis.md (lines 1336-1338)
-  vs "#### Example 2: Pipeline with Both Kafka and Display (Using Tee)" in references/kafka_messaging.md (lines 162-166)
-  vs "#### Custom Kafka Producer Probe" in references/kafka_messaging.md (lines 576-580)
-  vs "#### Approach 3: Custom Postprocessing with Tensor Metadata" in references/use_cases_pipelines.md (lines 832-836)
-  vs "### Pattern 3: Custom Postprocessing" in references/utilities_config.md (lines 1272-1274) (`references/buffer_apis.md:1059`)
-- HIGH DUPLICATE/duplicate: Duplicate content found within references/utilities_config.md:
-  "### Pattern 1: Load and Use Source Configuration" in references/utilities_config.md (lines 1107-1109)
-  vs "### Pattern 1: Load and Use Source Configuration" in references/utilities_config.md (lines 1127-1128)
-  vs "### Pattern 1: Load and Use Source Configuration" in references/utilities_config.md (lines 1142-1143) (`references/utilities_config.md:1107`)
-- HIGH DUPLICATE/duplicate: Duplicate content found within references/metamux_config.md:
-  "# mux all source if don't set it." in references/metamux_config.md (lines 74-78)
-  vs "# mux all source if don't set it." in references/metamux_config.md (lines 132-136) (`references/metamux_config.md:74`)
-
-## Publication Recommendation
-
-The skill should be reviewed before NVSkills-Eval publication. Skill owners should address the findings above and rerun NVSkills-Eval to refresh this benchmark.
diff --git a/skills/deepstream/deepstream-dev/SKILL.md b/skills/deepstream/deepstream-dev/SKILL.md
deleted file mode 100644
index 033844a1..00000000
--- a/skills/deepstream/deepstream-dev/SKILL.md
+++ /dev/null
@@ -1,180 +0,0 @@
----
-name: deepstream-dev
-description: NVIDIA DeepStream SDK 9.0 development with Python pyservicemaker API. Use when building video analytics pipelines, GStreamer-based video processing, TensorRT inference integration, object detection/tracking, or Kafka/message broker integration.
-owner: NVIDIA CORPORATION
-service: deepstream
-version: 1.1.0
-reviewed: 2026-04-24
-license: CC-BY-4.0 AND Apache-2.0
----
-
-# DeepStream Development Skill
-
-When this skill is active, **ALWAYS read the relevant reference documents** before generating code. Do NOT rely on memory - the reference documents contain critical details about exact property names, correct API usage, and common pitfalls.
-
-## SDK and Architecture Quick Reference
-
-### DeepStream SDK 9.0 Version Requirements
-
-- **GStreamer**: 1.24.2
-- **NVIDIA Driver**: 590+
-- **CUDA**: 13.1
-- **TensorRT**: 10.14.1.48
-- **Platforms**: Ubuntu 24.04 (x86_64 and ARM64/Jetson)
-
-### Typical Pipeline Flow
-
-```
-Source → Stream Muxer → Inference → [Tracker] → OSD → Renderer
-```
-Components in `[brackets]` are **optional** -- only add them when the user explicitly requests them.
-
-| Stage | Role | Key Element(s) | Required? |
-|-------|------|-----------------|-----------|
-| Source | Input from files, RTSP, cameras | `nvurisrcbin` (preferred), `nvmultiurisrcbin`, `filesrc` | Yes |
-| Stream Muxer | Batches streams for inference | `nvstreammux` | Yes |
-| Inference | TensorRT model execution | `nvinfer`, `nvinferserver` | Yes |
-| Tracker | Multi-object tracking across frames | `nvtracker` | **Only if requested** |
-| OSD | Draws bounding boxes, labels, overlays | `nvosdbin` | Yes (for visualization) |
-| Renderer | Display or save output | `nveglglessink`, `nv3dsink`, `filesink` | Yes |
-
-### Memory Model
-
-DeepStream uses NVIDIA Video Memory Manager (NVMM) for zero-copy GPU buffer transfers. Caps strings use `memory:NVMM` to indicate GPU memory (e.g., `video/x-raw(memory:NVMM), format=NV12`).
-
-## Critical Rules
-
-1. **Only Add Requested Components**: Do NOT add pipeline elements the user did not ask for.
-   - **Tracker (`nvtracker`)**: Only add when the user explicitly requests tracking or object IDs across frames
-   - **Secondary GIEs**: Only add when the user requests classification or attribute extraction
-   - **Analytics (`nvdsanalytics`)**: Only add when the user requests line crossing, ROI counting, etc.
-   - **Message broker (`nvmsgbroker`/`nvmsgconv`)**: Only add when the user requests Kafka/cloud messaging
-   - When in doubt, build the **minimal working pipeline** and let the user ask for additions
-
-2. **Default to `nvurisrcbin` for Sources**: When the user says "camera", "stream", "video", or provides a file path:
-   - Always use `nvurisrcbin` -- it handles RTSP, HTTP, and local files (`file://`) transparently
-   - Only use `filesrc` + `qtdemux` + parser when the user explicitly needs raw file source control
-   - For RTSP/live sources, also set `live-source=1` on `nvstreammux` and `sync=0` on the sink
-   - Convert local paths to URI: `"file://" + os.path.abspath(path)`
-
-3. **Metadata Iteration**: Use `.frame_items` and `.object_items` (returns iterators, NOT lists)
-   - NEVER use `len()` on these - iterate to count
-   - Iterator can only be consumed once
-
-4. **Request Pad Syntax**: Use `"sink_%u"` template, NEVER literal pad names
-   ```python
-   pipeline.link(("decoder", "mux"), ("", "sink_%u"))  # CORRECT
-   # pipeline.link(("decoder", "mux"), ("", "sink_0"))  # WRONG - will fail
-   ```
-
-5. **Platform Detection for Sinks**:
-   ```python
-   import platform
-   sink_type = "nv3dsink" if platform.machine() == "aarch64" else "nveglglessink"
-   ```
-
-6. **Buffer Cloning**: Always clone buffers for async processing
-   ```python
-   tensor = buffer.extract(0).clone()  # CRITICAL
-   ```
-
-7. **Queue Types**:
-   - `queue.Queue` → Use with `threading.Thread`
-   - `multiprocessing.Queue` → Use with `multiprocessing.Process`
-   - Using wrong type causes silent data loss!
-
-8. **nvinfer Config Format**:
-   - YAML: Use `property:` section (NOT `model:`), `key: value` with space after colon
-   - INI: Use `[property]` section, `key=value` with equals sign
-   - Section MUST be named `property`
-
-9. **nvmsgbroker is a SINK**: Cannot have downstream elements - use `tee` to split pipeline
-
-10. **ALL Sinks Need async=0 for Tee Splits or Dynamic Sources**: CRITICAL for state transitions
-    ```python
-    # When using tee splits OR dynamic sources, ALL sinks MUST have async=0
-    pipeline.add("nveglglessink", "sink", {
-        "sync": 0, "qos": 0,
-        "async": 0  # CRITICAL - prevents state transition deadlock
-    })
-    ```
-    **Symptom if missing**: Pipeline stays in PAUSED state, no video displays.
-
-11. **Built-in Probe Attachment**: `measure_fps_probe` can only be attached to processing elements (e.g., `nvinfer`, `nvosdbin`), **NOT** to sink elements. Attaching to a sink raises `RuntimeError: Probe failure`.
-
-12. **Dynamic ONNX Models Require `infer-dims`**: When the ONNX model has dynamic input shapes (e.g., exported with `dynamic=True` in Ultralytics YOLO, or with dynamic batch/height/width axes), you **MUST** add `infer-dims=C;H;W` to the nvinfer config. Without it, TensorRT sees `-1` for dynamic dimensions and fails with `setDimensions: Error Code 3`. Common values:
-    - YOLO models (640 input): `infer-dims=3;640;640`
-    - Models with 416 input: `infer-dims=3;416;416`
-    - Models with 1280 input: `infer-dims=3;1280;1280`
-
-13. **Ultralytics YOLO Output Format Depends on Model Generation**: v10 and v26+ models output post-NMS results; v8 and v11 models output raw pre-NMS tensors. The custom parser and `cluster-mode` **must** match the actual output:
-
-   | Model generation | Output tensor shape | Fields | `cluster-mode` |
-   |------------------|--------------------|---------------------------------|----------------|
-   | v8 / v11 | `[batch, 84, 8400]` | `[features(4+80), anchors]` — raw cx/cy/w/h + class scores, no NMS | `2` (NMS) |
-   | v10 / v26+ | `[batch, 300, 6]` | `[max_det, (x1,y1,x2,y2,conf,cls)]` — already post-NMS, pixel coords | `4` (none) |
-
-   **How to identify at runtime**: log `inferDims.d[0]` and `inferDims.d[1]` inside the custom parser.
-   - `d={84, 8400}` → pre-NMS (v8/v11 style)
-   - `d={300, 6}` → post-NMS (v10/v26+ style)
-
-   **Symptom of mismatch**: If `cluster-mode: 2` is used with a post-NMS `[N, 6]` output, bounding boxes appear shifted by 45° or 135° from the actual objects (DeepStream's NMS incorrectly re-processes already-final coordinates).
-   If you see tilted or rotated boxes, also check the OBB / `rotation_angle` note in `references/nvinfer_config.md`: for non-OBB models, value-initialize `NvDsInferObjectDetectionInfo` with `obj{}` and keep `rotation_angle = 0`; plain `NvDsInferObjectDetectionInfo obj;` leaves fields uninitialized.
-
-14. **Virtual Environment Must Include pyservicemaker**: `pyservicemaker` is installed system-wide but is NOT accessible from a standard Python virtual environment. When a task requires a venv (e.g., for model download/conversion pip dependencies), **always install `pyservicemaker` and `pyyaml` inside the venv**. The venv setup in generated code and README must always include:
-    ```bash
-    python3 -m venv venv
-    source venv/bin/activate
-    pip install /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl pyyaml
-    pip install -r requirements.txt  # other dependencies
-    ```
-    **Symptom if missing**: `ModuleNotFoundError: No module named 'pyservicemaker'` when running the app inside the venv.
-
-## Key Paths (DeepStream 9.0)
-
-- Models: `/opt/nvidia/deepstream/deepstream/samples/models/`
-- Primary Detector: `/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx`
-- Tracker lib: `/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so`
-- Kafka lib: `/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so`
-- Sample configs: `/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/`
-
-## Reference Documents
-
-**IMPORTANT**: Always read these documents for complete details. Do NOT generate code from memory.
-
-| Document | Use When |
-|----------|----------|
-| [references/gstreamer_plugins.md](references/gstreamer_plugins.md) | Looking up plugin properties, ALL properties listed |
-| [references/service_maker_api.md](references/service_maker_api.md) | Using Pipeline/Flow API, metadata access, probes, EventMessageUserMetadata |
-| [references/use_cases_pipelines.md](references/use_cases_pipelines.md) | Building pipelines: simple playback, multi-inference, cascaded GIE |
-| [references/kafka_messaging.md](references/kafka_messaging.md) | Kafka/message broker setup, nvmsgconv/nvmsgbroker config, msg2p-newapi |
-| [references/best_practices.md](references/best_practices.md) | Design patterns, common pitfalls, anti-patterns |
-| [references/buffer_apis.md](references/buffer_apis.md) | BufferProvider/Feeder (injection), BufferRetriever/Receiver (extraction) |
-| [references/media_extractor_advanced.md](references/media_extractor_advanced.md) | MediaExtractor, MediaChunk, FrameSampler |
-| [references/utilities_config.md](references/utilities_config.md) | PerfMonitor, EngineFileMonitor, SourceConfig, SensorInfo, SmartRecordConfig |
-| [references/nvinfer_config.md](references/nvinfer_config.md) | nvinfer config file format, ALL parameters |
-| [references/tracker_config.md](references/tracker_config.md) | nvtracker config, NvDCF/IOU/DeepSORT/NvSORT |
-| [references/troubleshooting.md](references/troubleshooting.md) | Error messages and solutions |
-| [references/rest_api_dynamic.md](references/rest_api_dynamic.md) | REST API, dynamic source add/remove, nvmultiurisrcbin |
-| [references/metamux_config.md](references/metamux_config.md) | nvdsmetamux config, parallel multi-model inference, metadata merging, source ID filtering |
-| [references/docker_containers.md](references/docker_containers.md) | Docker images, Dockerfile examples, pyservicemaker install, container run commands |
-
-## Quick Error Reference
-
-| Error | Solution |
-|-------|----------|
-| `iterator has no len()` | Iterate to count, don't use `len()` |
-| `pad template not found` | Use `"sink_%u"` not `"sink_0"` |
-| Queue data loss | Match the queue to the worker: `queue.Queue` with `threading.Thread`, `multiprocessing.Queue` with `multiprocessing.Process` |
-| Config parse failed | Use `property:` not `model:` in YAML |
-| `is-classifier` deprecation warning | Use `network-type: 1` instead of `is-classifier: 1` for classifiers; omit both for detectors |
-| `min-boxes` unknown key warning | Use `minBoxes` (camelCase) in `class-attrs-*` sections, not `min-boxes` |
-| Secondary GIE inactive | Set `process-mode: 2`, check `operate-on-gie-id` |
-| Tee/dynamic source stuck PAUSED | Set `async: 0` on **ALL** sink elements |
-| RTSP no data/reconnecting | Test URL with ffplay, check credentials |
-| `RuntimeError: Probe failure` | `measure_fps_probe` cannot attach to sink elements; use `nvinfer` or `nvosdbin` instead |
-| `setDimensions` negative dims / engine build failed | Add `infer-dims=C;H;W` for dynamic ONNX models (e.g., `infer-dims=3;640;640`) |
-| `No module named 'pyservicemaker'` in venv | `pip install /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl pyyaml` inside the venv |
-| `AttributeError: object has no attribute 'obj_label'` | Use `obj_meta.label` not `obj_meta.obj_label` in pyservicemaker (C API name differs from Python binding) |
-
-<!-- Signing refresh marker.  -->
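
Reviewer note on the `iterator has no len()` row and Critical Rule 3 in the deleted SKILL.md: the behaviour is easy to reproduce with plain Python generators. The sketch below is a minimal illustration only; `fake_object_items` is a hypothetical stand-in for `object_items` and does not use the pyservicemaker API.

```python
from typing import Iterable, Iterator


def count_items(items: Iterable[object]) -> int:
    """Count elements by iterating; works for iterators that have no len()."""
    return sum(1 for _ in items)


def fake_object_items(n: int) -> Iterator[dict]:
    """Hypothetical stand-in for object_items: yields n dummy objects once."""
    for i in range(n):
        yield {"label": f"obj{i}"}


items = fake_object_items(3)
first_count = count_items(items)   # consumes the iterator
second_count = count_items(items)  # iterator is already exhausted

# If the objects are needed again after counting, materialize them once.
objects = list(fake_object_items(3))
object_count = len(objects)        # len() is fine on a list
```

Counting consumes the iterator, so any per-object work should happen either in the same loop as the count or on a list built once from the iterator.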
diff --git a/skills/deepstream/deepstream-dev/evals/evals.json b/skills/deepstream/deepstream-dev/evals/evals.json
deleted file mode 100644
index 91564ad8..00000000
--- a/skills/deepstream/deepstream-dev/evals/evals.json
+++ /dev/null
@@ -1,97 +0,0 @@
-[
-  {
-    "id": "deepstream-dev-001",
-    "question": "Using DeepStream SDK 9.0 and the pyservicemaker Python API, generate a pipeline that reads a local video file, runs primary inference with nvinfer using the ResNet18 TrafficCamNet detector shipped with DeepStream, draws bounding boxes with nvosdbin, and renders to the screen. The user did not ask for tracking or Kafka.",
-    "expected_skill": "deepstream-dev",
-    "expected_script": null,
-    "ground_truth": "A minimal pipeline using nvurisrcbin, nvstreammux, nvinfer, nvosdbin, and a platform-appropriate sink. It must avoid nvtracker, secondary GIEs, nvmsgbroker, and other optional components that were not requested.",
-    "expected_behavior": [
-      "Use nvurisrcbin as the source for a local video file.",
-      "Batch streams through nvstreammux.",
-      "Use the sink_%u request-pad template when linking sources into nvstreammux.",
-      "Reference the bundled ResNet18 TrafficCamNet ONNX model path.",
-      "Do not add nvtracker because tracking was not requested.",
-      "Do not add nvmsgbroker or Kafka messaging because messaging was not requested."
-    ]
-  },
-  {
-    "id": "deepstream-dev-002",
-    "question": "Build a DeepStream 9.0 pyservicemaker pipeline that ingests two RTSP cameras, runs primary detection, tracks objects across frames, displays the result in a tiled view, and publishes detection metadata to a Kafka broker. Cover the live-source and tee-split requirements.",
-    "expected_skill": "deepstream-dev",
-    "expected_script": null,
-    "ground_truth": "The pipeline uses nvurisrcbin for each RTSP source, sets live-source=1 on nvstreammux, includes nvtracker because tracking was requested, splits display and broker output with tee, sends metadata to nvmsgbroker, and sets async=0 on sinks.",
-    "expected_behavior": [
-      "Configure nvstreammux with live-source=1 for RTSP input.",
-      "Include nvtracker because the user explicitly requested tracking.",
-      "Use tee to feed both display and broker branches.",
-      "Use nvmsgbroker for Kafka publishing.",
-      "Set async=0 on sinks in the tee branches to avoid state-transition deadlocks.",
-      "Use sync=0 on the live renderer path."
-    ]
-  },
-  {
-    "id": "deepstream-dev-003",
-    "question": "Generate an nvinfer YAML config for a YOLOv11 model with 640x640 input exported from Ultralytics with dynamic=True. The model outputs a raw pre-NMS tensor of shape [batch, 84, 8400].",
-    "expected_skill": "deepstream-dev",
-    "expected_script": null,
-    "ground_truth": "The nvinfer YAML uses a property section, sets infer-dims=3;640;640 so TensorRT does not see dynamic -1 dimensions, and uses cluster-mode: 2 for DeepStream NMS because the output tensor is pre-NMS.",
-    "expected_behavior": [
-      "Use the property section for the nvinfer YAML.",
-      "Set infer-dims to 3;640;640 for the dynamic ONNX input shape.",
-      "Use cluster-mode: 2 because YOLOv11 output is pre-NMS.",
-      "Do not set is-classifier for an object detector."
-    ]
-  },
-  {
-    "id": "deepstream-dev-004",
-    "question": "Write a DeepStream pipeline that just plays a video file through inference and shows it on screen. Keep it as minimal as possible.",
-    "expected_skill": "deepstream-dev",
-    "expected_script": null,
-    "ground_truth": "A minimal video inference pipeline with nvurisrcbin, nvstreammux, nvinfer, nvosdbin, and a renderer. It should not add tracking, analytics, secondary classifiers, metadata brokers, or other optional elements that the user did not request.",
-    "expected_behavior": [
-      "Do not add nvtracker when tracking was not requested.",
-      "Do not add nvdsanalytics when line crossing, ROI, or analytics were not requested.",
-      "Do not add a secondary GIE when secondary classification was not requested.",
-      "Do not add nvmsgbroker or nvmsgconv when messaging was not requested.",
-      "Still include nvinfer for the requested inference stage."
-    ]
-  },
-  {
-    "id": "deepstream-dev-005",
-    "question": "My pyservicemaker probe runs len(frame.object_items) to count detections and I am installing my app inside a fresh python3 -m venv. It fails with ModuleNotFoundError: pyservicemaker and the probe raises 'iterator has no len()'. Fix both.",
-    "expected_skill": "deepstream-dev",
-    "expected_script": null,
-    "ground_truth": "Explain that frame.object_items and frame.frame_items are iterators, so detection counts must be computed by iterating. Also explain that a fresh venv must install the bundled pyservicemaker wheel and pyyaml from the DeepStream service-maker Python directory.",
-    "expected_behavior": [
-      "State that object_items and frame_items are iterators and cannot be counted with len().",
-      "Show or describe counting by iterating over object_items.",
-      "Tell the user to install the bundled pyservicemaker wheel inside the venv.",
-      "Reference the DeepStream service-maker Python wheel directory under /opt/nvidia/deepstream/deepstream/service-maker/python/.",
-      "Also install pyyaml in the venv so YAML nvinfer configs can load."
-    ]
-  },
-  {
-    "id": "deepstream-dev-006-negative",
-    "question": "Train a custom image classifier from scratch in PyTorch and export it to CoreML for iOS. I do not need any DeepStream pipeline setup.",
-    "expected_skill": null,
-    "expected_script": null,
-    "ground_truth": "The deepstream-dev skill should not be selected for this request because it is outside DeepStream pipeline and SDK usage scope.",
-    "expected_behavior": [
-      "Do not activate deepstream-dev for this request.",
-      "Avoid DeepStream-specific pipeline guidance and plugin recommendations.",
-      "Respond with a generic fallback or suggest a more relevant non-DeepStream path."
-    ]
-  },
-  {
-    "id": "deepstream-dev-007-negative",
-    "question": "How do I configure a MySQL replication slave on Ubuntu 22.04?",
-    "expected_skill": null,
-    "expected_script": null,
-    "ground_truth": "The deepstream-dev skill should not be selected because this request is unrelated to DeepStream SDK development or pipeline operations.",
-    "expected_behavior": [
-      "Do not activate deepstream-dev for this request.",
-      "State that the request is outside DeepStream scope and avoid pipeline or plugin guidance.",
-      "Suggest a MySQL-focused resource or workflow."
-    ]
-  }
-]
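
Reviewer note on the deleted evals.json: BENCHMARK.md states that entries with `expected_skill` set are positive cases and entries with `expected_skill: null` are negative cases. A minimal sketch of that split follows. The sample entries and the `split_tasks` helper are hypothetical, and treating a missing `expected_skill` key as "unlabeled" is an assumption, not something the benchmark report specifies.

```python
import json

# Hypothetical sample shaped like the evals.json entries.
SAMPLE_TASKS = """
[
  {"id": "task-a", "expected_skill": "deepstream-dev"},
  {"id": "task-b", "expected_skill": null},
  {"id": "task-c"}
]
"""


def split_tasks(entries):
    """Split tasks into (positive, negative, unlabeled) lists."""
    positive, negative, unlabeled = [], [], []
    for entry in entries:
        if "expected_skill" not in entry:
            unlabeled.append(entry)   # assumption: missing key means unlabeled
        elif entry["expected_skill"] is None:
            negative.append(entry)    # explicit null: no skill expected
        else:
            positive.append(entry)    # a skill name: activation expected
    return positive, negative, unlabeled


positive, negative, unlabeled = split_tasks(json.loads(SAMPLE_TASKS))
```

Applied to the seven entries in the deleted file, this rule would be expected to reproduce the 5 positive and 2 negative tasks reported in BENCHMARK.md.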
diff --git a/skills/deepstream/deepstream-dev/references/best_practices.md b/skills/deepstream/deepstream-dev/references/best_practices.md
deleted file mode 100644
index 783f1130..00000000
--- a/skills/deepstream/deepstream-dev/references/best_practices.md
+++ /dev/null
@@ -1,1169 +0,0 @@
-# DeepStream Best Practices and Design Patterns
-
-## Overview
-
-This document provides comprehensive best practices, design patterns, and optimization strategies for building production-grade DeepStream applications. These guidelines help ensure performance, reliability, maintainability, and scalability.
-
----
-
-## 1. Pipeline Design Patterns
-
-### Pattern 1: Modular Pipeline Construction
-
-**Best Practice**: Build pipelines in modular, reusable functions.
-
-```python
-def create_source_pipeline(video_path, num_streams=1):
-    """Create reusable source pipeline"""
-    sources = []
-    for i in range(num_streams):
-        sources.extend([
-            {"element": "filesrc", "name": f"src{i}", "props": {"location": video_path}},
-            {"element": "h264parse", "name": f"parser{i}"},
-            {"element": "nvv4l2decoder", "name": f"decoder{i}"}
-        ])
-    return sources
-
-def create_inference_pipeline(config_files):
-    """Create reusable inference pipeline"""
-    inference_elements = []
-    for idx, config in enumerate(config_files):
-        unique_id = idx + 1
-        inference_elements.append({
-            "element": "nvinfer",
-            "name": f"infer{idx}",
-            "props": {
-                "config-file-path": config,
-                "unique-id": unique_id
-            }
-        })
-    return inference_elements
-
-def build_complete_pipeline(video_path, infer_configs):
-    """Compose complete pipeline from modules"""
-    pipeline = Pipeline("modular-pipeline")
-    
-    # Add source modules
-    sources = create_source_pipeline(video_path)
-    for src_config in sources:
-        pipeline.add(src_config["element"], src_config["name"], src_config.get("props", {}))
-    
-    # Add inference modules
-    infer_elements = create_inference_pipeline(infer_configs)
-    for infer_config in infer_elements:
-        pipeline.add(infer_config["element"], infer_config["name"], infer_config.get("props", {}))
-    
-    # Link modules
-    # ... linking logic ...
-    
-    return pipeline
-```
-
-### Pattern 2: Configuration-Driven Pipelines
-
-**Best Practice**: Use YAML/JSON configuration files for pipeline definition.
-
-```python
-import yaml
-
-def load_pipeline_config(config_path):
-    """Load pipeline configuration from YAML"""
-    with open(config_path, 'r') as f:
-        return yaml.safe_load(f)
-
-def build_pipeline_from_config(config):
-    """Build pipeline from configuration"""
-    pipeline = Pipeline(config["pipeline"]["name"])
-    
-    # Add elements from config
-    for elem_config in config["pipeline"]["elements"]:
-        pipeline.add(
-            elem_config["type"],
-            elem_config["name"],
-            elem_config.get("properties", {})
-        )
-    
-    # Link elements from config
-    for link_group in config["pipeline"]["links"]:
-        pipeline.link(*link_group)
-    
-    return pipeline
-```
-
-### Pattern 3: Factory Pattern for Element Creation
-
-**Best Practice**: Use factory functions for element creation with validation.
-
-```python
-def create_decoder(platform="x86"):
-    """Factory function for decoder creation"""
-    decoder_props = {}
-    
-    if platform == "jetson":
-        decoder_props["device"] = "/dev/video0"
-    
-    return {
-        "element": "nvv4l2decoder",
-        "name": "decoder",
-        "props": decoder_props
-    }
-
-def create_sink(platform="x86", window_config=None):
-    """Factory function for sink creation"""
-    sink_type = "nv3dsink" if platform == "jetson" else "nveglglessink"
-    sink_props = {"sync": 1}
-    
-    if window_config:
-        sink_props.update(window_config)
-    
-    return {
-        "element": sink_type,
-        "name": "sink",
-        "props": sink_props
-    }
-```
-
-### Pattern 4: Strategy Pattern for Processing
-
-**Best Practice**: Use strategy pattern for different processing approaches.
-
-```python
-class ProcessingStrategy:
-    """Base class for processing strategies"""
-    def process(self, batch_meta):
-        raise NotImplementedError
-
-class DetectionStrategy(ProcessingStrategy):
-    """Strategy for object detection"""
-    def process(self, batch_meta):
-        # Detection-specific processing
-        pass
-
-class ClassificationStrategy(ProcessingStrategy):
-    """Strategy for classification"""
-    def process(self, batch_meta):
-        # Classification-specific processing
-        pass
-
-class PipelineBuilder:
-    """Pipeline builder with strategy pattern"""
-    def __init__(self, strategy: ProcessingStrategy):
-        self.strategy = strategy
-    
-    def build(self):
-        pipeline = Pipeline("strategy-pipeline")
-        # Build pipeline based on strategy
-        return pipeline
-```
-
----
-
-## 2. Performance Optimization
-
-### Optimization 1: Batch Size Tuning
-
-**Best Practice**: Optimize batch sizes based on GPU memory and model complexity.
-
-```python
-def calculate_optimal_batch_size(
-    num_streams,
-    gpu_memory_gb,
-    model_complexity="medium",
-    resolution=(1920, 1080)
-):
-    """
-    Calculate optimal batch size
-    
-    Args:
-        num_streams: Number of input streams
-        gpu_memory_gb: Available GPU memory in GB
-        model_complexity: "low", "medium", "high"
-        resolution: (width, height) tuple
-    """
-    # Rough per-stream memory estimate (GB); illustrative values, measure on your own hardware
-    base_memory = {
-        (1920, 1080): 1.0,
-        (1280, 720): 0.5,
-        (640, 480): 0.25
-    }.get(resolution, 1.0)
-    
-    # Model complexity multiplier
-    complexity_mult = {
-        "low": 1.0,
-        "medium": 1.5,
-        "high": 2.0
-    }.get(model_complexity, 1.5)
-    
-    # Calculate max batch size
-    memory_per_stream = base_memory * complexity_mult
-    max_batch = int(gpu_memory_gb / memory_per_stream)
-    
-    # Clamp to number of streams and use power of 2
-    optimal_batch = max(1, min(max_batch, num_streams))  # floor at 1 so bit_length() >= 1
-    optimal_batch = 2 ** (optimal_batch.bit_length() - 1)  # Round down to power of 2
-    
-    return max(1, optimal_batch)
-```
-
-### Optimization 2: Inference Precision Selection
-
-**Best Practice**: Use appropriate precision based on accuracy requirements.
-
-```python
-def get_inference_config(precision="fp16", model_path=None):
-    """
-    Get inference configuration with optimal precision
-    
-    Args:
-        precision: "fp32", "fp16", "int8"
-        model_path: Path to model file
-    """
-    precision_map = {
-        "fp32": 0,  # Highest accuracy, slowest
-        "int8": 1,  # Fastest, needs a calibration file
-        "fp16": 2   # Good balance (recommended)
-    }
-    
-    config = {
-        "network-mode": precision_map.get(precision, 2),  # nvinfer: 0=FP32, 1=INT8, 2=FP16
-        "model-engine-file": model_path
-    }
-    
-    if precision == "int8" and model_path:
-        config["int8-calib-file"] = model_path.replace(".engine", "_calibration.bin")
-    
-    return config
-```
-
-### Optimization 3: Pipeline Parallelism
-
-**Best Practice**: Run multiple pipelines on different GPUs for scalability.
-
-```python
-from multiprocessing import Process
-
-def run_pipeline_on_gpu(pipeline_config, gpu_id):
-    """Run pipeline on specific GPU"""
-    import os
-    os.environ["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
-    
-    pipeline = build_pipeline(pipeline_config)
-    pipeline.start().wait()
-
-def run_multi_gpu_pipelines(pipeline_configs):
-    """Run pipelines on multiple GPUs"""
-    processes = []
-    
-    for idx, config in enumerate(pipeline_configs):
-        gpu_id = idx % get_num_gpus()  # Distribute across GPUs
-        process = Process(
-            target=run_pipeline_on_gpu,
-            args=(config, gpu_id)
-        )
-        process.start()
-        processes.append(process)
-    
-    # Wait for all processes
-    for process in processes:
-        process.join()
-```
-
-### Optimization 4: Memory Pool Configuration
-
-**Best Practice**: Configure appropriate buffer pool sizes.
-
-```python
-def configure_buffer_pools(pipeline, num_streams, batch_size):
-    """Configure buffer pools for optimal performance"""
-    # Calculate buffer pool size
-    # Rule: pool_size >= (num_streams / batch_size) * 2
-    pool_size = max(4, (num_streams // batch_size) * 2)
-    
-    # Configure queues
-    for elem in pipeline.elements:
-        if elem.name.startswith("queue"):
-            elem.set_property("max-size-buffers", pool_size * 10)
-            elem.set_property("max-size-time", 0)  # Unlimited time
-            elem.set_property("leaky", 2)  # Leaky downstream
-```
-
----
-
-## 3. Memory Management
-
-### Best Practice 1: Proper Cleanup
-
-```python
-class ManagedPipeline:
-    """Pipeline with proper resource management"""
-    def __init__(self, pipeline):
-        self.pipeline = pipeline
-        self.probes = []
-    
-    def add_probe(self, element_name, probe):
-        """Add probe and track for cleanup"""
-        self.pipeline.attach(element_name, probe)
-        self.probes.append(probe)
-    
-    def start(self):
-        """Start pipeline"""
-        self.pipeline.start()
-    
-    def stop(self):
-        """Stop pipeline and cleanup"""
-        self.pipeline.stop()
-        
-        # Cleanup probes: flush pending data before closing
-        for probe in self.probes:
-            if hasattr(probe, 'flush'):
-                probe.flush()
-            if hasattr(probe, 'close'):
-                probe.close()
-    
-    def __enter__(self):
-        self.start()
-        return self
-    
-    def __exit__(self, exc_type, exc_val, exc_tb):
-        self.stop()
-```
-
-### Best Practice 2: Memory Monitoring
-
-```python
-import pynvml
-
-class MemoryMonitor:
-    """Monitor GPU memory usage"""
-    def __init__(self):
-        pynvml.nvmlInit()
-        self.handle = pynvml.nvmlDeviceGetHandleByIndex(0)
-    
-    def get_memory_info(self):
-        """Get current GPU memory usage"""
-        info = pynvml.nvmlDeviceGetMemoryInfo(self.handle)
-        return {
-            "total": info.total / (1024**3),  # GB
-            "used": info.used / (1024**3),     # GB
-            "free": info.free / (1024**3)     # GB
-        }
-    
-    def check_memory_pressure(self, threshold=0.9):
-        """Check if memory usage exceeds threshold"""
-        info = self.get_memory_info()
-        usage_ratio = info["used"] / info["total"]
-        return usage_ratio > threshold
-
-# Usage in pipeline
-monitor = MemoryMonitor()
-if monitor.check_memory_pressure():
-    print("Warning: High GPU memory usage!")
-```
-
----
-
-## 4. Error Handling and Resilience
-
-### Pattern 1: Retry Logic
-
-```python
-import time
-from functools import wraps
-
-def retry(max_attempts=3, delay=1.0, backoff=2.0):
-    """Retry decorator with exponential backoff"""
-    def decorator(func):
-        @wraps(func)
-        def wrapper(*args, **kwargs):
-            attempts = 0
-            current_delay = delay
-            
-            while attempts < max_attempts:
-                try:
-                    return func(*args, **kwargs)
-                except Exception as e:
-                    attempts += 1
-                    if attempts >= max_attempts:
-                        raise
-                    print(f"Attempt {attempts} failed: {e}. Retrying in {current_delay}s...")
-                    time.sleep(current_delay)
-                    current_delay *= backoff
-        return wrapper
-    return decorator
-
-@retry(max_attempts=3, delay=1.0)
-def initialize_kafka_producer(config):
-    """Initialize Kafka producer with retry"""
-    return KafkaProducer(bootstrap_servers=config["servers"])
-```
-
-### Pattern 2: Circuit Breaker
-
-```python
-class CircuitBreaker:
-    """Circuit breaker pattern for external services"""
-    def __init__(self, failure_threshold=5, timeout=60):
-        self.failure_threshold = failure_threshold
-        self.timeout = timeout
-        self.failure_count = 0
-        self.last_failure_time = None
-        self.state = "closed"  # closed, open, half_open
-    
-    def call(self, func, *args, **kwargs):
-        """Execute function with circuit breaker"""
-        if self.state == "open":
-            if time.time() - self.last_failure_time > self.timeout:
-                self.state = "half_open"
-            else:
-                raise Exception("Circuit breaker is OPEN")
-        
-        try:
-            result = func(*args, **kwargs)
-            self.on_success()
-            return result
-        except Exception as e:
-            self.on_failure()
-            raise
-    
-    def on_success(self):
-        """Reset on success"""
-        self.failure_count = 0
-        self.state = "closed"
-    
-    def on_failure(self):
-        """Track failures"""
-        self.failure_count += 1
-        self.last_failure_time = time.time()
-        
-        if self.failure_count >= self.failure_threshold:
-            self.state = "open"
-```
-
-### Pattern 3: Graceful Shutdown
-
-```python
-import signal
-import time
-
-class GracefulShutdown:
-    """Handle graceful shutdown signals"""
-    def __init__(self):
-        self.shutdown_requested = False
-        signal.signal(signal.SIGINT, self._signal_handler)
-        signal.signal(signal.SIGTERM, self._signal_handler)
-    
-    def _signal_handler(self, signum, frame):
-        """Handle shutdown signals"""
-        print(f"\nReceived signal {signum}. Initiating graceful shutdown...")
-        self.shutdown_requested = True
-    
-    def is_shutdown_requested(self):
-        """Check if shutdown was requested"""
-        return self.shutdown_requested
-
-# Usage
-shutdown_handler = GracefulShutdown()
-
-def run_pipeline_with_graceful_shutdown(pipeline):
-    """Run pipeline with graceful shutdown handling"""
-    try:
-        pipeline.start()
-        
-        while not shutdown_handler.is_shutdown_requested():
-            time.sleep(0.1)
-            # Check pipeline state, process messages, etc.
-        
-        print("Shutting down pipeline...")
-        pipeline.stop()
-    except Exception as e:
-        print(f"Error: {e}")
-        pipeline.stop()
-```
-
----
-
-## 5. Code Organization and Maintainability
-
-### Pattern 1: Separation of Concerns
-
-```python
-# config.py - Configuration management
-class PipelineConfig:
-    def __init__(self, config_path):
-        self.config = self._load_config(config_path)
-    
-    def get_source_config(self):
-        return self.config["source"]
-    
-    def get_inference_config(self):
-        return self.config["inference"]
-
-# pipeline_builder.py - Pipeline construction
-class PipelineBuilder:
-    def __init__(self, config: PipelineConfig):
-        self.config = config
-    
-    def build(self):
-        pipeline = Pipeline("main")
-        # Build pipeline from config
-        return pipeline
-
-# processors.py - Processing logic
-class MetadataProcessor:
-    def process(self, batch_meta):
-        # Processing logic
-        pass
-
-# main.py - Application entry point
-def main():
-    config = PipelineConfig("config.yml")
-    builder = PipelineBuilder(config)
-    pipeline = builder.build()
-    pipeline.start().wait()
-```
-
-### Pattern 2: Dependency Injection
-
-```python
-class PipelineService:
-    """Service class with dependency injection"""
-    def __init__(self, 
-                 source_factory,
-                 inference_factory,
-                 sink_factory,
-                 processor_factory):
-        self.source_factory = source_factory
-        self.inference_factory = inference_factory
-        self.sink_factory = sink_factory
-        self.processor_factory = processor_factory
-    
-    def create_pipeline(self):
-        """Create pipeline using injected factories"""
-        pipeline = Pipeline("service-pipeline")
-        
-        # Use factories to create elements
-        source = self.source_factory.create()
-        inference = self.inference_factory.create()
-        sink = self.sink_factory.create()
-        
-        # Build pipeline
-        # ...
-        
-        return pipeline
-```
-
----
-
-## 6. Testing Strategies
-
-### Unit Testing
-
-```python
-import unittest
-from unittest.mock import Mock, patch
-
-class TestMetadataProcessor(unittest.TestCase):
-    def setUp(self):
-        self.processor = MetadataProcessor()
-    
-    def test_process_empty_batch(self):
-        """Test processing empty batch"""
-        batch_meta = Mock()
-        batch_meta.frame_items = []
-        
-        # Should not raise exception
-        self.processor.process(batch_meta)
-    
-    def test_process_with_objects(self):
-        """Test processing batch with objects"""
-        batch_meta = Mock()
-        frame_meta = Mock()
-        frame_meta.object_items = [Mock(), Mock()]
-        batch_meta.frame_items = [frame_meta]
-        
-        self.processor.process(batch_meta)
-        # Assert expected behavior
-```
-
-### Integration Testing
-
-```python
-class TestPipelineIntegration(unittest.TestCase):
-    def test_pipeline_creation(self):
-        """Test pipeline creation"""
-        config = PipelineConfig("test_config.yml")
-        builder = PipelineBuilder(config)
-        pipeline = builder.build()
-        
-        self.assertIsNotNone(pipeline)
-        self.assertEqual(len(pipeline.elements), expected_count)
-    
-    def test_pipeline_linking(self):
-        """Test pipeline element linking"""
-        pipeline = create_test_pipeline()
-        
-        # Verify links are correct
-        # ...
-```
-
-### Performance Testing
-
-```python
-import time
-
-class PerformanceTest:
-    def test_fps_measurement(self, pipeline, duration=10):
-        """Measure FPS of pipeline"""
-        start_time = time.time()
-        frame_count = 0
-        
-        def frame_callback(batch_meta):
-            nonlocal frame_count
-            frame_count += sum(1 for _ in batch_meta.frame_items)  # works even if frame_items is an iterator
-        
-        pipeline.attach("infer", Probe("fps", frame_callback))
-        pipeline.start()
-        
-        time.sleep(duration)
-        pipeline.stop()
-        
-        elapsed = time.time() - start_time
-        fps = frame_count / elapsed
-        
-        print(f"Measured FPS: {fps:.2f}")
-        return fps
-```
-
----
-
-## 7. Deployment Considerations
-
-### Configuration Management
-
-```python
-import os
-from pathlib import Path
-
-class EnvironmentConfig:
-    """Load configuration based on environment"""
-    def __init__(self):
-        self.env = os.getenv("DEEPSTREAM_ENV", "development")
-        self.config_dir = Path("/etc/deepstream") / self.env
-    
-    def get_config_path(self, config_name):
-        """Get configuration file path"""
-        return self.config_dir / f"{config_name}.yml"
-    
-    def get_model_path(self, model_name):
-        """Get model file path"""
-        return Path("/opt/models") / self.env / model_name
-```
-
-### Logging Best Practices
-
-```python
-import logging
-import sys
-
-def setup_logging(level=logging.INFO, log_file=None):
-    """Setup logging configuration"""
-    handlers = [logging.StreamHandler(sys.stdout)]
-    
-    if log_file:
-        handlers.append(logging.FileHandler(log_file))
-    
-    logging.basicConfig(
-        level=level,
-        format='%(asctime)s - %(name)s - %(levelname)s - %(message)s',
-        handlers=handlers
-    )
-
-# Usage
-logger = logging.getLogger(__name__)
-logger.info("Pipeline started")
-logger.error("Error occurred", exc_info=True)
-```
-
----
-
-## 8. Security Best Practices
-
-### Secure Configuration
-
-```python
-import os
-from cryptography.fernet import Fernet
-
-class SecureConfig:
-    """Handle sensitive configuration securely"""
-    def __init__(self):
-        self.key = os.getenv("CONFIG_ENCRYPTION_KEY")
-        self.cipher = Fernet(self.key) if self.key else None
-    
-    def get_secret(self, secret_name):
-        """Get decrypted secret"""
-        encrypted = os.getenv(secret_name)
-        if self.cipher and encrypted:
-            return self.cipher.decrypt(encrypted.encode()).decode()
-        return encrypted
-```
-
-### Input Validation
-
-```python
-def validate_video_path(path):
-    """Validate video file path"""
-    if not os.path.exists(path):
-        raise ValueError(f"Video file not found: {path}")
-    
-    allowed_extensions = ('.h264', '.h265', '.mp4', '.mkv')
-    if not path.lower().endswith(allowed_extensions):  # case-insensitive match
-        raise ValueError(f"Unsupported video format: {path}")
-    
-    return path
-
-def validate_config_file(config_path):
-    """Validate configuration file"""
-    if not os.path.exists(config_path):
-        raise ValueError(f"Config file not found: {config_path}")
-    
-    # Additional validation
-    # ...
-    
-    return config_path
-```
-
----
-
-## 9. Monitoring and Observability
-
-### Metrics Collection
-
-```python
-from prometheus_client import Counter, Histogram, Gauge
-
-# Define metrics
-frames_processed = Counter('deepstream_frames_processed_total', 'Total frames processed')
-inference_latency = Histogram('deepstream_inference_latency_seconds', 'Inference latency')
-gpu_memory_usage = Gauge('deepstream_gpu_memory_bytes', 'GPU memory usage')
-
-class MetricsCollector(BatchMetadataOperator):
-    """Collect metrics from pipeline"""
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            frames_processed.inc()
-            
-            # Record inference latency if available
-            if hasattr(frame_meta, 'inference_time'):
-                inference_latency.observe(frame_meta.inference_time)
-```
-
----
-
-## 10. Common Anti-Patterns to Avoid
-
-### Anti-Pattern 1: Blocking Operations in Probes
-
-**Bad**:
-```python
-class BadProbe(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        # Blocking network call in probe
-        response = requests.get("http://api.example.com/data")
-        # This blocks the pipeline!
-```
-
-**Good**:
-```python
-import queue
-import threading
-
-class GoodProbe(BatchMetadataOperator):
-    def __init__(self):
-        super().__init__()
-        self.queue = queue.Queue()
-        self.worker = threading.Thread(target=self._process_queue, daemon=True)  # daemon: do not block exit
-        self.worker.start()
-    
-    def handle_metadata(self, batch_meta):
-        # Non-blocking: add to queue
-        self.queue.put(batch_meta)
-    
-    def _process_queue(self):
-        while True:
-            batch_meta = self.queue.get()
-            # Process asynchronously
-            response = requests.get("http://api.example.com/data")
-```
-
-### Anti-Pattern 2: Ignoring Memory Limits
-
-**Bad**:
-```python
-# No batch size limits
-pipeline.add("nvstreammux", "mux", {"batch-size": 100})  # Too large!
-```
-
-**Good**:
-```python
-# Calculate optimal batch size
-optimal_batch = calculate_optimal_batch_size(num_streams, gpu_memory)
-pipeline.add("nvstreammux", "mux", {"batch-size": optimal_batch})
-```
-
-### Anti-Pattern 3: Not Handling Errors
-
-**Bad**:
-```python
-pipeline.start().wait()  # No error handling
-```
-
-**Good**:
-```python
-try:
-    pipeline.start().wait()
-except Exception as e:
-    logger.error(f"Pipeline error: {e}", exc_info=True)
-    pipeline.stop()
-    raise
-```
-
-### Anti-Pattern 4: Missing async=0 on All Sinks (Tee/Dynamic Sources)
-
-**CRITICAL**: When using `tee` to split a pipeline into multiple branches OR using dynamic sources (nvmultiurisrcbin), **ALL sink elements** must have `async: 0`. This is the most common cause of pipelines stuck in PAUSED state.
-
-**Bad** - Pipeline stuck in PAUSED:
-```python
-# ❌ WRONG - Only display sink has async=0, Kafka sink is missing it
-# Pipeline will be STUCK IN PAUSED STATE!
-
-# Tee split
-pipeline.add("tee", "tee")
-
-# Metadata branch - MISSING async=0!
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    "conn-str": "localhost;9092",
-    "sync": 0,
-    # async: 0 is MISSING! Pipeline will hang!
-})
-
-# Video branch - has async=0 but it's not enough
-pipeline.add("nveglglessink", "sink", {
-    "sync": 0,
-    "async": 0  # This alone is NOT enough - ALL sinks need it!
-})
-```
-
-**Good** - All sinks have async=0:
-```python
-# ✅ CORRECT - ALL sinks have async=0
-
-# Tee split
-pipeline.add("tee", "tee")
-
-# Metadata branch - Kafka sink with async=0
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    "conn-str": "localhost;9092",
-    "sync": 0,
-    "async": 0  # CRITICAL: Required on ALL sinks!
-})
-
-# Video branch - display sink with async=0
-pipeline.add("nveglglessink", "sink", {
-    "sync": 0,
-    "qos": 0,
-    "async": 0  # CRITICAL: Required on ALL sinks!
-})
-```
-
-**Symptoms of this bug**:
-- Camera shows "added successfully" in logs
-- Pipeline elements transition to READY, then PAUSED
-- Pipeline never transitions to PLAYING
-- No video display, no data flowing
-- No error messages (silent failure)
-
-**Rule**: When using `tee` or dynamic sources, ALWAYS set `async: 0` on EVERY sink element in the pipeline.
-
-### Anti-Pattern 5: Using queue.Queue with multiprocessing.Process
-
-**CRITICAL**: This is a common and subtle bug that causes data loss!
-
-When using `multiprocessing.Process` to run pipelines in separate processes, you MUST use `multiprocessing.Queue` for inter-process communication. A regular `queue.Queue` (from the `queue` module) only works within a single process.
-
-**Bad** - Data silently lost:
-```python
-from multiprocessing import Process
-from queue import Queue  # WRONG! This is a threading queue
-
-class MultiStreamProcessor:
-    def __init__(self):
-        # This queue WILL NOT work across process boundaries!
-        self.batch_queue = Queue()  # BAD: queue.Queue is thread-only
-    
-    def start(self, use_multiprocessing=True):
-        for stream in self.streams:
-            if use_multiprocessing:
-                # Child process gets its own COPY of the queue (fork start method)
-                # Any data put into it never reaches the parent!
-                process = Process(
-                    target=self._run_pipeline,
-                    args=(stream, self.batch_queue)
-                )
-                process.start()
-```
-
-**Good** - Use multiprocessing.Queue for inter-process communication:
-```python
-from multiprocessing import Process, Queue as MPQueue  # Correct!
-from queue import Queue as ThreadQueue
-
-class MultiStreamProcessor:
-    def __init__(self, use_multiprocessing=True):
-        # Choose the right queue type based on usage
-        if use_multiprocessing:
-            self.batch_queue = MPQueue()  # CORRECT: multiprocessing.Queue
-        else:
-            self.batch_queue = ThreadQueue()  # For single-process/threading
-    
-    def start(self, use_multiprocessing=True):
-        for stream in self.streams:
-            if use_multiprocessing:
-                # multiprocessing.Queue properly shares data across processes
-                process = Process(
-                    target=self._run_pipeline,
-                    args=(stream, self.batch_queue)
-                )
-                process.start()
-```
-
-**Alternative - Use threading instead of multiprocessing**:
-```python
-import threading
-from queue import Queue  # OK for threading
-
-class MultiStreamProcessor:
-    def __init__(self):
-        self.batch_queue = Queue()  # OK: queue.Queue for threads
-    
-    def start(self):
-        for stream in self.streams:
-            # Threads share memory, so queue.Queue works fine
-            thread = threading.Thread(
-                target=self._run_pipeline,
-                args=(stream, self.batch_queue)
-            )
-            thread.start()
-```
-
-**Key Rules**:
-1. `queue.Queue` → Use with `threading.Thread` (same process)
-2. `multiprocessing.Queue` → Use with `multiprocessing.Process` (cross-process)
-3. When in doubt, set `use_multiprocessing=False` and use threads
-4. Always add debug logs to verify data flows through queues correctly
-
-**Symptoms of this bug**:
-- Pipeline appears to run normally
-- No error messages
-- Downstream processing (e.g., VLM, Kafka) never receives data
-- Statistics show 0 batches/messages processed
-
----
-
-## 11. Common Pitfalls and Code Generation Errors
-
-This section documents common mistakes made when generating DeepStream code, so they can be avoided in the future.
-
-### Pitfall 1: Using len() on Metadata Iterators
-
-**Problem**: `frame_meta.object_items`, `frame_meta.tensor_items`, and `frame_meta.user_items` return **iterators**, not lists.
-
-**Error**:
-```
-TypeError: object of type 'iterator' has no len()
-```
-
-**Bad Code**:
-```python
-# ❌ WRONG - Causes crash
-count = len(frame_meta.object_items)
-
-# ❌ WRONG - Second loop is empty (iterator already consumed)
-for obj in frame_meta.object_items:
-    process(obj)
-for obj in frame_meta.object_items:
-    count += 1
-```
-
-**Correct Code**:
-```python
-# ✅ CORRECT - Count while iterating
-obj_count = 0
-for obj in frame_meta.object_items:
-    obj_count += 1
-    process(obj)
-```
-
-### Pitfall 2: Incorrect nvinfer Configuration Syntax
-
-**Problem**: nvinfer supports **both YAML and INI-style formats**, but the syntax must be correct for each format.
-
-**Error**:
-```
-Configuration file parsing failed
-```
-
-**Common Mistakes**:
-```yaml
-# ❌ WRONG - Incorrect section name (should be 'property', not 'model')
-model:
-  model-engine-file: /path/to/model.engine
-  batch-size: 1
-
-# ❌ WRONG - Mixing formats (YAML syntax in .txt file or vice versa)
-```
-
-**Correct YAML Config** (`.yml`):
-```yaml
-# ✅ CORRECT YAML format
-property:
-  gpu-id: 0
-  onnx-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-  labelfile-path: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-  batch-size: 1
-  network-mode: 2
-  num-detected-classes: 4
-  process-mode: 1
-  cluster-mode: 2
-
-class-attrs-all:
-  topk: 20
-  pre-cluster-threshold: 0.2
-```
-
-**Correct INI-style Config** (`.txt`):
-```ini
-# ✅ CORRECT INI-style format
-[property]
-gpu-id=0
-onnx-file=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-labelfile-path=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-batch-size=1
-network-mode=2
-num-detected-classes=4
-process-mode=1
-cluster-mode=2
-
-[class-attrs-all]
-topk=20
-pre-cluster-threshold=0.2
-```
-
-**Key Rules**:
-- YAML format: Use `property:` (no brackets), `key: value` with colon+space
-- INI format: Use `[property]` (with brackets), `key=value` with equals sign
-- Section must be named `property` (not `model` or other names)
-- Don't mix formats in the same file
-
-### Pitfall 3: Using Wrong Model (ResNet10 vs ResNet18)
-
-**Problem**: The DeepStream samples use the **ResNet18** TrafficCamNet model, not ResNet10.
-
-**Correct Model Paths**:
-```
-/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/
-├── resnet18_trafficcamnet_pruned.onnx    # ✅ Use this ONNX model
-├── labels.txt                              # Class labels
-└── cal_trt.bin                            # INT8 calibration (optional)
-```
-
-**In nvinfer config**:
-```ini
-[property]
-onnx-file=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-labelfile-path=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-```
-
-### Pitfall 4: nvv4l2decoder Output Format Assumption
-
-**Fact**: `nvv4l2decoder` outputs `video/x-raw(memory:NVMM)` - already in GPU memory format.
-
-**Common Mistake**: Adding unnecessary `nvvideoconvert` after decoder.
-
-**Unnecessary Code**:
-```python
-# ❌ UNNECESSARY - nvv4l2decoder already outputs NVMM format
-pipeline.add("nvv4l2decoder", "decoder")
-pipeline.add("nvvideoconvert", "conv")  # Not needed!
-pipeline.add("nvstreammux", "mux")
-```
-
-**Correct Code**:
-```python
-# ✅ CORRECT - Direct connection, no converter needed
-pipeline.add("nvv4l2decoder", "decoder")
-pipeline.add("nvstreammux", "mux")
-pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-```
-
-### Pitfall 5: Built-in Probe Usage
-
-**Fact**: `measure_fps_probe` is a valid built-in probe, but must be attached to the correct element.
-
-**Correct Usage**:
-```python
-# Attach to inference element for FPS measurement
-pipeline.attach("infer", "measure_fps_probe", "fps-probe")
-```
-
-**If probe attachment fails**, implement custom FPS measurement:
-```python
-class FPSCounter(BatchMetadataOperator):
-    def __init__(self):
-        super().__init__()
-        self.start_time = None
-        self.frame_count = 0
-    
-    def handle_metadata(self, batch_meta):
-        if self.start_time is None:
-            self.start_time = time.time()
-        self.frame_count += 1  # counts batches; equals frames only when batch-size is 1
-        elapsed = time.time() - self.start_time
-        if elapsed > 0 and self.frame_count % 30 == 0:
-            print(f"FPS: {self.frame_count / elapsed:.2f}")
-
-pipeline.attach("infer", Probe("fps-counter", FPSCounter()))
-```
-
----
-
-## Summary
-
-Following these best practices and patterns will help you build robust, performant, and maintainable DeepStream applications. Key takeaways:
-
-1. **Design for modularity**: Use patterns like Factory, Strategy, and Dependency Injection
-2. **Optimize performance**: Tune batch sizes, use appropriate precision, enable parallelism
-3. **Manage resources**: Proper cleanup, memory monitoring, buffer pool configuration
-4. **Handle errors gracefully**: Retry logic, circuit breakers, graceful shutdown
-5. **Test thoroughly**: Unit tests, integration tests, performance tests
-6. **Monitor and observe**: Metrics collection, logging, health checks
-7. **Secure your application**: Input validation, secure configuration, access control
-8. **Use correct Queue types**: 
-   - `queue.Queue` → for threading (same process)
-   - `multiprocessing.Queue` → for multiprocessing (cross-process)
-   - **NEVER** use `queue.Queue` with `multiprocessing.Process` - data will be silently lost!
-9. **Set async=0 on ALL sinks when using tee or dynamic sources**:
-   - When pipeline uses `tee` to split into multiple branches, ALL sink elements need `async: 0`
-   - When using dynamic sources (nvmultiurisrcbin), ALL sinks need `async: 0`
-   - **Symptom if missing**: Pipeline stuck in PAUSED state, no video/data flows
-   - This applies to display sinks, Kafka sinks, file sinks - ALL sinks!
-10. **Avoid common code generation pitfalls**:
-    - **NEVER** use `len()` on metadata iterators (`object_items`, `tensor_items`, `user_items`)
-    - **USE** correct syntax for nvinfer config (YAML: `property:` with `: `, or INI: `[property]` with `=`)
-    - **USE** ResNet18 model (`resnet18_trafficcamnet_pruned.onnx`) from DeepStream samples
-    - **KNOW** that `nvv4l2decoder` outputs NVMM format (no converter needed before nvstreammux)
-
-These practices ensure your DeepStream applications are production-ready and scalable.
-
diff --git a/skills/deepstream/deepstream-dev/references/buffer_apis.md b/skills/deepstream/deepstream-dev/references/buffer_apis.md
deleted file mode 100644
index 0c169d46..00000000
--- a/skills/deepstream/deepstream-dev/references/buffer_apis.md
+++ /dev/null
@@ -1,1670 +0,0 @@
-# Buffer Provider and Retriever APIs
-
-## Overview
-
-DeepStream Service Maker provides two complementary APIs for custom data injection and extraction:
-
-1. **Media Extractor (BufferProvider/Feeder)** - Inject custom data INTO pipelines
-2. **Frame Selector (BufferRetriever/Receiver)** - Extract data FROM pipelines
-
-## When to Use Each API
-
-### Use BufferProvider/Feeder When:
-- You need to inject custom video frames from non-standard sources
-- You want to generate synthetic video data for testing
-- You have pre-processed frames to feed into the pipeline
-- You need to implement custom video sources beyond file/RTSP
-- You want to transfer frames FROM another pipeline or system INTO DeepStream
-
-**See**: Part 1 below for detailed API reference and implementation patterns.
-
-### Use BufferRetriever/Receiver When:
-- You need to extract frames for custom processing outside the pipeline
-- You want to save specific frames to disk or external storage
-- You need to collect inference results with frame data
-- You want to implement custom frame selection logic
-- You want to transfer frames FROM DeepStream TO another pipeline or system
-
-**See**: Part 2 below for detailed API reference and implementation patterns.
-
-## Common Patterns
-
-### Pattern 1: Pipeline-to-Pipeline Transfer
-Transfer frames between two DeepStream pipelines.
-
-```
-Pipeline A -> BufferRetriever -> Queue -> BufferProvider -> Pipeline B
-```
-
-**Use Case**: Process video in one pipeline, then re-process results in another
-
-**Details**: See Part 1 Pattern 3 (Frame Queue Injection) and Part 2 Pattern 2 (Frame Queue Transfer)
-
-### Pattern 2: Custom Video Source
-Read from custom camera or video source.
-
-```
-Custom Source -> BufferProvider -> appsrc -> DeepStream Pipeline
-```
-
-**Use Case**: Integrate non-standard cameras or video sources
-
-**Details**: See Part 1 Pattern 1 (File-Based Custom Video Source)
-
-### Pattern 3: Frame Extraction
-Extract frames from pipeline for archival or analysis.
-
-```
-DeepStream Pipeline -> appsink -> BufferRetriever -> Save/Process
-```
-
-**Use Case**: Save frames at intervals, capture detection screenshots
-
-**Details**: See Part 2 Pattern 1 (Frame Extraction and Saving)
-
-### Pattern 4: Synthetic Data Generation
-Generate test data for pipeline validation.
-
-```
-Synthetic Generator -> BufferProvider -> appsrc -> DeepStream Pipeline
-```
-
-**Use Case**: Testing, simulation, validation
-
-**Details**: See Part 1 Pattern 2 (Synthetic Frame Generation)
-
-### Pattern 5: Selective Frame Capture
-Capture frames based on inference results.
-
-```
-Pipeline -> Inference -> Metadata Probe -> Trigger -> BufferRetriever -> Save
-```
-
-**Use Case**: Save frames only when specific objects are detected
-
-**Details**: See Part 2 Pattern 3 (Selective Frame Capture)
-
-## API Comparison
-
-| Feature | BufferProvider/Feeder | BufferRetriever/Receiver |
-|---------|----------------------|--------------------------|
-| **Direction** | Data IN (injection) | Data OUT (extraction) |
-| **GStreamer Element** | appsrc | appsink |
-| **Signal** | need-data/enough-data | new-sample |
-| **Method to Implement** | `generate(size)` | `consume(buffer)` |
-| **Return Value** | Buffer object | int (1=success, 0=error, -1=fatal error) |
-| **EOS Handling** | Return empty Buffer() | Return -1 |
-| **Properties** | format, width, height, framerate, device | None (configured on appsink) |
-
-## Quick Start Examples
-
-### Inject Custom Frames (BufferProvider)
-
-```python
-from pyservicemaker import Pipeline, BufferProvider, Feeder, as_tensor, ColorFormat, Buffer
-import torch  # pip install torch torchvision (not in base DS container)
-
-class MyProvider(BufferProvider):
-    def __init__(self):
-        super().__init__()
-        self.format = "RGB"
-        self.width = 1280
-        self.height = 720
-        self.framerate = 30
-        self.device = 'gpu'
-
-    def generate(self, size):
-        # Your custom frame generation logic
-        frame = get_custom_frame()  # Your function
-        if frame is None:
-            return Buffer()  # EOS
-
-        torch_tensor = torch.from_numpy(frame).cuda()
-        ds_tensor = as_tensor(torch_tensor, "HWC")
-        return ds_tensor.wrap(ColorFormat.RGB)
-
-pipeline = Pipeline("inject-pipeline")
-caps = "video/x-raw(memory:NVMM), format=RGB, width=1280, height=720, framerate=30/1"
-pipeline.add("appsrc", "src", {"caps": caps, "do-timestamp": True})
-# ... add more elements ...
-pipeline.attach("src", Feeder("feeder", MyProvider()), tips="need-data/enough-data")
-pipeline.start().wait()
-```
-
-### Extract Frames (BufferRetriever)
-
-```python
-from pyservicemaker import Pipeline, BufferRetriever, Receiver
-import torch  # pip install torch torchvision (not in base DS container)
-
-class MyRetriever(BufferRetriever):
-    def __init__(self):
-        super().__init__()
-        self.count = 0
-
-    def consume(self, buffer):
-        tensor = buffer.extract(0).clone()  # Always clone!
-        torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-
-        # Your custom processing logic
-        process_frame(torch_tensor)  # Your function
-
-        self.count += 1
-        return 1  # Success
-
-pipeline = Pipeline("extract-pipeline")
-# ... add source and processing elements ...
-pipeline.add("appsink", "sink", {"emit-signals": True, "sync": False})
-pipeline.attach("sink", Receiver("receiver", MyRetriever()), tips="new-sample")
-pipeline.start().wait()
-```
-
-## Key Concepts
-
-### BufferProvider/Feeder
-- **Purpose**: Custom data injection
-- **Element**: Works with `appsrc`
-- **Flow**: Your code -> BufferProvider -> Pipeline
-- **Control**: Pipeline pulls data when needed
-- **Properties**: Must set format, width, height, framerate, device
-
-### BufferRetriever/Receiver
-- **Purpose**: Custom data extraction
-- **Element**: Works with `appsink`
-- **Flow**: Pipeline -> BufferRetriever -> Your code
-- **Control**: Pipeline pushes data when available
-- **Critical**: Always call `.clone()` on extracted tensors
-
-## Best Practices Summary
-
-### For BufferProvider:
-1. Set all required properties (format, width, height, framerate, device)
-2. Return empty `Buffer()` to signal end of stream
-3. Use GPU memory (`device='gpu'`) for best performance
-4. Set `do-timestamp=True` on appsrc for proper sync
-5. Use `tips="need-data/enough-data"` when attaching
-
-### For BufferRetriever:
-1. **Always** call `.clone()` on extracted tensors
-2. Set `emit-signals=True` on appsink
-3. Use `tips="new-sample"` when attaching
-4. Return 1 for success, 0 for error (continue), -1 for fatal error
-5. Set `sync=False` for non-real-time extraction
-
-## Common Pitfalls
-
-### BufferProvider Issues:
-- Forgetting to set format properties -> Pipeline fails to negotiate caps
-- Not returning empty Buffer() for EOS -> Pipeline hangs
-- Mismatched caps between provider and appsrc -> Format errors
-
-### BufferRetriever Issues:
-- Not calling `.clone()` -> Data corruption in async processing
-- Forgetting `emit-signals=True` -> No frames received
-- Slow processing in consume() -> Frame drops
-- Not handling exceptions -> Pipeline crashes
-
-## Performance Tips
-
-### BufferProvider:
-- Use GPU memory for zero-copy transfers
-- Pre-allocate buffers when possible
-- Avoid CPU<->GPU transfers in hot path
-- Consider buffer pooling for high frame rates
-
-### BufferRetriever:
-- Set `sync=False` if you don't need real-time pacing
-- Process frames asynchronously if possible
-- Limit buffer accumulation to prevent memory issues
-- Use batch processing when extracting multiple streams
-
-## Example Applications
-
-The service-maker package includes sample applications demonstrating these APIs:
-
-**Pipeline API Examples**:
-- `/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_appsrc_test_app/`
-
-**Flow API Examples**:
-- `/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/flow_api/deepstream_appsrc_test_app/`
-
-## Goal-Based API Selection
-
-| Goal | Use This API | Section |
-|------|-------------|---------|
-| Inject custom frames | BufferProvider/Feeder | Part 1 |
-| Extract frames | BufferRetriever/Receiver | Part 2 |
-| Pipeline-to-pipeline transfer | Both | Part 1 Pattern 3, Part 2 Pattern 2 |
-| Custom video source | BufferProvider/Feeder | Part 1 Pattern 1 |
-| Frame archival | BufferRetriever/Receiver | Part 2 Pattern 1 |
-| Synthetic data generation | BufferProvider/Feeder | Part 1 Pattern 2 |
-| Selective capture | BufferRetriever/Receiver | Part 2 Pattern 3 |
-
-Choose the right API based on your data flow direction: injection (BufferProvider) or extraction (BufferRetriever).
-
----
-
-# Part 1: BufferProvider / Feeder API (Media Extractor)
-
-## Overview
-
-The Media Extractor API (implemented through `BufferProvider` and `Feeder` classes) enables custom data injection into DeepStream pipelines. This is useful for:
-- Injecting custom video frames from non-standard sources
-- Generating synthetic video data for testing
-- Feeding pre-processed frames into the pipeline
-- Implementing custom video sources beyond file/RTSP streams
-
-## Core Concepts
-
-### BufferProvider
-A `BufferProvider` is a user-implemented class that generates buffers on-demand. It works with GStreamer's `appsrc` element to inject data into the pipeline.
-
-### Feeder
-A `Feeder` is a wrapper that connects a `BufferProvider` to an `appsrc` element. It manages the signal handling for "need-data" and "enough-data" events.
-
-### Data Flow
-```
-BufferProvider.generate() -> Feeder -> appsrc -> Pipeline
-```
-
-## API Reference
-
-### BufferProvider Class
-
-Base class for implementing custom media providers.
-
-**Methods to Override**:
-
-#### `generate(size)`
-Generate a buffer when the pipeline needs data.
-
-**Parameters**:
-- `size` (int): Number of bytes requested by the pipeline
-
-**Returns**: `Buffer` object containing the data, or empty `Buffer()` to signal EOS
-
-**Properties to Set**:
-- `format` (str): Video format (e.g., "RGB", "NV12")
-- `width` (int): Frame width in pixels
-- `height` (int): Frame height in pixels
-- `framerate` (int): Frame rate
-- `device` (str): 'gpu' or 'cpu'
-
-**Example**:
-```python
-from pyservicemaker import BufferProvider, as_tensor, ColorFormat, Buffer
-import torch  # pip install torch torchvision (not in base DS container)
-
-class MyBufferProvider(BufferProvider):
-    def __init__(self, video_source):
-        super().__init__()
-        self.source = video_source
-        self.format = "RGB"
-        self.width = 1920
-        self.height = 1080
-        self.framerate = 30
-        self.device = 'gpu'
-        self.frame_count = 0
-
-    def generate(self, size):
-        # Get frame from your custom source
-        frame = self.source.get_next_frame()
-
-        if frame is None:
-            # Signal end of stream
-            return Buffer()
-
-        # Convert to torch tensor (on GPU if needed)
-        torch_tensor = torch.from_numpy(frame).cuda()
-
-        # Convert to DeepStream tensor format
-        ds_tensor = as_tensor(torch_tensor, "HWC")  # Height, Width, Channels
-
-        # Wrap in buffer with color format
-        buffer = ds_tensor.wrap(ColorFormat.RGB)
-
-        self.frame_count += 1
-        return buffer
-```
-
-### Feeder Class
-
-Wrapper for attaching a BufferProvider to a pipeline element.
-
-**Constructor**:
-```python
-from pyservicemaker import Feeder
-
-feeder = Feeder("feeder-name", buffer_provider_instance)
-```
-
-**Parameters**:
-- `name` (str): Name of the feeder
-- `provider` (BufferProvider): BufferProvider instance
-
-### Helper Functions
-
-#### `as_tensor(torch_tensor, layout)`
-Convert a PyTorch tensor to DeepStream tensor format.
-
-**Parameters**:
-- `torch_tensor`: PyTorch tensor
-- `layout` (str): Tensor layout - "HWC" (Height, Width, Channels) or "CHW"
-
-**Returns**: DeepStream tensor object
-
-#### ColorFormat Enum
-Specifies the pixel format for buffers.
-
-**Values**:
-- `ColorFormat.RGB`: RGB format
-- `ColorFormat.RGBA`: RGBA format
-- `ColorFormat.NV12`: NV12 format (YUV 4:2:0)
-- `ColorFormat.GRAY`: Grayscale
-
-### Buffer Class
-
-Container for video frame data.
-
-**Constructor**:
-```python
-buffer = Buffer()  # Empty buffer (signals EOS)
-```
-
-**Methods**:
-- `extract(index)`: Extract tensor at index from buffer
-- `clone()`: Create a copy of the buffer
-
-## Implementation Patterns
-
-### Pattern 1: File-Based Custom Video Source
-
-Read frames from custom file format and inject into pipeline.
-
-```python
-from pyservicemaker import Pipeline, BufferProvider, Feeder, as_tensor, ColorFormat, Buffer
-import cv2  # pip install opencv-python-headless (not in base DS container)
-import torch  # pip install torch torchvision (not in base DS container)
-import platform
-
-class CustomVideoFileProvider(BufferProvider):
-    def __init__(self, video_path):
-        super().__init__()
-        self.cap = cv2.VideoCapture(video_path)
-
-        # Set buffer properties
-        self.format = "RGB"
-        self.width = int(self.cap.get(cv2.CAP_PROP_FRAME_WIDTH))
-        self.height = int(self.cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
-        self.framerate = int(self.cap.get(cv2.CAP_PROP_FPS))
-        self.device = 'gpu'
-        self.frame_count = 0
-
-    def generate(self, size):
-        ret, frame = self.cap.read()
-
-        if not ret:
-            # End of video
-            self.cap.release()
-            return Buffer()
-
-        # Convert BGR to RGB
-        frame_rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
-
-        # Convert to torch tensor and move to GPU
-        torch_tensor = torch.from_numpy(frame_rgb).cuda()
-
-        # Convert to DeepStream tensor
-        ds_tensor = as_tensor(torch_tensor, "HWC")
-
-        self.frame_count += 1
-        print(f"Generated frame {self.frame_count}")
-
-        return ds_tensor.wrap(ColorFormat.RGB)
-
-def main(video_path):
-    pipeline = Pipeline("custom-video-source")
-
-    # Create the provider first so the appsrc caps match its frame properties
-    provider = CustomVideoFileProvider(video_path)
-    caps = f"video/x-raw(memory:NVMM), format=RGB, width={provider.width}, height={provider.height}, framerate={provider.framerate}/1"
-    pipeline.add("appsrc", "src", {
-        "caps": caps,
-        "do-timestamp": True,
-        "format": 3  # GST_FORMAT_TIME
-    })
-
-    # Add processing elements
-    pipeline.add("nvvideoconvert", "convert", {
-        "nvbuf-memory-type": 2,  # NVBUF_MEM_CUDA_DEVICE
-        "compute-hw": 1
-    })
-    pipeline.add("capsfilter", "caps", {"caps": "video/x-raw(memory:NVMM), format=NV12"})
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": 1,
-        "width": provider.width,
-        "height": provider.height
-    })
-
-    # Add inference (optional)
-    pipeline.add("nvinfer", "infer", {
-        "config-file-path": "/path/to/config.yml"
-    })
-
-    # Add display
-    pipeline.add("nvosdbin", "osd")
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {"sync": False})
-
-    # Link elements: appsrc -> convert -> capsfilter -> mux request pad
-    pipeline.link("src", "convert", "caps")
-    pipeline.link(("caps", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "osd", "sink")
-
-    # Attach feeder to appsrc
-    pipeline.attach("src", Feeder("feeder", provider), tips="need-data/enough-data")
-
-    # Start pipeline
-    pipeline.start().wait()
-
-if __name__ == "__main__":
-    import sys
-    main(sys.argv[1])
-```
-
-### Pattern 2: Synthetic Frame Generation
-
-Generate synthetic frames for testing or simulation.
-
-```python
-from pyservicemaker import Pipeline, BufferProvider, Feeder, as_tensor, ColorFormat, Buffer
-import torch  # pip install torch torchvision (not in base DS container)
-import numpy as np
-
-class SyntheticFrameProvider(BufferProvider):
-    def __init__(self, num_frames=100, width=1280, height=720, fps=30):
-        super().__init__()
-        self.format = "RGB"
-        self.width = width
-        self.height = height
-        self.framerate = fps
-        self.device = 'gpu'
-        self.num_frames = num_frames
-        self.frame_idx = 0
-
-    def generate(self, size):
-        if self.frame_idx >= self.num_frames:
-            return Buffer()
-
-        # Generate synthetic frame (moving gradient)
-        x = np.linspace(0, 255, self.width, dtype=np.int32)  # int32 avoids uint8 wraparound before the modulo
-        y = np.linspace(0, 255, self.height, dtype=np.int32)
-
-        offset = (self.frame_idx * 5) % 255
-        frame = np.zeros((self.height, self.width, 3), dtype=np.uint8)
-        frame[:, :, 0] = (x + offset) % 255  # Red channel: horizontal gradient
-        frame[:, :, 1] = ((y + offset) % 255)[:, None]  # Green channel: vertical gradient, broadcast across width
-        frame[:, :, 2] = 128  # Blue channel
-
-        # Convert to torch and move to GPU
-        torch_tensor = torch.from_numpy(frame).cuda()
-        ds_tensor = as_tensor(torch_tensor, "HWC")
-
-        self.frame_idx += 1
-        return ds_tensor.wrap(ColorFormat.RGB)
-
-def generate_test_video():
-    pipeline = Pipeline("synthetic-video")
-
-    provider = SyntheticFrameProvider(num_frames=300, width=1280, height=720, fps=30)
-
-    caps = f"video/x-raw(memory:NVMM), format=RGB, width={provider.width}, height={provider.height}, framerate={provider.framerate}/1"
-    pipeline.add("appsrc", "src", {"caps": caps, "do-timestamp": True})
-    pipeline.add("nvvideoconvert", "convert")
-    pipeline.add("nvv4l2h264enc", "encoder", {"bitrate": 4000000})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("mp4mux", "mux")
-    pipeline.add("filesink", "sink", {"location": "synthetic_output.mp4"})
-
-    pipeline.link("src", "convert", "encoder", "parser", "mux", "sink")
-    pipeline.attach("src", Feeder("feeder", provider), tips="need-data/enough-data")
-
-    pipeline.start().wait()
-```
-
-### Pattern 3: Frame Queue Injection
-
-Transfer frames between two pipelines using a queue.
-
-```python
-from pyservicemaker import Pipeline, BufferProvider, Feeder, as_tensor, ColorFormat, Buffer
-from queue import Queue, Empty
-import torch  # pip install torch torchvision (not in base DS container)
-
-class QueuedBufferProvider(BufferProvider):
-    def __init__(self, frame_queue, width=1280, height=720):
-        super().__init__()
-        self.queue = frame_queue
-        self.format = "RGB"
-        self.width = width
-        self.height = height
-        self.framerate = 30
-        self.device = 'gpu'
-
-    def generate(self, size):
-        try:
-            # Wait up to 2 seconds for frame
-            tensor = self.queue.get(timeout=2)
-
-            # Convert DLPack tensor to PyTorch
-            torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-
-            # Convert to DeepStream tensor
-            ds_tensor = as_tensor(torch_tensor, "HWC")
-
-            return ds_tensor.wrap(ColorFormat.RGB)
-        except Empty:
-            # Queue is empty, signal EOS
-            print("Queue empty, ending stream")
-            return Buffer()
-
-def pipeline_with_queue_injection(frame_queue):
-    pipeline = Pipeline("queue-injection")
-
-    provider = QueuedBufferProvider(frame_queue, width=1280, height=720)
-
-    caps = f"video/x-raw(memory:NVMM), format=RGB, width={provider.width}, height={provider.height}, framerate={provider.framerate}/1"
-    pipeline.add("appsrc", "src", {"caps": caps, "do-timestamp": True})
-    pipeline.add("nvvideoconvert", "convert", {"nvbuf-memory-type": 2})
-    pipeline.add("capsfilter", "caps", {"caps": "video/x-raw(memory:NVMM), format=NV12"})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1280, "height": 720})
-    pipeline.add("nveglglessink", "sink", {"sync": False})
-
-    pipeline.link("src", "convert", "caps")
-    pipeline.link(("caps", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "sink")
-
-    pipeline.attach("src", Feeder("feeder", provider), tips="need-data/enough-data")
-    pipeline.start().wait()
-```
-
-### Pattern 4: Flow API with Buffer Injection
-
-High-level Flow API for buffer injection.
-
-```python
-from pyservicemaker import Pipeline, Flow, BufferProvider, ColorFormat, as_tensor, Buffer
-import torch  # pip install torch torchvision (not in base DS container)
-import cv2  # pip install opencv-python-headless (not in base DS container)
-
-class SimpleVideoProvider(BufferProvider):
-    def __init__(self, video_path):
-        super().__init__()
-        self.cap = cv2.VideoCapture(video_path)
-        self.format = "RGB"
-        self.width = int(self.cap.get(cv2.CAP_PROP_FRAME_WIDTH))
-        self.height = int(self.cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
-        self.framerate = int(self.cap.get(cv2.CAP_PROP_FPS))
-        self.device = 'gpu'
-
-    def generate(self, size):
-        ret, frame = self.cap.read()
-        if not ret:
-            return Buffer()
-
-        frame_rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
-        torch_tensor = torch.from_numpy(frame_rgb).cuda()
-        ds_tensor = as_tensor(torch_tensor, "HWC")
-        return ds_tensor.wrap(ColorFormat.RGB)
-
-def flow_api_injection(video_path):
-    pipeline = Pipeline("flow-injection")
-    provider = SimpleVideoProvider(video_path)
-
-    # Flow API: inject() -> infer() -> render(); each call returns the Flow to continue from
-    flow = Flow(pipeline)
-    flow = flow.inject([provider])  # Pass list of providers
-    flow = flow.infer("/path/to/config.yml")  # Optional: add inference
-    flow = flow.render()  # Add renderer
-    flow()  # Execute
-```
-
-## Advanced Usage
-
-### Multi-Source Buffer Injection
-
-Inject from multiple custom sources simultaneously.
-
-```python
-from pyservicemaker import Pipeline, BufferProvider, Feeder, as_tensor, ColorFormat, Buffer
-import cv2  # pip install opencv-python-headless (not in base DS container)
-import torch  # pip install torch torchvision (not in base DS container)
-
-class MultiSourceProvider(BufferProvider):
-    def __init__(self, source_id, video_path):
-        super().__init__()
-        self.source_id = source_id
-        self.cap = cv2.VideoCapture(video_path)
-        self.format = "RGB"
-        self.width = 1280
-        self.height = 720
-        self.framerate = 30
-        self.device = 'gpu'
-
-    def generate(self, size):
-        ret, frame = self.cap.read()
-        if not ret:
-            return Buffer()
-
-        # Resize to common size
-        frame = cv2.resize(frame, (self.width, self.height))
-        frame_rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
-
-        torch_tensor = torch.from_numpy(frame_rgb).cuda()
-        ds_tensor = as_tensor(torch_tensor, "HWC")
-        return ds_tensor.wrap(ColorFormat.RGB)
-
-def multi_source_injection(video_paths):
-    pipeline = Pipeline("multi-source-injection")
-
-    # Create multiple appsrc elements
-    for i, path in enumerate(video_paths):
-        caps = "video/x-raw(memory:NVMM), format=RGB, width=1280, height=720, framerate=30/1"
-        pipeline.add("appsrc", f"src{i}", {"caps": caps, "do-timestamp": True})
-        pipeline.add("nvvideoconvert", f"convert{i}", {"nvbuf-memory-type": 2})
-
-    # Add muxer
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": len(video_paths),
-        "width": 1280,
-        "height": 720
-    })
-
-    # Add inference and display
-    pipeline.add("nvinfer", "infer", {"config-file-path": "/path/to/config.yml"})
-    pipeline.add("nvmultistreamtiler", "tiler", {"rows": 2, "columns": 2})
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-
-    # Link sources to muxer
-    for i in range(len(video_paths)):
-        pipeline.link(f"src{i}", f"convert{i}")
-        pipeline.link((f"convert{i}", "mux"), ("", "sink_%u"))
-
-        # Attach feeder
-        provider = MultiSourceProvider(i, video_paths[i])
-        pipeline.attach(f"src{i}", Feeder(f"feeder{i}", provider), tips="need-data/enough-data")
-
-    # Link processing chain
-    pipeline.link("mux", "infer", "tiler", "osd", "sink")
-    pipeline.start().wait()
-```
-
-## Part 1 Best Practices
-
-### 1. Memory Management
-- Use GPU memory (`device='gpu'`) for best performance
-- Release resources properly (close files, release capture devices)
-- Avoid memory leaks by managing tensors correctly
-
-### 2. Buffer Format
-- Always specify correct `format`, `width`, `height`, and `framerate`
-- Match color format with pipeline requirements
-- Use `ColorFormat.RGB` for most cases, `ColorFormat.NV12` for optimized pipelines
-
-### 3. Timestamping
-- Set `"do-timestamp": True` on appsrc for proper synchronization
-- Important for multi-stream applications
-
-### 4. Signal Handling
-- Use `tips="need-data/enough-data"` when attaching Feeder
-- This enables proper flow control and prevents buffer overflow
-
-### 5. End of Stream
-- Return empty `Buffer()` to signal EOS
-- Clean up resources before returning EOS
-
-### 6. Error Handling
-```python
-class SafeBufferProvider(BufferProvider):
-    def __init__(self, source):
-        super().__init__()
-        self.source = source
-        self.format = "RGB"
-        self.width = 1280
-        self.height = 720
-        self.framerate = 30
-        self.device = 'gpu'
-
-    def generate(self, size):
-        try:
-            frame = self.source.get_frame()
-            if frame is None:
-                return Buffer()
-
-            torch_tensor = torch.from_numpy(frame).cuda()
-            ds_tensor = as_tensor(torch_tensor, "HWC")
-            return ds_tensor.wrap(ColorFormat.RGB)
-        except Exception as e:
-            print(f"Error generating buffer: {e}")
-            return Buffer()  # Signal EOS on error
-```
-
-## Part 1 Common Use Cases
-
-### 1. Custom Camera Integration
-Integrate cameras not supported by standard GStreamer elements.
-
-### 2. Pre-processed Frame Injection
-Inject frames that have been pre-processed by custom algorithms.
-
-### 3. Frame Rate Control
-Control exact frame timing and rate for testing.
-
-### 4. Multi-Pipeline Communication
-Transfer frames between multiple DeepStream pipelines. See also Part 2 Pattern 2 for the retriever side of pipeline-to-pipeline transfer.
-
-### 5. Synthetic Data Generation
-Generate synthetic data for testing inference models.
-
-### 6. Image Sequence Processing
-Process sequences of images as video streams.
-
-## Part 1 Troubleshooting
-
-### Issue 1: Frames Not Flowing
-**Solution**: Check that `tips="need-data/enough-data"` is set, and verify that the appsrc caps match the buffer properties
-
-### Issue 2: Memory Errors
-**Solution**: Ensure tensors are on the correct device (GPU/CPU) and check memory allocation
-
-### Issue 3: Format Mismatch
-**Solution**: Verify color format matches between BufferProvider and appsrc caps
-
-### Issue 4: Timing Issues
-**Solution**: Enable timestamping with `"do-timestamp": True`
-
-## Part 1 Summary
-
-The Media Extractor API (BufferProvider/Feeder) injects custom video data into DeepStream pipelines through `appsrc`. Key points:
-
-1. Implement `BufferProvider.generate()` to create custom buffers
-2. Use `Feeder` to attach provider to `appsrc` elements
-3. Convert data to DeepStream format using `as_tensor()` and `wrap()`
-4. Return empty `Buffer()` to signal end of stream
-5. Always set correct format properties (`width`, `height`, `framerate`, etc.)
-6. Use GPU memory for optimal performance
-
-This API lets custom video sources feed DeepStream's inference and analytics elements.
-
----
-
-# Part 2: BufferRetriever / Receiver API (Frame Selector)
-
-## Overview
-
-The Frame Selector API (implemented through `BufferRetriever` and `Receiver` classes) enables extraction of video frames and buffers from DeepStream pipelines. This is useful for:
-- Extracting frames for custom processing outside the pipeline
-- Saving frames to disk or sending to external systems
-- Collecting inference results with frame data
-- Implementing custom frame selection logic
-- Transferring data between multiple pipelines
-
-## Core Concepts
-
-### BufferRetriever
-A `BufferRetriever` is a user-implemented class that consumes buffers from the pipeline. It works with GStreamer's `appsink` element to extract data from the pipeline.
-
-### Receiver
-A `Receiver` is a wrapper that connects a `BufferRetriever` to an `appsink` element. It manages the signal handling for "new-sample" events.
-
-### Data Flow
-```
-Pipeline -> appsink -> Receiver -> BufferRetriever.consume()
-```
-
-## API Reference
-
-### BufferRetriever Class
-
-Base class for implementing custom buffer consumers.
-
-**Methods to Override**:
-
-#### `consume(buffer)`
-Process a buffer received from the pipeline.
-
-**Parameters**:
-- `buffer` (Buffer): Buffer object containing frame data
-
-**Returns**: int (1 for success, 0 for a recoverable error, -1 for a fatal error)
-
-**Example**:
-```python
-from pyservicemaker import BufferRetriever
-import torch  # pip install torch torchvision (not in base DS container)
-
-class MyBufferRetriever(BufferRetriever):
-    def __init__(self):
-        super().__init__()
-        self.frame_count = 0
-
-    def consume(self, buffer):
-        # Extract tensor from buffer at index 0
-        tensor = buffer.extract(0)
-
-        # Clone to prevent data loss
-        tensor_copy = tensor.clone()
-
-        # Convert to PyTorch for processing
-        torch_tensor = torch.utils.dlpack.from_dlpack(tensor_copy)
-
-        # Process the frame
-        print(f"Received frame {self.frame_count}: shape={torch_tensor.shape}")
-
-        self.frame_count += 1
-        return 1  # Success
-```
-
-### Receiver Class
-
-Wrapper for attaching a BufferRetriever to a pipeline element.
-
-**Constructor**:
-```python
-from pyservicemaker import Receiver
-
-receiver = Receiver("receiver-name", buffer_retriever_instance)
-```
-
-**Parameters**:
-- `name` (str): Name of the receiver
-- `retriever` (BufferRetriever): BufferRetriever instance
-
-### Buffer Class Methods
-
-**Methods**:
-
-#### `extract(index)`
-Extract tensor at specified index from the buffer.
-
-**Parameters**:
-- `index` (int): Batch index (usually 0 for single-stream)
-
-**Returns**: Tensor object (DLPack format)
-
-#### `clone()`
-Create a copy of the tensor to prevent data corruption.
-
-**Returns**: Cloned tensor
-
-**Example**:
-```python
-def consume(self, buffer):
-    # Extract and clone in one step
-    tensor = buffer.extract(0).clone()
-
-    # Now safe to use tensor asynchronously
-    torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-    return 1
-```
-
-## Implementation Patterns
-
-### Pattern 1: Frame Extraction and Saving
-
-Extract frames from pipeline and save to disk.
-
-```python
-from pyservicemaker import Pipeline, BufferRetriever, Receiver
-import torch  # pip install torch torchvision (not in base DS container)
-import cv2  # pip install opencv-python-headless (not in base DS container)
-import numpy as np
-import platform
-from multiprocessing import Process
-
-class FrameSaver(BufferRetriever):
-    def __init__(self, output_dir="./frames", save_interval=30):
-        super().__init__()
-        self.output_dir = output_dir
-        self.save_interval = save_interval
-        self.frame_count = 0
-
-        import os
-        os.makedirs(output_dir, exist_ok=True)
-
-    def consume(self, buffer):
-        # Extract and clone buffer
-        tensor = buffer.extract(0).clone()
-
-        # Save every Nth frame
-        if self.frame_count % self.save_interval == 0:
-            # Convert to PyTorch tensor
-            torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-
-            # Move to CPU and convert to numpy
-            frame_np = torch_tensor.cpu().numpy()
-
-            # Convert RGB to BGR for OpenCV
-            frame_bgr = cv2.cvtColor(frame_np, cv2.COLOR_RGB2BGR)
-
-            # Save frame
-            filename = f"{self.output_dir}/frame_{self.frame_count:06d}.jpg"
-            cv2.imwrite(filename, frame_bgr)
-            print(f"Saved: {filename}")
-
-        self.frame_count += 1
-        return 1
-
-def extract_frames(video_uri, output_dir):
-    pipeline = Pipeline("frame-extractor")
-
-    # Source
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-
-    # Muxer
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": 1,
-        "width": 1920,
-        "height": 1080
-    })
-
-    # Convert to RGB for extraction
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("capsfilter", "caps", {
-        "caps": "video/x-raw(memory:NVMM), format=RGB"
-    })
-
-    # Sink for extraction
-    pipeline.add("appsink", "sink", {
-        "emit-signals": True,
-        "sync": False
-    })
-
-    # Link elements
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "converter", "caps", "sink")
-
-    # Attach retriever
-    retriever = FrameSaver(output_dir, save_interval=30)
-    pipeline.attach("sink", Receiver("receiver", retriever), tips="new-sample")
-
-    # Run
-    pipeline.start().wait()
-
-if __name__ == "__main__":
-    import sys
-    process = Process(target=extract_frames, args=(sys.argv[1], "./output_frames"))
-    try:
-        process.start()
-        process.join()
-    except KeyboardInterrupt:
-        process.terminate()
-```
-
-### Pattern 2: Frame Queue Transfer
-
-Transfer frames from one pipeline to another using a queue.
-
-> **CRITICAL WARNING: Queue Type Selection**
->
-> When transferring data between **threads**, use `queue.Queue` (from `queue` module).
-> When transferring data between **processes**, use `multiprocessing.Queue`.
->
-> Using `queue.Queue` with `multiprocessing.Process` will silently fail - data put into the queue in a child process will NEVER reach the parent process! This is a common bug that causes pipelines to appear running but produce no output.
->
-> See the Best Practices reference for Anti-Pattern 4 with detailed examples.
-
-```python
-from pyservicemaker import Pipeline, BufferRetriever, Receiver, BufferProvider, Feeder
-from pyservicemaker import as_tensor, ColorFormat, Buffer
-import torch  # pip install torch torchvision (not in base DS container)
-from queue import Queue, Empty  # Use for THREADING only!
-# from multiprocessing import Queue  # Use this for MULTIPROCESSING!
-import threading
-
-class QueuedRetriever(BufferRetriever):
-    def __init__(self, frame_queue):
-        super().__init__()
-        self.queue = frame_queue
-        self.count = 0
-
-    def consume(self, buffer):
-        # Extract and clone
-        tensor = buffer.extract(0).clone()
-
-        # Put in queue for other pipeline
-        self.queue.put(tensor)
-
-        self.count += 1
-        print(f"Queued frame {self.count}")
-        return 1
-
-class QueuedProvider(BufferProvider):
-    def __init__(self, frame_queue, width=1280, height=720):
-        super().__init__()
-        self.queue = frame_queue
-        self.format = "RGB"
-        self.width = width
-        self.height = height
-        self.framerate = 30
-        self.device = 'gpu'
-
-    def generate(self, size):
-        try:
-            tensor = self.queue.get(timeout=2)
-            torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-
-            ds_tensor = as_tensor(torch_tensor, "HWC")
-            return ds_tensor.wrap(ColorFormat.RGB)
-        except Empty:
-            # No frame arrived within 2 seconds: signal EOS
-            return Buffer()
-
-def source_pipeline(uri, queue):
-    """Extract frames from source and queue them"""
-    pipeline = Pipeline("source-pipeline")
-
-    pipeline.add("nvurisrcbin", "src", {"uri": uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1280, "height": 720})
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("capsfilter", "caps", {"caps": "video/x-raw(memory:NVMM), format=RGB"})
-    pipeline.add("appsink", "sink", {"emit-signals": True, "sync": False})
-
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "converter", "caps", "sink")
-
-    retriever = QueuedRetriever(queue)
-    pipeline.attach("sink", Receiver("receiver", retriever), tips="new-sample")
-
-    pipeline.start().wait()
-
-def destination_pipeline(queue):
-    """Consume frames from queue and process"""
-    pipeline = Pipeline("dest-pipeline")
-
-    provider = QueuedProvider(queue, width=1280, height=720)
-
-    caps = "video/x-raw(memory:NVMM), format=RGB, width=1280, height=720, framerate=30/1"
-    pipeline.add("appsrc", "src", {"caps": caps, "do-timestamp": True})
-    pipeline.add("nvvideoconvert", "convert", {"nvbuf-memory-type": 2})
-    pipeline.add("capsfilter", "caps2", {"caps": "video/x-raw(memory:NVMM), format=NV12"})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1280, "height": 720})
-    pipeline.add("nvinfer", "infer", {"config-file-path": "/path/to/config.yml"})
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-
-    pipeline.link("src", "convert", "caps2")
-    pipeline.link(("convert", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "osd", "sink")
-
-    pipeline.attach("src", Feeder("feeder", provider), tips="need-data/enough-data")
-
-    pipeline.start().wait()
-
-def multi_pipeline_transfer(video_uri, use_multiprocessing=False):
-    """
-    Transfer frames between pipelines.
-
-    IMPORTANT: Queue type must match execution model:
-    - Threading: use queue.Queue
-    - Multiprocessing: use multiprocessing.Queue
-
-    Args:
-        video_uri: Video source URI
-        use_multiprocessing: If True, use processes (requires multiprocessing.Queue)
-    """
-    if use_multiprocessing:
-        from multiprocessing import Queue as MPQueue, Process
-        queue = MPQueue(maxsize=10)  # MUST use multiprocessing.Queue!
-
-        # Run pipelines in separate processes
-        proc1 = Process(target=source_pipeline, args=(video_uri, queue))
-        proc2 = Process(target=destination_pipeline, args=(queue,))
-
-        proc1.start()
-        proc2.start()
-
-        proc2.join()
-        proc1.join()
-    else:
-        # Threading approach - queue.Queue works fine here
-        queue = Queue(maxsize=10)
-
-        # Run both pipelines in threads (same process, shared memory)
-        thread1 = threading.Thread(target=source_pipeline, args=(video_uri, queue))
-        thread2 = threading.Thread(target=destination_pipeline, args=(queue,))
-
-        thread1.start()
-        thread2.start()
-
-        thread2.join()
-        thread1.join()
-```
-
-### Pattern 3: Selective Frame Capture
-
-Capture frames based on inference results (e.g., when objects are detected).
-
-```python
-from pyservicemaker import Pipeline, BufferRetriever, Receiver, BatchMetadataOperator, Probe
-import torch  # pip install torch torchvision (not in base DS container)
-import cv2  # pip install opencv-python-headless (not in base DS container)
-import numpy as np
-
-class SelectiveFrameCapture(BufferRetriever):
-    def __init__(self, output_dir="./captured", min_objects=1):
-        super().__init__()
-        self.output_dir = output_dir
-        self.min_objects = min_objects
-        self.frame_count = 0
-        self.saved_count = 0
-        self.capture_next = False
-
-        import os
-        os.makedirs(output_dir, exist_ok=True)
-
-    def set_capture_flag(self, should_capture):
-        """Called by metadata probe to signal capture"""
-        self.capture_next = should_capture
-
-    def consume(self, buffer):
-        tensor = buffer.extract(0).clone()
-
-        if self.capture_next:
-            # Save this frame
-            torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-            frame_np = torch_tensor.cpu().numpy()
-            frame_bgr = cv2.cvtColor(frame_np, cv2.COLOR_RGB2BGR)
-
-            filename = f"{self.output_dir}/capture_{self.saved_count:06d}.jpg"
-            cv2.imwrite(filename, frame_bgr)
-            print(f"Captured frame {self.frame_count} with objects -> {filename}")
-
-            self.saved_count += 1
-            self.capture_next = False
-
-        self.frame_count += 1
-        return 1
-
-class ObjectDetectionTrigger(BatchMetadataOperator):
-    def __init__(self, frame_capture, min_objects=1):
-        super().__init__()
-        self.frame_capture = frame_capture
-        self.min_objects = min_objects
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Note: object_items is an ITERATOR - cannot use len() directly
-            # Count by iterating
-            obj_count = sum(1 for _ in frame_meta.object_items)
-
-            if obj_count >= self.min_objects:
-                # Signal frame capture to save this frame
-                self.frame_capture.set_capture_flag(True)
-                print(f"Detected {obj_count} objects, triggering capture")
-
-def selective_capture(video_uri, config_path, output_dir):
-    pipeline = Pipeline("selective-capture")
-
-    # Source and muxer
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path})
-
-    # Convert for extraction
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("capsfilter", "caps", {"caps": "video/x-raw(memory:NVMM), format=RGB"})
-
-    # Sink
-    pipeline.add("appsink", "sink", {"emit-signals": True, "sync": False})
-
-    # Link
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "converter", "caps", "sink")
-
-    # Attach frame capture
-    frame_capture = SelectiveFrameCapture(output_dir, min_objects=2)
-    pipeline.attach("sink", Receiver("receiver", frame_capture), tips="new-sample")
-
-    # Attach metadata processor to trigger capture
-    trigger = ObjectDetectionTrigger(frame_capture, min_objects=2)
-    pipeline.attach("infer", Probe("trigger", trigger))
-
-    pipeline.start().wait()
-```
-
-### Pattern 4: Flow API with Frame Retrieval
-
-High-level Flow API for frame extraction.
-
-```python
-from pyservicemaker import Pipeline, Flow, BufferRetriever
-import torch  # pip install torch torchvision (not in base DS container)
-import cv2  # pip install opencv-python-headless (not in base DS container)
-import numpy as np
-
-class SimpleFrameRetriever(BufferRetriever):
-    def __init__(self, save_path="output.jpg"):
-        super().__init__()
-        self.save_path = save_path
-        self.count = 0
-
-    def consume(self, buffer):
-        if self.count == 0:  # Save first frame only
-            tensor = buffer.extract(0).clone()
-            torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-            frame_np = torch_tensor.cpu().numpy()
-            frame_bgr = cv2.cvtColor(frame_np, cv2.COLOR_RGB2BGR)
-            cv2.imwrite(self.save_path, frame_bgr)
-            print(f"Saved frame to {self.save_path}")
-
-        self.count += 1
-        return 1
-
-def flow_api_retrieval(video_uri):
-    pipeline = Pipeline("flow-retrieval")
-    retriever = SimpleFrameRetriever("output_frame.jpg")
-
-    # Flow API: batch_capture() -> retrieve()
-    flow = Flow(pipeline)
-    flow.batch_capture([video_uri])
-    flow.retrieve(retriever)
-    flow()
-```
-
-### Pattern 5: Frame Analysis and Logging
-
-Extract frames with metadata for analysis.
-
-```python
-from pyservicemaker import Pipeline, BufferRetriever, Receiver, BatchMetadataOperator, Probe
-import torch  # pip install torch torchvision (not in base DS container)
-import json
-from datetime import datetime
-
-class FrameAnalyzer(BufferRetriever):
-    def __init__(self, log_file="frame_analysis.json"):
-        super().__init__()
-        self.log_file = log_file
-        self.frame_count = 0
-        self.metadata_cache = {}
-
-    def set_metadata(self, frame_num, metadata):
-        """Called by metadata probe"""
-        self.metadata_cache[frame_num] = metadata
-
-    def consume(self, buffer):
-        tensor = buffer.extract(0).clone()
-        torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-
-        # Calculate frame statistics
-        mean_intensity = torch_tensor.float().mean().item()
-        std_intensity = torch_tensor.float().std().item()
-
-        # Get metadata if available
-        metadata = self.metadata_cache.get(self.frame_count, {})
-
-        # Log analysis
-        analysis = {
-            "frame_number": self.frame_count,
-            "timestamp": datetime.now().isoformat(),
-            "mean_intensity": mean_intensity,
-            "std_intensity": std_intensity,
-            "shape": list(torch_tensor.shape),
-            "objects_detected": metadata.get("object_count", 0),
-            "object_classes": metadata.get("classes", [])
-        }
-
-        with open(self.log_file, "a") as f:
-            f.write(json.dumps(analysis) + "\n")
-
-        # Clear cached metadata
-        if self.frame_count in self.metadata_cache:
-            del self.metadata_cache[self.frame_count]
-
-        self.frame_count += 1
-        return 1
-
-class MetadataExtractor(BatchMetadataOperator):
-    def __init__(self, frame_analyzer):
-        super().__init__()
-        self.frame_analyzer = frame_analyzer
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Note: object_items is an ITERATOR - convert to list if you need
-            # to access it multiple times or use len()
-            objects = list(frame_meta.object_items)
-            metadata = {
-                "object_count": len(objects),
-                "classes": [obj.class_id for obj in objects],
-                "confidences": [obj.confidence for obj in objects]
-            }
-            self.frame_analyzer.set_metadata(frame_meta.frame_number, metadata)
-
-def analyze_frames(video_uri, config_path):
-    pipeline = Pipeline("frame-analyzer")
-
-    # Source
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path})
-
-    # Convert and extract
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("capsfilter", "caps", {"caps": "video/x-raw(memory:NVMM), format=RGB"})
-    pipeline.add("appsink", "sink", {"emit-signals": True, "sync": False})
-
-    # Link
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "converter", "caps", "sink")
-
-    # Attach analyzer
-    analyzer = FrameAnalyzer("analysis_log.json")
-    pipeline.attach("sink", Receiver("receiver", analyzer), tips="new-sample")
-
-    # Attach metadata extractor
-    extractor = MetadataExtractor(analyzer)
-    pipeline.attach("infer", Probe("extractor", extractor))
-
-    pipeline.start().wait()
-```
-
-### Pattern 6: Real-time Frame Streaming
-
-Stream frames to external system (e.g., web server, cloud service).
-
-```python
-from pyservicemaker import Pipeline, BufferRetriever, Receiver
-import torch  # pip install torch torchvision (not in base DS container)
-import cv2  # pip install opencv-python-headless (not in base DS container)
-import numpy as np
-import base64
-import requests
-
-class FrameStreamer(BufferRetriever):
-    def __init__(self, endpoint_url, stream_interval=1):
-        super().__init__()
-        self.endpoint_url = endpoint_url
-        self.stream_interval = stream_interval
-        self.frame_count = 0
-
-    def consume(self, buffer):
-        # Stream every Nth frame
-        if self.frame_count % self.stream_interval == 0:
-            tensor = buffer.extract(0).clone()
-            torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-            frame_np = torch_tensor.cpu().numpy()
-
-            # Encode as JPEG
-            frame_bgr = cv2.cvtColor(frame_np, cv2.COLOR_RGB2BGR)
-            _, jpeg_buffer = cv2.imencode('.jpg', frame_bgr, [cv2.IMWRITE_JPEG_QUALITY, 85])
-
-            # Encode as base64
-            jpeg_base64 = base64.b64encode(jpeg_buffer).decode('utf-8')
-
-            # Send to endpoint
-            try:
-                response = requests.post(
-                    self.endpoint_url,
-                    json={
-                        "frame_number": self.frame_count,
-                        "image": jpeg_base64
-                    },
-                    timeout=1
-                )
-                if response.status_code == 200:
-                    print(f"Streamed frame {self.frame_count}")
-            except Exception as e:
-                print(f"Failed to stream frame {self.frame_count}: {e}")
-
-        self.frame_count += 1
-        return 1
-
-def stream_frames(video_uri, endpoint_url):
-    pipeline = Pipeline("frame-streamer")
-
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1280, "height": 720})
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("capsfilter", "caps", {"caps": "video/x-raw(memory:NVMM), format=RGB"})
-    pipeline.add("appsink", "sink", {"emit-signals": True, "sync": False})
-
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "converter", "caps", "sink")
-
-    streamer = FrameStreamer(endpoint_url, stream_interval=10)
-    pipeline.attach("sink", Receiver("receiver", streamer), tips="new-sample")
-
-    pipeline.start().wait()
-```
-
-## Part 2 Best Practices
-
-### 1. Always Clone Buffers
-```python
-def consume(self, buffer):
-    # ALWAYS clone to prevent data corruption
-    tensor = buffer.extract(0).clone()
-    # Now safe to use asynchronously
-```
-
-### 2. Signal Configuration
-```python
-# Always use "new-sample" signal for appsink
-pipeline.attach("sink", Receiver("receiver", retriever), tips="new-sample")
-
-# Enable signal emission on appsink
-pipeline.add("appsink", "sink", {"emit-signals": True})
-```
-
-### 3. Synchronization Control
-```python
-# For frame extraction, usually disable sync
-pipeline.add("appsink", "sink", {
-    "emit-signals": True,
-    "sync": False  # Don't block on frame rate
-})
-
-# For real-time processing, enable sync
-pipeline.add("appsink", "sink", {
-    "emit-signals": True,
-    "sync": True  # Maintain real-time pacing
-})
-```
-
-### 4. Return Value Handling
-```python
-def consume(self, buffer):
-    try:
-        # Process buffer
-        tensor = buffer.extract(0).clone()
-        # ... processing ...
-        return 1  # Success, continue processing
-    except Exception as e:
-        print(f"Error: {e}")
-        return 0  # Error, but continue
-        # return -1  # Fatal error, stop pipeline
-```
-
-### 5. Memory Management
-```python
-class EfficientRetriever(BufferRetriever):
-    def __init__(self):
-        super().__init__()
-        self.frame_buffer = []
-        self.max_buffer_size = 100
-
-    def consume(self, buffer):
-        tensor = buffer.extract(0).clone()
-
-        # Limit buffer size to prevent memory issues
-        if len(self.frame_buffer) >= self.max_buffer_size:
-            self.frame_buffer.pop(0)  # Remove oldest
-
-        self.frame_buffer.append(tensor)
-        return 1
-```
-
-### 6. Thread Safety
-```python
-import threading
-
-class ThreadSafeRetriever(BufferRetriever):
-    def __init__(self):
-        super().__init__()
-        self.lock = threading.Lock()
-        self.frame_count = 0
-
-    def consume(self, buffer):
-        with self.lock:
-            tensor = buffer.extract(0).clone()
-            # Safe concurrent access
-            self.frame_count += 1
-        return 1
-```
-
-## Advanced Usage
-
-### Multi-Batch Frame Extraction
-
-Extract frames from multi-stream batches.
-
-```python
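-from pyservicemaker import Pipeline, BufferRetriever, Receiver
-import torch  # pip install torch torchvision (not in base DS container)
-
-class MultiBatchRetriever(BufferRetriever):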
-    def __init__(self, num_streams):
-        super().__init__()
-        self.num_streams = num_streams
-        self.frame_counts = [0] * num_streams
-
-    def consume(self, buffer):
-        # Extract all streams in batch
-        for stream_idx in range(self.num_streams):
-            try:
-                tensor = buffer.extract(stream_idx).clone()
-                torch_tensor = torch.utils.dlpack.from_dlpack(tensor)
-
-                # Process each stream
-                print(f"Stream {stream_idx}, Frame {self.frame_counts[stream_idx]}")
-
-                self.frame_counts[stream_idx] += 1
-            except Exception as e:
-                print(f"Error extracting stream {stream_idx}: {e}")
-
-        return 1
-
-def multi_stream_extraction(video_uris):
-    pipeline = Pipeline("multi-stream-extract")
-
-    # Add sources
-    for i, uri in enumerate(video_uris):
-        pipeline.add("nvurisrcbin", f"src{i}", {"uri": uri})
-
-    # Muxer for batching
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": len(video_uris),
-        "width": 1280,
-        "height": 720
-    })
-
-    # Convert and extract
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("capsfilter", "caps", {"caps": "video/x-raw(memory:NVMM), format=RGB"})
-    pipeline.add("appsink", "sink", {"emit-signals": True, "sync": False})
-
-    # Link sources to muxer
-    for i in range(len(video_uris)):
-        pipeline.link((f"src{i}", "mux"), ("", "sink_%u"))
-
-    pipeline.link("mux", "converter", "caps", "sink")
-
-    # Attach multi-batch retriever
-    retriever = MultiBatchRetriever(len(video_uris))
-    pipeline.attach("sink", Receiver("receiver", retriever), tips="new-sample")
-
-    pipeline.start().wait()
-```
-
-## Part 2 Common Use Cases
-
-### 1. Frame Archival
-Extract and save frames at regular intervals for archival purposes.
-
-### 2. Thumbnail Generation
-Extract keyframes to generate video thumbnails.
-
-### 3. Object Detection Screenshots
-Capture frames when specific objects are detected.
-
-### 4. Video Quality Analysis
-Extract frames for quality metrics computation.
-
-### 5. Pipeline Debugging
-Extract frames at various pipeline stages for debugging.
-
-### 6. Data Collection
-Collect frames and metadata for training dataset creation.
-
-## Part 2 Troubleshooting
-
-### Issue 1: No Frames Received
-**Solution**: Set `emit-signals=True` on the appsink and attach the `Receiver` with `tips="new-sample"`.
-
-### Issue 2: Data Corruption
-**Solution**: Always call `.clone()` on extracted tensors before processing them asynchronously.
-
-### Issue 3: Memory Leaks
-**Solution**: Cap the number of tensors you retain (see Best Practice 5, Memory Management) and drop references to tensors once they have been processed.
-
-### Issue 4: Performance Issues
-**Solution**: Set `sync=False` on the appsink and move heavy work out of `consume()` so it returns quickly.
-
-### Issue 5: Missing Frames
-**Solution**: Check that `consume()` returns `1` on success, and make sure processing keeps up with the frame rate.
-
-### Issue 6: Frames/Batches Not Reaching Downstream Processing (Queue Empty)
-**Symptoms**:
-- Pipeline runs without errors
-- BufferRetriever.consume() is being called
-- But downstream processing (VLM, Kafka, etc.) never receives data
-- Queue appears to be empty in consumer thread/process
-
-**Root Cause**: Using `queue.Queue` with `multiprocessing.Process`
-
-**Solution**:
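-1. If you need process isolation, switch to `multiprocessing.Queue`.
-2. If process isolation is not required, use `threading.Thread` with `queue.Queue`.
-3. If your application exposes a `use_multiprocessing` option (as `multi_pipeline_transfer()` in Pattern 2 does), set it to `False` to take the threading path.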
-
-```python
-# WRONG: queue.Queue with multiprocessing
-from multiprocessing import Process
-from queue import Queue  # Won't work across processes!
-
-# CORRECT Option 1: Use multiprocessing.Queue
-from multiprocessing import Process, Queue
-
-# CORRECT Option 2: Use threading instead
-import threading
-from queue import Queue
-
-# See the Best Practices reference for Anti-Pattern 4 details
-```
-
-## Part 2 Summary
-
-The Frame Selector API (`BufferRetriever`/`Receiver`) extracts frames and tensor data from DeepStream pipelines through an `appsink`. Key points:
-
-1. Implement `BufferRetriever.consume()` to process extracted buffers
-2. Use `Receiver` to attach retriever to `appsink` elements
-3. Always call `.clone()` on the result of `buffer.extract(i)` (index `0` for a single-stream batch) before using the tensor
-4. Return `1` for success, `0` for error (continue), `-1` for fatal error
-5. Set `emit-signals=True` on appsink and use `tips="new-sample"`
-6. Consider `sync=False` for non-real-time extraction
-
-Use this API to hand frames, inference results, and metadata from a DeepStream pipeline to custom processing, archival, or other systems.
diff --git a/skills/deepstream/deepstream-dev/references/docker_containers.md b/skills/deepstream/deepstream-dev/references/docker_containers.md
deleted file mode 100644
index f5bf6245..00000000
--- a/skills/deepstream/deepstream-dev/references/docker_containers.md
+++ /dev/null
@@ -1,273 +0,0 @@
-# DeepStream Docker Containers Reference
-
-## Overview
-
-DeepStream Docker images are hosted on the NVIDIA NGC container registry (`nvcr.io`). They package all SDK dependencies (GStreamer, TensorRT, CUDA, models, sample streams) and require the NVIDIA Container Toolkit (`nvidia-container-toolkit`) for GPU access.
-
-- **NGC catalog page**: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/deepstream
-- **Official docs**: https://docs.nvidia.com/metropolis/deepstream/dev-guide/text/DS_docker_containers.html
-
----
-
-## Available Containers (DeepStream 9.0)
-
-### dGPU (x86_64)
-
-| Container | Pull Command | Description |
-|-----------|-------------|-------------|
-| **Samples** | `docker pull nvcr.io/nvidia/deepstream:9.0-samples-multiarch` | Runtime libraries, GStreamer plugins, reference apps, sample streams, models, configs. Best for running demos and deploying applications. |
-| **Triton** | `docker pull nvcr.io/nvidia/deepstream:9.0-triton-multiarch` | Everything in samples + Triton Inference Server and dependencies + development environment. Use when Triton-based inference is needed or when building custom DeepStream applications. |
-
-### Jetson (ARM64/aarch64)
-
-| Container | Pull Command | Description |
-|-----------|-------------|-------------|
-| **Samples** | `docker pull nvcr.io/nvidia/deepstream:9.0-samples-multiarch` | Runtime libraries, GStreamer plugins, reference apps, sample streams, models, configs. **Deployment only** — does not support development inside the container. |
-| **Triton** | `docker pull nvcr.io/nvidia/deepstream:9.0-triton-multiarch` | Samples contents + devel libraries + Triton Inference Server backends. |
-
-### dGPU on ARM (GH200, GB200, SBSA)
-
-| Container | Pull Command | Description |
-|-----------|-------------|-------------|
-| **Triton ARM SBSA** | `docker pull nvcr.io/nvidia/deepstream:9.0-triton-arm-sbsa` | Triton Inference Server + development environment for ARM SBSA platforms. |
-
----
-
-## Choosing the Right Image
-
-| Use Case | Recommended Image |
-|----------|-------------------|
-| Running sample apps / demos | `9.0-samples-multiarch` |
-| pyservicemaker Python applications | `9.0-triton-multiarch` |
-| Triton Inference Server required | `9.0-triton-multiarch` |
-| Custom Dockerfile base image | `9.0-samples-multiarch` (minimal) or `9.0-triton-multiarch` (with Triton) |
-
----
-
-## NGC Authentication
-
-Pulling images requires NGC authentication:
-
-```bash
-# 1. Get an API key from https://ngc.nvidia.com
-# 2. Log in to the NGC registry
-docker login nvcr.io
-# Username: $oauthtoken
-# Password: <YOUR_NGC_API_KEY>
-```
-
----
-
-## Installing pyservicemaker Inside the Container
-
-The `pyservicemaker` Python wheel is **bundled** in the container but **NOT pre-installed**. You must install it explicitly:
-
-```bash
-pip install /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl \
-    pyyaml
-```
-
-In a Dockerfile:
-
-```dockerfile
-RUN pip install --break-system-packages \
-    /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl \
-    pyyaml
-```
-
-> **Note**: The `--break-system-packages` flag is needed on Ubuntu 24.04 (Python 3.12) to install into the system Python environment. Alternatively, use a virtual environment.
-
----
-
-## Running Containers
-
-### Prerequisites
-
-1. **Docker**: Install `docker-ce` via [official instructions](https://docs.docker.com/engine/install)
-2. **NVIDIA Container Toolkit**: Install via [install guide](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html)
-3. **NVIDIA Driver**: 590+ for dGPU
-
-### Basic Run (with display)
-
-```bash
-export DISPLAY=:0
-xhost +
-
-docker run -it --rm \
-    --network=host \
-    --gpus all \
-    -e DISPLAY=$DISPLAY \
-    -v /tmp/.X11-unix/:/tmp/.X11-unix \
-    nvcr.io/nvidia/deepstream:9.0-triton-multiarch
-```
-
-### Headless Run (no display)
-
-```bash
-docker run -it --rm \
-    --gpus all \
-    nvcr.io/nvidia/deepstream:9.0-triton-multiarch
-```
-
-> For headless mode, use `fakesink` instead of `nveglglessink`/`nv3dsink` in your pipeline, or output to a file with `filesink`.
-
-### Run with Custom Video File
-
-```bash
-docker run -it --rm \
-    --gpus all \
-    -e DISPLAY=$DISPLAY \
-    -v /tmp/.X11-unix/:/tmp/.X11-unix \
-    -v /path/to/videos:/data \
-    nvcr.io/nvidia/deepstream:9.0-triton-multiarch
-```
-
----
-
-## Building Custom Docker Images
-
-Use a DeepStream image as the base for your application:
-
-```dockerfile
-FROM nvcr.io/nvidia/deepstream:9.0-triton-multiarch
-
-# Install pyservicemaker
-RUN pip install --break-system-packages \
-    /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl \
-    pyyaml
-
-# Copy application files
-WORKDIR /app
-COPY my_app.py .
-COPY my_config.yml .
-
-# Enable video driver libraries at runtime (encode/decode)
-ENV NVIDIA_DRIVER_CAPABILITIES=${NVIDIA_DRIVER_CAPABILITIES},video
-
-ENTRYPOINT ["python3", "my_app.py"]
-```
-
-### Build and Run
-
-```bash
-# Build
-docker build -t my-ds-app .
-
-# Run with display
-docker run --rm --gpus all \
-    -e DISPLAY=$DISPLAY \
-    -v /tmp/.X11-unix:/tmp/.X11-unix \
-    my-ds-app
-
-# Run with RTSP source (no display needed)
-docker run --rm --gpus all \
-    my-ds-app rtsp://camera-ip/stream
-```
-
----
-
-## Additional Packages
-
-DeepStream 9.0 containers do **not** include certain multimedia libraries by default. Install them if needed:
-
-### Audio/Codec Support
-
-```bash
-# Run the bundled install script for common multimedia packages
-/opt/nvidia/deepstream/deepstream/user_additional_install.sh
-
-# Or install specific packages manually
-apt-get install -y gstreamer1.0-libav gstreamer1.0-plugins-good \
-    gstreamer1.0-plugins-bad gstreamer1.0-plugins-ugly
-```
-
-### ffmpeg (for sample video preparation scripts)
-
-```bash
-apt-get install --reinstall libflac8 libmp3lame0 libxvidcore4 ffmpeg
-```
-
-### Kafka Support (librdkafka)
-
-```bash
-apt-get install -y librdkafka-dev
-```
-
-### MQTT Support (libmosquitto)
-
-```bash
-apt-get install -y libmosquitto1
-```
-
----
-
-## Important Paths Inside the Container
-
-| Path | Contents |
-|------|----------|
-| `/opt/nvidia/deepstream/deepstream/` | DeepStream SDK root |
-| `/opt/nvidia/deepstream/deepstream/samples/models/` | Sample models (Primary_Detector, Secondary_*, etc.) |
-| `/opt/nvidia/deepstream/deepstream/samples/streams/` | Sample video streams (e.g., `sample_1080p_h264.mp4`) |
-| `/opt/nvidia/deepstream/deepstream/samples/configs/` | Sample configuration files |
-| `/opt/nvidia/deepstream/deepstream/lib/` | DeepStream libraries (GStreamer plugins, protocol adapters) |
-| `/opt/nvidia/deepstream/deepstream/lib/gst-plugins/` | GStreamer plugin `.so` files |
-| `/opt/nvidia/deepstream/deepstream/service-maker/python/` | pyservicemaker wheel file |
-
----
-
-## Environment Variables
-
-| Variable | Purpose | Example |
-|----------|---------|---------|
-| `GST_PLUGIN_PATH` | GStreamer plugin search path | `/opt/nvidia/deepstream/deepstream/lib/gst-plugins` |
-| `LD_LIBRARY_PATH` | Shared library search path | `/opt/nvidia/deepstream/deepstream/lib:$LD_LIBRARY_PATH` |
-| `GST_DEBUG` | GStreamer debug log level | `3` (errors, warnings, and FIXMEs; use `4` for INFO) or `nvinfer:5` (plugin-specific DEBUG) |
-| `NVIDIA_DRIVER_CAPABILITIES` | GPU capabilities exposed | `${NVIDIA_DRIVER_CAPABILITIES},video` |
-| `DISPLAY` | X11 display for rendering sinks | `:0` |
-
----
-
-## Common Docker Issues
-
-### `ModuleNotFoundError: No module named 'pyservicemaker'`
-
-**Cause**: The wheel is bundled but not installed.
-
-**Fix**: Add to Dockerfile:
-```dockerfile
-RUN pip install --break-system-packages \
-    /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl \
-    pyyaml
-```
-
-### Display sinks fail with `Could not open display`
-
-**Cause**: X11 forwarding not configured.
-
-**Fix**: Pass display environment and socket:
-```bash
-docker run --rm --gpus all \
-    -e DISPLAY=$DISPLAY \
-    -v /tmp/.X11-unix:/tmp/.X11-unix \
-    my-ds-app
-```
-
-Or use `fakesink` / `filesink` for headless operation.
-
-### `Failed to load plugin ... libnvds_kafka_proto.so`
-
-**Cause**: `librdkafka` not installed (not bundled in the container).
-
-**Fix**: Add to Dockerfile:
-```dockerfile
-RUN apt-get update && apt-get install -y librdkafka-dev && rm -rf /var/lib/apt/lists/*
-```
-
-### Warning about audio decoder not available
-
-**Cause**: Multimedia codec packages are not included in DS 9.0 containers by default.
-
-**Fix**:
-```dockerfile
-RUN /opt/nvidia/deepstream/deepstream/user_additional_install.sh
-```
diff --git a/skills/deepstream/deepstream-dev/references/gstreamer_plugins.md b/skills/deepstream/deepstream-dev/references/gstreamer_plugins.md
deleted file mode 100644
index e3c7982f..00000000
--- a/skills/deepstream/deepstream-dev/references/gstreamer_plugins.md
+++ /dev/null
@@ -1,984 +0,0 @@
-# DeepStream GStreamer Plugins Overview
-
-## Introduction
-
-DeepStream provides a comprehensive set of custom GStreamer plugins optimized for NVIDIA GPUs. These plugins handle video decoding, inference, tracking, visualization, and various other video analytics tasks. Understanding these plugins is crucial for building effective DeepStream applications.
-
-## Plugin Categories
-
-### Source Plugins
-Plugins that generate or capture video data from various sources.
-
-### Processing Plugins
-Plugins that transform, analyze, or process video data.
-
-### Sink Plugins
-Plugins that output video to displays, files, or network destinations.
-
----
-
-## Source Plugins
-
-### nvv4l2decoder
-**Purpose**: Hardware-accelerated video decoder using NVIDIA V4L2 API (from nvvideo4linux2 plugin)
-
-**Key Properties**:
-- `capture-io-mode`: I/O mode for the V4L2 capture plane, i.e. the decoded output on the src pad (`auto`, `mmap`, `dmabuf-import`)
-- `output-io-mode`: I/O mode for the V4L2 output plane, i.e. the encoded input on the sink pad (`auto`, `mmap`, `dmabuf-import`)
-- `cudadec-memtype`: CUDA buffer memory type (`memtype_device`, `memtype_pinned`, `memtype_unified`)
-- `gpu-id`: GPU device ID used for decoding
-- `drop-frame-interval`: Interval for dropping frames (0 keeps all frames)
-- `num-extra-surfaces`: Additional decode surfaces to allocate
-- `disable-dpb`: Disable DPB buffers to reduce latency
-- `low-latency-mode`: Enable low-latency decoding for I/IPPP streams
-- `skip-frames`: Frame skipping policy (`decode_all`, `decode_non_ref`, `decode_key`)
-- `device`: Decoder device path (read-only, default `/dev/nvidia0`)
-
-**Usage**:
-```bash
-nvv4l2decoder output-io-mode=0 drop-frame-interval=0
-```
-
-**Common Pipeline Pattern**:
-```
-h264parse ! nvv4l2decoder ! ...
-```
-
-**Output Format**:
-- Outputs `video/x-raw(memory:NVMM)` - GPU memory format
-- This is already in NVMM format, so NO nvvideoconvert is needed before nvstreammux
-
-**Notes**:
-- Essential for GPU-accelerated pipelines
-- Supports H.264, H.265, VP8, VP9 codecs with zero-copy memory transfers
-- Output is already in NVMM memory, compatible with nvstreammux and other DeepStream plugins
-
----
-
-### nvurisrcbin
-**Purpose**: Source bin for handling URI-based sources (files, RTSP, HTTP)
-
-**Key Properties**:
-- `uri`: Source URI (file://, rtsp://, http://, etc.)
-- `num-buffers`: Number of buffers to process
-- `drop-on-latency`: Drop frames on latency
-
-**Usage**:
-```bash
-nvurisrcbin uri=file:///path/to/video.mp4
-```
-
-**Common Pipeline Pattern**:
-```
-nvurisrcbin uri=rtsp://camera-ip/stream ! ...
-```
-
-**Notes**:
-- Automatically handles demuxing and parsing for multiple protocols and formats
-
----
-
-### nvmultiurisrcbin
-**Purpose**: Source bin with built-in REST API server for dynamic multi-stream management
-
-**Key Properties**:
-| Property | Type | Description |
-|----------|------|-------------|
-| `uri-list` | string | Comma-separated list of initial URIs |
-| `sensor-id-list` | string | Comma-separated sensor IDs (maps 1:1 with uri-list) |
-| `sensor-name-list` | string | Comma-separated sensor names |
-| `ip-address` | string | REST API server IP (default: localhost) |
-| `port` | int | REST API server port (default: 9000, 0 to disable) |
-| `max-batch-size` | int | Maximum number of sources |
-| `batched-push-timeout` | int | Timeout in microseconds to push batch |
-| `live-source` | int | Set to 1 for live/dynamic sources (REQUIRED) |
-| `drop-pipeline-eos` | int | Set to 1 to keep pipeline alive when sources removed |
-| `async-handling` | int | Set to 1 for async state changes |
-| `select-rtp-protocol` | int | 0=UDP+TCP auto, 4=TCP only |
-| `latency` | int | Jitterbuffer size in ms for RTSP |
-
-**Built-in REST API Endpoints**:
-- `POST /api/v1/stream/add` - Add a stream dynamically
-- `POST /api/v1/stream/remove` - Remove a stream
-- `GET /api/v1/stream/get-stream-info` - Get current streams
-
-**Usage**:
-```python
-# Pipeline with built-in REST server on port 9000
-pipeline.add("nvmultiurisrcbin", "src", {
-    "port": 9000,
-    "max-batch-size": 16,
-    "live-source": 1,
-    "drop-pipeline-eos": 1,
-    "async-handling": 1,
-})
-# REST API automatically available at http://localhost:9000/api/v1/
-```
-
-**⚠️ CRITICAL for Dynamic Sources**:
-When using dynamic source addition, the sink element MUST have `async=0`:
-```python
-pipeline.add("nveglglessink", "sink", {
-    "sync": 0,
-    "qos": 0,
-    "async": 0  # CRITICAL - prevents state transition deadlock
-})
-```
-
-**Notes**:
-- Integrates nvds_rest_server, nvurisrcbin, and nvstreammux in one bin
-- Do NOT implement custom Flask/FastAPI server - use built-in REST API
-- See `rest_api_dynamic.md` for complete REST API documentation
-
----
-
-### nvdsdynamicsrcbin
-**Purpose**: Source bin for programmatically adding and removing file/URI-based video sources at runtime. Unlike `nvmultiurisrcbin` (REST API / config-driven), `nvdsdynamicsrcbin` is controlled entirely through code using `SourceManager`.
-
-**CRITICAL**: `nvdsdynamicsrcbin` does **not** manage sources on its own. You **must** use `SourceManager` from `pyservicemaker._pydeepstream.signal` to add, remove, and terminate sources. Without `SourceManager`, the bin has no way to receive source URIs.
-
-**Key Properties**:
-| Property | Type | Default | Description |
-|----------|------|---------|-------------|
-| `gpu-id` | uint | 0 | GPU Device ID to use for decoding |
-| `message-forward` | bool | False | Forward all children messages to the pipeline bus (required for EOS detection) |
-| `async-handling` | bool | False | Handle asynchronous state changes internally |
-| `current-file` | string (read-only) | null | Currently processing file path |
-| `current-id` | int (read-only) | -1 | ID of the chunk currently being processed |
-
-**Element Actions** (triggered via `SourceManager`):
-| Action | Description |
-|--------|-------------|
-| `add-source` | Add a new file/URI source to the bin |
-| `remove-source` | Remove a source by its unique ID |
-| `terminate` | Signal no more sources will be added; sends EOS after all finish |
-
-**Internal Children**: Contains `parsebin`, `queue_parsebin`, and `decoder` — it automatically parses and decodes the added sources.
-
----
-
-### v4l2src
-**Purpose**: Video4Linux2 source for USB cameras
-
-**Key Properties**:
-- `device`: Device path (e.g., `/dev/video0`)
-- `io-mode`: I/O mode
-- `do-timestamp`: Enable timestamping
-
-**Usage**:
-```bash
-v4l2src device=/dev/video0 ! ...
-```
-
-**Notes**:
-- Standard GStreamer plugin for USB webcams, may require format conversion
-
----
-
-### nvarguscamerasrc
-**Purpose**: NVIDIA camera source for Jetson CSI cameras
-
-**Key Properties**:
-- `sensor-id`: Sensor ID (0, 1, etc.)
-- `sensor-mode`: Sensor mode
-- `wbmode`: White balance mode
-- `exposuretimerange`: Exposure time range
-- `gainrange`: Gain range
-
-**Usage**:
-```bash
-nvarguscamerasrc sensor-id=0 ! ...
-```
-
-**Notes**:
-- Jetson-specific plugin optimized for CSI cameras with hardware-accelerated capture
-
----
-
-## Processing Plugins
-
-### nvstreammux
-**Purpose**: Batches multiple video streams into a single batch for efficient inference
-
-**IMPORTANT**: There are TWO versions of nvstreammux:
-- **OLD nvstreammux**: Default, uses GObject properties for configuration
-- **NEW nvstreammux**: Enabled with `USE_NEW_NVSTREAMMUX=yes`, uses config file for advanced settings
-
-**Key Properties (NEW nvstreammux - RECOMMENDED)**:
-- `batch-size`: Maximum number of buffers in a batch
-- `batched-push-timeout`: Timeout for batching in microseconds (default: 33000)
-- `config-file-path`: Path to configuration file for advanced settings
-- `num-surfaces-per-frame`: Number of surfaces per frame
-- `attach-sys-ts`: Attach system timestamp as NTP timestamp (boolean)
-- `max-latency`: Maximum latency in live mode (nanoseconds)
-- `sync-inputs`: Force synchronization of input frames (boolean)
-- `frame-num-reset-on-eos`: Reset frame numbers on EOS (boolean)
-- `frame-num-reset-on-stream-reset`: Reset frame numbers on stream reset (boolean)
-- `frame-duration`: Duration of input frames in milliseconds for NTP correction
-- `drop-pipeline-eos`: Don't propagate EOS downstream when all pads are at EOS (boolean)
-
-**Key Properties (OLD nvstreammux - Legacy)**:
-- `batch-size`: Number of streams to batch
-- `width`: Output batch width
-- `height`: Output batch height
-- `gpu-id`: GPU ID for processing
-- `batched-push-timeout`: Timeout for batching (microseconds)
-- `enable-padding`: Enable padding for different resolutions
-- `nvbuf-memory-type`: Memory type (0=platform default, 1=CUDA pinned, 2=CUDA device, 3=CUDA unified)
-
-**Usage**:
-```bash
-nvstreammux name=m batch-size=4 width=1920 height=1080
-```
-
-**Common Pipeline Pattern**:
-```
-source1 ! m.sink_0 source2 ! m.sink_1 nvstreammux name=m batch-size=2 ! ...
-```
-
-**Notes**:
-- **Critical plugin** for multi-stream applications
-- **NEW nvstreammux** (recommended): More flexible, uses config file for width/height/memory-type settings
-- **OLD nvstreammux**: Uses GObject properties for width/height, may be deprecated in future
-- To use NEW version: Set environment variable `USE_NEW_NVSTREAMMUX=yes` before running pipeline
-- Batch size should match number of input streams
-- NEW version infers output resolution from downstream elements or uses config file
-
----
-
-### nvstreamdemux
-**Purpose**: Demultiplexes batched streams back to individual streams
-
-**Key Properties**:
-- `name`: Element name (required for pad access)
-
-**Usage**:
-```bash
-nvstreamdemux name=d
-```
-
-**Common Pipeline Pattern**:
-```
-nvstreammux name=m ! ... ! nvstreamdemux name=d d.src_0 ! ... d.src_1 ! ...
-```
-
-**Notes**:
-- Used after processing batched streams
-- Provides separate source pads for each stream
-- Essential for per-stream rendering or processing
-
----
-
-### nvinfer
-**Purpose**: TensorRT-based inference engine for deep learning models
-
-**Key Properties**:
-- `config-file-path`: Path to inference configuration file (supports **both** INI-style text format and YAML format)
-- `batch-size`: Batch size for inference
-- `gpu-id`: GPU ID for inference
-- `unique-id`: Unique identifier for this inference instance
-- `process-mode`: Infer processing mode (primary or secondary)
-- `interval`: Number of consecutive batches to skip for inference
-- `infer-on-gie-id`: Infer on metadata from GIE with this unique ID (-1 for all)
-- `infer-on-class-ids`: Operate on objects with specified class IDs
-- `filter-out-class-ids`: Ignore metadata for objects of specified class IDs
-- `model-engine-file`: Path to pre-generated TensorRT engine file
-- `output-tensor-meta`: Output raw tensor metadata (0=no, 1=yes)
-- `output-instance-mask`: Output instance mask in metadata (0=no, 1=yes)
-- `input-tensor-meta`: Use tensor metadata from upstream (0=no, 1=yes)
-- `clip-object-outside-roi`: Clip object bbox outside ROI from nvdspreprocess
-- `crop-objects-to-roi-boundary`: Crop object bbox to ROI boundary
-- `raw-output-file-write`: Write raw inference output to file
-- `raw-output-generated-callback`: Callback for raw output
-- `raw-output-generated-userdata`: Userdata for raw output callback
-
-**Configuration File Structure**:
-
-nvinfer supports **two configuration formats**:
-
-### Format 1: YAML Format (Recommended)
-
-```yaml
-# Example: pgie_config.yml (Primary detector using ResNet18)
-property:
-  gpu-id: 0
-  net-scale-factor: 0.00392156862745098
-  # Use ResNet18 TrafficCamNet model from DeepStream samples
-  onnx-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-  labelfile-path: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-  batch-size: 1
-  process-mode: 1
-  model-color-format: 0
-  # 0=FP32, 1=INT8, 2=FP16
-  network-mode: 2
-  num-detected-classes: 4
-  interval: 0
-  gie-unique-id: 1
-  # 0=GroupRectangles, 1=DBSCAN, 2=NMS, 3=DBSCAN+NMS hybrid, 4=None
-  cluster-mode: 2
-
-class-attrs-all:
-  topk: 20
-  nms-iou-threshold: 0.5
-  pre-cluster-threshold: 0.2
-```
-
-### Format 2: INI-style Text Format
-
-```ini
-# Example: pgie_config.txt (Primary detector using ResNet18)
-[property]
-gpu-id=0
-net-scale-factor=0.00392156862745098
-onnx-file=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-labelfile-path=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-batch-size=1
-process-mode=1
-model-color-format=0
-network-mode=2
-num-detected-classes=4
-interval=0
-gie-unique-id=1
-cluster-mode=2
-
-[class-attrs-all]
-topk=20
-nms-iou-threshold=0.5
-pre-cluster-threshold=0.2
-```
-
-**Key Differences**:
-| Aspect | YAML Format | INI Format |
-|--------|-------------|------------|
-| File extension | `.yml` or `.yaml` | `.txt` |
-| Section headers | `property:` (no brackets) | `[property]` (with brackets) |
-| Key-value separator | `: ` (colon + space) | `=` (equals) |
-| Indentation | Required for nested values | Not used |
-
-**Usage**:
-```bash
-nvinfer config-file-path=/path/to/config.yml batch-size=4
-```
-
-**Common Pipeline Pattern**:
-```
-nvstreammux ! nvinfer config-file-path=pgie_config.txt ! ...
-```
-
-**Notes**:
-- **Primary inference engine** for object detection/classification
-- Supports TensorRT engines (.trt), ONNX models, and custom networks
-- Can be used as Primary GIE (PGIE) or Secondary GIE (SGIE)
-- Multiple instances can be cascaded for complex models
-- `output-tensor-meta=1` enables custom postprocessing
-- `input-tensor-meta=1` uses preprocessed tensors from nvdspreprocess
-- **Note**: `enable-dbscan` is DEPRECATED and is a config file parameter, not a GObject property
-
----
-
-### nvinferserver
-**Purpose**: Inference using Triton Inference Server backend
-
-**Key Properties**:
-- `config-file-path`: Path to Triton configuration file
-- `gpu-id`: GPU ID
-- `unique-id`: Unique identifier
-- `output-tensor-meta`: Output tensor metadata
-
-**Usage**:
-```bash
-nvinferserver config-file-path=/path/to/triton_config.txt
-```
-
-**Notes**:
-- Alternative to nvinfer for Triton-based inference
-- Supports remote inference servers
-- Better for scalable deployments
-- Requires Triton Inference Server setup
-
----
-
-### nvdspreprocess
-**Purpose**: Custom preprocessing plugin for region-of-interest (ROI) preprocessing
-
-**Key Properties**:
-- `config-file`: Path to preprocessing configuration file
-- `gpu-id`: GPU ID
-
-**Configuration File Structure**:
-```yaml
-preprocess-config:
-  - preprocess-group:
-      target-unique-ids: [1]
-      roi-params-src: [0]
-      process-on-roi: 1
-      network-input-shape: [1, 3, 544, 960]
-      tensor-format: 0  # 0=NCHW, 1=NHWC
-      maintain-aspect-ratio: 0
-      custom-transform-function: "custom_transform"
-      custom-tensor-prep-function: "custom_tensor_prep"
-```
-
-**Usage**:
-```bash
-nvdspreprocess config-file=/path/to/preprocess_config.yml
-```
-
-**Common Pipeline Pattern**:
-```
-nvstreammux ! nvdspreprocess config-file=preprocess.yml ! nvinfer input-tensor-meta=1 ! ...
-```
-
-**Notes**:
-- Enables custom preprocessing before inference
-- Processes ROIs or full frames
-- Outputs tensor metadata for nvinfer
-- Custom preprocessing library and functions are specified in the **config file**, not as GObject properties
-- Optimal performance: batch-size should match total units in config
-
----
-
-### nvdspostprocess
-**Purpose**: Custom postprocessing plugin for parsing model outputs
-
-**Key Properties**:
-- `postprocesslib-name`: Path to postprocessing library (.so)
-- `postprocesslib-config-file`: Path to postprocessing configuration file
-- `gpu-id`: GPU ID
-
-**Configuration File Structure** (YAML):
-```yaml
-postprocess-config:
-  - postprocess-group:
-      target-unique-ids: [1]
-      custom-parse-function: "custom_parse"
-      custom-bbox-parse-function: "custom_bbox_parse"
-      output-format: 0  # 0=object detection, 1=classification
-```
-
-**Usage**:
-```bash
-nvdspostprocess postprocesslib-name=./libpostprocess.so postprocesslib-config-file=config.yml
-```
-
-**Common Pipeline Pattern**:
-```
-nvinfer output-tensor-meta=1 ! nvdspostprocess postprocesslib-name=... ! ...
-```
-
-**Notes**:
-- Parses raw tensor outputs from nvinfer
-- Requires nvinfer with output-tensor-meta=1
-- Supports custom parsing functions
-- Used for models not supported by nvinfer's built-in parsers
-
----
-
-### nvtracker
-**Purpose**: Multi-object tracker for tracking objects across frames
-
-**Key Properties**:
-- `ll-lib-file`: Path to low-level tracker library (.so)
-- `ll-config-file`: Path to tracker configuration file
-- `tracker-width`: Tracker input width
-- `tracker-height`: Tracker input height
-- `gpu-id`: GPU ID
-- `input-tensor-meta`: Use tensor metadata (0=no, 1=yes)
-- `tensor-meta-gie-id`: GIE ID for tensor metadata (used with input-tensor-meta)
-- `display-tracking-id`: Display tracking ID in object text
-- `tracking-id-reset-mode`: Tracking ID reset mode on stream reset/EOS
-- `tracking-surface-type`: Selective tracking surface type
-- `user-meta-pool-size`: Tracker user metadata buffer pool size
-- `sub-batches`: Configuration of sub-batches for parallel processing
-- `sub-batch-err-recovery-trial-cnt`: Max trials to reinitialize tracker on error
-
-**Configuration File Structure**:
-```yaml
-tracker:
-  ll-lib-file: /path/to/libnvds_nvmultiobjecttracker.so
-  ll-config-file: /path/to/tracker_config.yml
-  enable-batch-process: 1
-  enable-past-frame: 1
-  tracker-width: 1920
-  tracker-height: 1080
-```
-
-**Usage**:
-```bash
-nvtracker ll-lib-file=/path/to/libnvds_nvmultiobjecttracker.so ll-config-file=/path/to/config.yml
-```
-
-**Common Pipeline Pattern**:
-```
-nvinfer ! nvtracker ll-lib-file=... ! ...
-```
-
-**Notes**:
-- Tracks objects across video frames
-- Assigns unique tracking IDs to objects
-- Supports multiple tracking algorithms
-- Requires object metadata from inference engine
-- Tracker dimensions should match preprocess/infer dimensions when using input-tensor-meta=1
-
----
-
-### nvdsosd (nvosdbin)
-**Purpose**: On-Screen Display element (`nvdsosd`) and DeepStream convenience bin (`nvosdbin`) for drawing bounding boxes, labels, masks, and clocks
-
-**Key Properties**:
-- `gpu-id`: GPU ID to render on
-- `process-mode`: Rendering backend (0=CPU, 1=GPU)
-- `display-text`: Enable text overlay (boolean)
-- `display-bbox`: Enable bounding box display (boolean)
-- `display-mask`: Enable instance mask display (boolean)
-- `display-clock`: Enable clock display (boolean)
-- `clock-font`: Font for clock text
-- `clock-font-size`: Font size for clock
-- `x-clock-offset`: X offset for clock position
-- `y-clock-offset`: Y offset for clock position
-- `clock-color`: Clock color (RGBA as uint)
-- `blur-bbox`: Enable bbox blurring (boolean)
-- `blur-on-gie-class-ids`: Blur bboxes for specific GIE unique ID and class ID
-
-**Note**: Text and bbox styling properties (like colors, borders) are controlled through object metadata, not as GObject properties on the plugin itself.
-
-**Usage**:
-```bash
-nvdsosd display-text=1 display-bbox=1
-```
-
-**Common Pipeline Pattern**:
-```
-nvtracker ! nvdsosd ! ...
-```
-
-**Notes**:
-- Use `nvdsosd` for the raw transform element
-- Supports tracking ID display, text overlays, and optional blur/clocks
-- Keeps surfaces in NVMM for zero-copy rendering on GPU
-- Object-specific styling (text colors, bbox colors, etc.) is set through NvDsMeta object metadata, not plugin properties
-
----
-
-### nvmultistreamtiler
-**Purpose**: Tiles multiple video streams into a single output frame
-
-**Key Properties**:
-- `width`: Output width
-- `height`: Output height
-- `rows`: Number of rows in tile layout
-- `columns`: Number of columns in tile layout
-- `gpu-id`: GPU ID
-- `show-source`: Show source index (0=no, 1=yes)
-
-**Usage**:
-```bash
-nvmultistreamtiler width=1920 height=1080 rows=2 columns=2
-```
-
-**Common Pipeline Pattern**:
-```
-nvstreammux name=m batch-size=4 ! nvinfer ! nvmultistreamtiler rows=2 columns=2 ! nvdsosd ! ...
-```
-
-**Notes**:
-- Combines multiple streams into a grid layout, useful for multi-stream visualization
-
----
-
-### nvvideoconvert
-**Purpose**: Video format converter (color space conversion, scaling)
-
-**Key Properties**:
-- `gpu-id`: GPU ID
-- `nvbuf-memory-type`: Memory type
-- `src-crop`: Source crop rectangle
-- `dest-crop`: Destination crop rectangle
-
-**Usage**:
-```bash
-nvvideoconvert gpu-id=0
-```
-
-**Common Pipeline Pattern**:
-```
-nvdsosd ! nvvideoconvert ! nveglglessink
-```
-
-**Notes**:
-- GPU-accelerated color format conversion (NV12, RGBA, etc.), often needed before rendering sinks
-
----
-
-### nvdsanalytics
-**Purpose**: Video analytics plugin for ROI filtering, overcrowding detection, line crossing, and direction detection
-
-**Key Properties**:
-- `config-file`: Path to analytics configuration file
-- `enable`: Enable analytics (0=no, 1=yes)
-- `gpu-id`: GPU ID
-
-**Configuration File Parameters**:
-The config file **must** include a `property` group. Other groups define per-stream ROI, line-crossing, overcrowding, and direction rules. The stream index is the numeric suffix of the group name (e.g. `roi-filtering-stream-0` for stream 0).
-- `property`: General group; mandatory.
-  - `config-width`, `config-height`: Reference resolution (width and height) used to scale analytics coordinates.
-  - `enable`: Whether analytics is enabled (aligned with the element's `enable` property).
-  - `display-font-size`: Optional; OSD font size.
-  - `osd-mode`: Optional; 0, 1, or 2. 0 = OSD off, 1 = labels only, 2 = full (default).
-  - `obj-cnt-win-in-ms`: Optional; object-count time window in milliseconds; range 1–1000000000.
-  - `display-obj-cnt`: Optional; whether to show per-class object counts on OSD.
-- `roi-filtering-stream-<stream_id>`: ROI filtering group, one per stream.
-  - `enable`: Enable ROI filtering for this stream.
-  - `class-id`: Class IDs to include in ROI analytics (semicolon-separated integer list).
-  - `inverse-roi`: Whether to invert the ROI, so that objects outside the polygon are counted/filtered instead of those inside.
-  - `roi-<label>`: ROI coordinates as polygon vertices: `x1;y1;x2;y2;...` (an even number of integers). `<label>` is a custom name for the ROI.
-- `overcrowding-stream-<stream_id>`: Overcrowding group, one per stream; sets the object count and duration thresholds for ROIs.
-  - `enable`: Enable overcrowding analysis for this stream.
-  - `class-id`: Class IDs to count for overcrowding (semicolon-separated integer list).
-  - `object-threshold`: Object count threshold for overcrowding.
-  - `time-threshold`: Duration threshold in milliseconds.
-  - `roi-<label>`: Polygon vertices for the overcrowding region: `x1;y1;x2;y2;...`. `<label>` is a custom name for the ROI.
-- `line-crossing-stream-<stream_id>`: Line-crossing group, one per stream; counts objects crossing each line.
-  - `enable`: Enable line-crossing counting for this stream.
-  - `extended`: Whether to use extended line-crossing logic.
-  - `class-id`: Class IDs to count for line crossing (semicolon-separated integer list).
-  - `line-crossing-<label>`: **8 integers:** direction vector (x1,y1,x2,y2) then line (x1,y1,x2,y2). Coordinates are relative to config-width/config-height. `<label>` is a custom name for the line.
-  - `mode`: Detection strictness options: `strict`, `balanced`, or `loose`.
-- `direction-detection-stream-<stream_id>`: Direction-detection group, one per stream; defines reference direction vectors for judging object movement.
-  - `enable`: Enable direction detection for this stream.
-  - `class-id`: Class IDs of the objects that need direction detection.
-  - `direction-<label>`: **4 integers:** direction vector `x1;y1;x2;y2`. `<label>` is a custom name for the direction.
-  - `mode`: Direction detection mode options: `strict`, `balanced`, or `loose`.
-
-**Notes**:
-- `<stream_id>` must match the source ID of the stream, which is given by the index of its nvstreammux sink pad.
-- Each `roi-<label>` defines one ROI; multiple ROIs per stream are allowed.
-- Each `line-crossing-<label>` defines one line; multiple lines per stream are allowed.
-- Each `direction-<label>` defines one reference direction; multiple directions per stream are allowed.
-
-**Configuration File Samples**:
-Configuration files come in two formats: `.txt` and `.yml`.
-- YAML format:
-```yaml
-property:
-  enable: 1
-  config-width: 1920
-  config-height: 1080
-  display-font-size: 12
-  osd-mode: 2
-roi-filtering-stream-0:
-  enable: 1
-  class-id: -1
-  roi-DOOR: 256;639;675;83;876;224;926;482;866;741
-overcrowding-stream-0:
-  enable: 1
-  class-id: 1;2
-  object-threshold: 1000
-  roi-ENTRANCE: 282;347;987;843
-line-crossing-stream-0:
-  enable: 1
-  line-crossing-Exit: 789;672;1084;900;851;773;1203;732
-  class-id: 0
-  mode: loose
-direction-detection-stream-0:
-  enable: 1
-  direction-South: 284;840;360;662
-  class-id: 0
-```
-- TXT format:
-```txt
-[property]
-enable=1
-config-width=1920
-config-height=1080
-osd-mode=2
-display-font-size=12
-
-[roi-filtering-stream-0]
-enable=1
-roi-RF=256;639;675;83;876;224;926;482;866;741
-inverse-roi=0
-class-id=-1
-
-[overcrowding-stream-1]
-enable=1
-roi-OC=282;347;987;843
-object-threshold=3
-class-id=-1
-
-[line-crossing-stream-0]
-enable=1
-line-crossing-Exit=789;672;1084;900;851;773;1203;732
-class-id=0
-mode=loose
-
-[direction-detection-stream-0]
-enable=1
-direction-South=284;840;360;662
-class-id=0
-```
-
-**Usage**:
-```bash
-nvdsanalytics config-file=/path/to/analytics_config.yml
-```
-
-**Notes**:
-- Performs ROI filtering, overcrowding, line-crossing, and direction detection; requires a configuration file
-
----
-
-### nvmsgbroker
-**Purpose**: Message broker plugin for sending metadata to cloud services
-
-**IMPORTANT**: `nvmsgbroker` is a **SINK component** that terminates the pipeline branch. It cannot have downstream components. If you need both message broker output and display, use `tee` to split the pipeline.
-
-**Key Properties**:
-- `proto-lib`: Path to protocol library (.so)
-- `conn-str`: Connection string
-- `config-file`: Configuration file path
-- `topic`: Topic name (for Kafka/MQTT)
-- `sync`: Synchronous mode (0=async, 1=sync)
-
-**Usage**:
-```bash
-nvmsgbroker proto-lib=/path/to/libnvds_kafka_proto.so conn-str=localhost:9092 topic=analytics
-```
-
-**Pipeline Patterns**:
-```bash
-# Headless (Kafka only)
-tracker ! nvmsgconv ! nvmsgbroker
-
-# With display (use tee)
-tracker ! tee name=t
-t. ! queue ! nvmsgconv ! nvmsgbroker
-t. ! queue ! tiler ! osd ! converter ! sink
-```
-
-**Notes**:
-- **SINK component**: Terminates pipeline branch, cannot have downstream elements
-- Sends metadata to cloud services
-- Supports Kafka, MQTT, Azure, Redis, AMQP
-- Requires protocol-specific library
-- Can send object metadata, frame metadata, etc.
-- For pipelines requiring both Kafka and display, use `tee` to create separate branches
-
----
-
-### nvmsgconv
-**Purpose**: Message converter plugin for transforming metadata formats
-
-**Key Properties**:
-- `msg2p-lib`: Absolute path to the payload generation library
-- `payload-type`: Payload type (0=DeepStream full schema, 1=DeepStream minimal schema, 257=custom)
-- `msg2p-newapi`: Use new API which supports multiple payloads (boolean)
-- `frame-interval`: Interval for frame-level metadata generation
-- `debug-payload-dir`: Directory to dump generated payloads for debugging
-
-**Usage**:
-```bash
-nvmsgconv config=/path/to/msgconv_config.txt payload-type=0
-```
-
-**Notes**:
-- Converts metadata to different formats
-- Used before nvmsgbroker
-- Supports custom schemas
-
----
-
-## Sink Plugins
-
-### nveglglessink
-**Purpose**: EGL/GLES-based video renderer for x86_64 platforms
-
-**Key Properties**:
-- `sync`: Synchronize rendering to the pipeline clock (0=no, 1=yes)
-- `window-x`: Window X position
-- `window-y`: Window Y position
-- `window-width`: Window width
-- `window-height`: Window height
-- `display-id`: Display ID
-
-**Usage**:
-```bash
-nveglglessink sync=1
-```
-
-**Notes**:
-- For x86_64 desktop/server platforms with hardware-accelerated rendering
-
----
-
-### nv3dsink
-**Purpose**: 3D video renderer for Jetson platforms
-
-**Key Properties**:
-- `sync`: Synchronize rendering to the pipeline clock (0=no, 1=yes)
-- `window-x`: Window X position
-- `window-y`: Window Y position
-- `window-width`: Window width
-- `window-height`: Window height
-
-**Usage**:
-```bash
-nv3dsink sync=1
-```
-
-**Notes**:
-- For ARM64/Jetson platforms with hardware-accelerated rendering
-
----
-
-### nvvideoconvert + filesink
-**Purpose**: Save processed video to file
-
-**Usage**:
-```bash
-nvvideoconvert ! x264enc ! mp4mux ! filesink location=output.mp4
-```
-
-**Notes**:
-- Requires encoding before saving
-- Can use hardware encoders (nvv4l2h264enc, nvv4l2h265enc)
-
----
-
-## Standard GStreamer Plugins Used in DeepStream
-
-### h264parse / h265parse
-**Purpose**: Parse H.264/H.265 video streams
-
-**Usage**:
-```bash
-h264parse
-```
-
-### queue
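-**Purpose**: Buffers data between elements and decouples upstream and downstream processing into separate threads
-
-**Key Properties**:
-- `max-size-buffers`: Maximum number of buffers queued (0 disables the limit)
-- `max-size-time`: Maximum amount of queued data, in nanoseconds (0 disables the limit)
-- `leaky`: Where the queue drops buffers when full (`no`, `upstream`, `downstream`)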
-
-**Usage**:
-```bash
-queue max-size-buffers=200
-```
-
-### tee
-**Purpose**: Split pipeline into multiple branches
-
-**Usage**:
-```bash
-tee name=t t. ! queue ! ... t. ! queue ! ...
-```
-
----
-
-## Plugin Selection Guidelines
-
-### For Video Sources:
-- **Files**: `nvurisrcbin` or `filesrc` + `qtdemux` + `h264parse`
-- **RTSP Streams**: `nvurisrcbin` with `rtsp://` URI
-- **Dynamic sources (REST API)**: `nvmultiurisrcbin` — config/REST-driven multi-stream
-- **Dynamic sources (programmatic)**: `nvdsdynamicsrcbin` + `SourceManager` — script-driven add/remove
-- **USB Cameras**: `v4l2src`
-- **Jetson CSI Cameras**: `nvarguscamerasrc`
-
-### For Decoding:
-- **Always use**: `nvv4l2decoder` for hardware acceleration
-- **Avoid**: Software decoders (avdec_h264, etc.) for performance
-
-### For Multi-Stream:
-- **Always use**: `nvstreammux` to batch streams
-- **Batch size**: Match number of input streams
-- **Use**: `nvstreamdemux` after processing to split streams
-
-### For Inference:
-- **Primary**: `nvinfer` for TensorRT-based inference
-- **Alternative**: `nvinferserver` for Triton-based inference
-- **Custom preprocessing**: `nvdspreprocess` before inference
-- **Custom postprocessing**: `nvdspostprocess` after inference
-
-### For Tracking:
-- **Use**: `nvtracker` after primary inference
-- **Configure**: Tracker dimensions to match inference input
-
-### For Visualization:
-- **Use**: `nvdsosd` for drawing bounding boxes and labels
-- **Use**: `nvmultistreamtiler` for multi-stream display
-- **Use**: `nvvideoconvert` before rendering sinks
-
-### For Rendering:
-- **x86_64**: `nveglglessink`
-- **Jetson**: `nv3dsink`
-- **File output**: `nvvideoconvert` + encoder + `filesink`
-
----
-
-## Common Pipeline Patterns
-
-### Single Stream with Detection:
-```
-filesrc ! h264parse ! nvv4l2decoder ! nvstreammux batch-size=1 ! 
-nvinfer config-file-path=pgie.yml ! nvtracker ! nvdsosd ! 
-nvvideoconvert ! nveglglessink
-```
-
-### Multi-Stream with Detection:
-```
-stream1 ! m.sink_0 stream2 ! m.sink_1 
-nvstreammux name=m batch-size=2 ! nvinfer ! nvtracker ! 
-nvstreamdemux name=d d.src_0 ! nvdsosd ! sink1 d.src_1 ! nvdsosd ! sink2
-```
-
-### Cascaded Inference (Primary + Secondary):
-```
-nvstreammux ! nvinfer config-file-path=pgie_config.txt ! nvtracker ! 
-nvinfer config-file-path=sgie1_config.txt ! nvinfer config-file-path=sgie2_config.txt ! 
-nvdsosd ! sink
-```
-
-### Custom Preprocessing + Inference:
-```
-nvstreammux ! nvdspreprocess config-file=preprocess_config.txt ! 
-nvinfer input-tensor-meta=1 config-file-path=infer_config.txt ! 
-nvdspostprocess postprocesslib-name=... ! nvdsosd ! sink
-```
-
-### Multi-Stream with Analytics and Cloud:
-```
-streams ! nvstreammux ! nvinfer ! nvtracker ! nvdsanalytics ! tee name=t 
-t. ! queue ! nvmsgconv ! nvmsgbroker proto-lib=... conn-str=... 
-t. ! queue ! nvstreamdemux ! nvdsosd ! sink
-```
-
----
-
-## Performance Optimization Tips
-
-1. **Batch Size**: Use appropriate batch sizes (typically 1-8) based on GPU memory
-2. **Resolution**: Match stream resolution to model input requirements
-3. **Memory Type**: Keep buffers in NVMM (GPU) memory between elements to avoid copies; set `nvbuf-memory-type` only when an element needs a specific CUDA memory type
-4. **Inference Precision**: Use FP16 or INT8 for better performance
-5. **Pipeline Parallelism**: Run multiple pipelines on different GPUs
-6. **Buffer Management**: Configure queue sizes appropriately
-7. **Tracker Configuration**: Match tracker dimensions to inference dimensions
-
----
-
-## Error Handling and Debugging
-
-1. **Check Plugin Availability**: Use `gst-inspect-1.0 nvinfer` to verify plugins
-2. **Enable Debugging**: Set `GST_DEBUG=3` to log errors, warnings, and FIXME messages; raise it to 4 (INFO) or 5 (DEBUG) for more detail
-3. **Check Metadata**: Use probes to inspect metadata at pipeline points
-4. **Memory Issues**: Monitor GPU memory usage with `nvidia-smi`
-5. **Pipeline State**: Check pipeline state transitions (NULL → READY → PAUSED → PLAYING)
-
----
-
-This comprehensive overview should help you understand and use DeepStream plugins effectively in your applications.
-
diff --git a/skills/deepstream/deepstream-dev/references/kafka_messaging.md b/skills/deepstream/deepstream-dev/references/kafka_messaging.md
deleted file mode 100644
index fe9ae79e..00000000
--- a/skills/deepstream/deepstream-dev/references/kafka_messaging.md
+++ /dev/null
@@ -1,1843 +0,0 @@
-# Kafka and Message Broker Integration
-
-## Overview
-
-This document is a comprehensive reference for integrating DeepStream applications with external message brokers. It covers two complementary areas:
-
-- **Part 1 -- Kafka Integration Use Cases and Patterns**: Pipeline architectures for streaming analytics data to Apache Kafka, including native `nvmsgbroker` pipelines, Python Kafka producer probes, multi-topic integration, error handling, and performance optimization.
-- **Part 2 -- Message Broker and Converter Configuration Reference**: Detailed property tables and configuration file formats for the `nvmsgconv` and `nvmsgbroker` GStreamer plugins, protocol adaptor libraries (Kafka, MQTT, Redis, AMQP, Azure IoT), payload schemas, and troubleshooting guidance.
-
----
-
-# Part 1: Kafka Integration Use Cases and Patterns
-
-## Use Case Requirements
-
-- Process video streams with AI inference
-- Extract object detection and tracking metadata
-- Stream metadata to Kafka topics
-- Support multiple Kafka topics for different data types
-- Handle Kafka connection failures gracefully
-- Support both sync and async message sending
-- Integrate with cloud services and data pipelines
-
-## Prerequisites
-
-Before building any Kafka-based DeepStream pipeline, install these system dependencies:
-
-```bash
-# REQUIRED: librdkafka -- DeepStream's Kafka protocol adapter (libnvds_kafka_proto.so)
-# dynamically links against librdkafka.so.1, which is NOT bundled with DeepStream.
-sudo apt-get install -y librdkafka-dev
-
-# If you also run a local MQTT broker for the tracker:
-sudo apt-get install -y libmosquitto1        # client library for nvtracker
-sudo apt-get install -y mosquitto            # broker daemon (if running locally)
-sudo apt-get install -y mosquitto-clients    # CLI tools for testing
-```
-
-> **Without `librdkafka-dev`**, any pipeline using `nvmsgbroker` with the Kafka protocol adapter will fail at startup with: `unable to open shared library` / `Failed to start`.
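-
-A preflight check can surface the missing library before the pipeline starts. The sketch below uses only the Python standard library; `librdkafka_available` is a hypothetical helper name, and the check assumes a Linux host where the dynamic loader can resolve `librdkafka.so.1`.
-
-```python
-import ctypes
-
-def librdkafka_available(soname: str = "librdkafka.so.1") -> bool:
-    """Return True if the dynamic loader can open the given shared library."""
-    try:
-        ctypes.CDLL(soname)
-        return True
-    except OSError:
-        return False
-```
-
-Calling `librdkafka_available()` at startup and exiting with a clear message when it returns `False` is easier to diagnose than the plugin's generic load failure.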
-
-## Architecture Overview
-
-### Critical Rule: async=0 on ALL Sinks
-
-**CRITICAL**: When using `tee` to split a pipeline OR using dynamic sources (nvmultiurisrcbin), **ALL sink elements MUST have `async: 0`**. This includes:
-- Display sinks (nveglglessink, nv3dsink)
-- Message broker sinks (nvmsgbroker)
-- File sinks (filesink)
-- Any other sink element
-
-**Symptom if missing**: Pipeline stays stuck in PAUSED state. Cameras show "added" but no video displays and no data flows.
-
-**Why**: GStreamer requires all sinks to "preroll" (receive data) before transitioning to PLAYING state. With `async: 0`, sinks don't block the state transition waiting for preroll.
-
-### Pipeline Architecture
-
-**IMPORTANT**: `nvmsgbroker` is a **SINK component** that terminates the pipeline branch. It cannot have downstream components.
-
-For **headless pipelines** (Kafka only, no display):
-```
-Source -> Decoder -> Muxer -> Inference -> Tracker -> Message Converter -> Message Broker (sink)
-```
-
-For **pipelines with both Kafka and display**, use `tee` to split paths:
-```
-Source -> Decoder -> Muxer -> Inference -> Tracker -> Tee
-                                                      |-> [Metadata Branch] Message Converter -> Message Broker (sink)
-                                                      |-> [Video Branch] Tiler -> OSD -> Converter -> Renderer (sink)
-```
-
-### Data Flow
-1. Video processing generates metadata (objects, tracks, frames)
-2. Metadata is converted to message format
-3. Messages are sent to Kafka broker (metadata branch terminates here)
-4. Video continues to display pipeline (if using tee split)
-5. Downstream Kafka consumers process analytics data
-
-## Implementation Approaches
-
-### Approach 1: Using nvmsgbroker Plugin (Native DeepStream)
-
-The native DeepStream approach uses the `nvmsgbroker` plugin together with the Kafka protocol adaptor library (`libnvds_kafka_proto.so`).
-
-**CRITICAL**: `nvmsgbroker` is a **SINK component** that terminates the pipeline branch. It cannot have downstream components like OSD or renderer. If you need both Kafka output and display, use `tee` to split the pipeline into separate branches.
-
-For detailed property tables and configuration file formats for `nvmsgconv` and `nvmsgbroker`, see Part 2 below.
-
-#### Example 1: Headless Pipeline (Kafka Only)
-
-```python
-from pyservicemaker import Pipeline
-import platform
-import sys
-
-def kafka_native_pipeline_headless(video_path, infer_config, kafka_config):
-    """
-    DeepStream pipeline with native Kafka integration (headless, no display)
-
-    Args:
-        video_path: Path to video file
-        infer_config: Inference configuration file
-        kafka_config: Kafka configuration dict
-    """
-    pipeline = Pipeline("kafka-pipeline-headless")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference
-    pipeline.add("nvinfer", "pgie", {"config-file-path": infer_config})
-
-    # Tracker
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-
-    # Message converter (converts metadata to message format)
-    # IMPORTANT: msg2p-newapi=True uses NvDsObjectMeta directly (no NvDsEventMsgMeta required)
-    pipeline.add("nvmsgconv", "msgconv", {
-        "config": kafka_config["msgconv_config"],
-        "payload-type": 0,  # 0=deepstream full schema, 1=minimal
-        "msg2p-newapi": True,  # CRITICAL: Use new API to avoid NvDsEventMsgMeta requirement
-    })
-
-    # Message broker (Kafka) - THIS IS A SINK, terminates the pipeline
-    # IMPORTANT: conn-str uses semicolon separator (host;port), NOT colon
-    pipeline.add("nvmsgbroker", "msgbroker", {
-        "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-        "conn-str": kafka_config["broker_servers"],  # Must be "host;port" format
-        "sync": 0,   # 0=async message sending, 1=sync
-        "async": 0,  # CRITICAL for dynamic sources: prevents state transition deadlock
-        "config": kafka_config["broker_config"]
-    })
-
-    # Link pipeline - msgbroker is the sink, no components after it
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "pgie", "tracker", "msgconv", "msgbroker")
-
-    pipeline.start().wait()
-```
-
-#### Example 2: Pipeline with Both Kafka and Display (Using Tee)
-
-```python
-from pyservicemaker import Pipeline
-import platform
-import sys
-
-def kafka_native_pipeline_with_display(video_path, infer_config, kafka_config):
-    """
-    DeepStream pipeline with native Kafka integration AND display
-
-    Uses tee to split pipeline into metadata branch (Kafka) and video branch (display)
-
-    Args:
-        video_path: Path to video file
-        infer_config: Inference configuration file
-        kafka_config: Kafka configuration dict
-    """
-    pipeline = Pipeline("kafka-pipeline-with-display")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference
-    pipeline.add("nvinfer", "pgie", {"config-file-path": infer_config})
-
-    # Tracker
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-
-    # Add tee to split pipeline
-    pipeline.add("tee", "tee")
-
-    # Metadata branch: tee -> queue -> msgconv -> msgbroker (sink)
-    pipeline.add("queue", "queue_meta")
-    # IMPORTANT: msg2p-newapi=True uses NvDsObjectMeta directly (no NvDsEventMsgMeta required)
-    pipeline.add("nvmsgconv", "msgconv", {
-        "config": kafka_config["msgconv_config"],
-        "payload-type": 0,
-        "msg2p-newapi": True,  # CRITICAL: Use new API
-    })
-    # IMPORTANT: conn-str uses semicolon separator (host;port), NOT colon
-    # CRITICAL: async=0 required on ALL sinks when using tee or dynamic sources!
-    pipeline.add("nvmsgbroker", "msgbroker", {
-        "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-        "conn-str": kafka_config["broker_servers"],  # Must be "host;port" format
-        "sync": 0,   # Async message sending
-        "async": 0,  # CRITICAL: ALL sinks need async=0 to prevent state deadlock!
-        "config": kafka_config["broker_config"]
-    })
-
-    # Video branch: tee -> queue -> tiler -> osd -> converter -> sink
-    pipeline.add("queue", "queue_video")
-    pipeline.add("nvmultistreamtiler", "tiler", {"rows": 1, "columns": 1})
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nvvideoconvert", "converter")
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    # CRITICAL: async=0 required on ALL sinks when using tee or dynamic sources!
-    pipeline.add(sink_type, "sink", {
-        "sync": 0,   # Don't sync to clock for live sources
-        "qos": 0,    # Disable QoS
-        "async": 0   # CRITICAL: ALL sinks need async=0 to prevent state deadlock!
-    })
-
-    # Link main pipeline
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "pgie", "tracker", "tee")
-
-    # Link metadata branch (terminates at msgbroker sink)
-    pipeline.link(("tee", "queue_meta"), ("src_%u", ""))
-    pipeline.link("queue_meta", "msgconv", "msgbroker")
-
-    # Link video branch (terminates at display sink)
-    pipeline.link(("tee", "queue_video"), ("src_%u", ""))
-    pipeline.link("queue_video", "tiler", "osd", "converter", "sink")
-
-    pipeline.start().wait()
-
-if __name__ == "__main__":
-    kafka_config = {
-        # IMPORTANT: Use semicolon separator, NOT colon!
-        "broker_servers": "localhost;9092",  # Correct: semicolon
-        # "broker_servers": "localhost:9092",  # Wrong: colon
-        "broker_config": "/path/to/kafka_broker_config.txt",
-        "msgconv_config": "/path/to/msgconv_config.txt"
-    }
-    # Use headless version for Kafka-only, or with_display version for both Kafka and display
-    kafka_native_pipeline_headless(sys.argv[1], sys.argv[2], kafka_config)
-    # OR
-    # kafka_native_pipeline_with_display(sys.argv[1], sys.argv[2], kafka_config)
-```
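-
-The `host;port` rule above is easy to get wrong when the same address is shared with other Kafka clients, which expect `host:port`. A minimal sketch of a converter follows; `to_conn_str` is a hypothetical helper name (not part of DeepStream or pyservicemaker), and it assumes a single hostname or IPv4 address rather than a broker list or an IPv6 literal.
-
-```python
-def to_conn_str(bootstrap_server: str) -> str:
-    """Convert 'host:port' into the 'host;port' form used for conn-str.
-
-    Hypothetical helper for illustration; handles one broker address only.
-    """
-    if ";" in bootstrap_server:
-        return bootstrap_server  # Already in nvmsgbroker form
-    host, sep, port = bootstrap_server.rpartition(":")
-    if not sep or not host or not port.isdigit():
-        raise ValueError(f"expected 'host:port', got {bootstrap_server!r}")
-    return f"{host};{port}"
-```
-
-For example, `to_conn_str("localhost:9092")` returns `"localhost;9092"`, so one setting can feed both `conn-str` and a `kafka-python` producer.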
-
-#### Example 3: Using Legacy API (msg2p-newapi=0) with EventMessageUserMetadata
-
-When `msg2p-newapi` is `0` (the default), `nvmsgconv` expects `NvDsEventMsgMeta` to be pre-attached to each frame buffer. This metadata is **NOT** generated automatically by any DeepStream plugin. You must attach it via a probe **upstream** of `nvmsgconv`.
-
-There are two sub-approaches:
-
-##### Option A: Built-in `add_message_meta_probe` (Simplest)
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator
-import platform
-
-def kafka_legacy_builtin_probe(video_path, infer_config, kafka_config):
-    """
-    Kafka pipeline using msg2p-newapi=0 with built-in add_message_meta_probe.
-    The built-in probe automatically generates EventMessageUserMetadata
-    from NvDsObjectMeta for every detected object.
-    """
-    pipeline = Pipeline("kafka-legacy-builtin")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference + tracker
-    pipeline.add("nvinfer", "pgie", {"config-file-path": infer_config})
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-
-    # OSD (needed as attachment point for the built-in probe)
-    pipeline.add("nvosdbin", "osd")
-
-    # Tee to split display and Kafka branches
-    pipeline.add("tee", "tee")
-
-    # Metadata branch
-    pipeline.add("queue", "queue_meta")
-    pipeline.add("nvmsgconv", "msgconv", {
-        "config": kafka_config["msgconv_config"],
-        "payload-type": 0,
-        "msg2p-newapi": 0,  # Legacy API - requires EventMessageUserMetadata
-    })
-    pipeline.add("nvmsgbroker", "msgbroker", {
-        "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-        "conn-str": kafka_config["broker_servers"],
-        "sync": 0,
-        "async": 0,
-    })
-
-    # Display branch
-    pipeline.add("queue", "queue_video")
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {"sync": 0, "qos": 0, "async": 0})
-
-    # Link
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "pgie", "tracker", "osd", "tee")
-    pipeline.link(("tee", "queue_meta"), ("src_%u", ""))
-    pipeline.link("queue_meta", "msgconv", "msgbroker")
-    pipeline.link(("tee", "queue_video"), ("src_%u", ""))
-    pipeline.link("queue_video", "sink")
-
-    # CRITICAL: attach built-in probe AFTER osd, BEFORE tee->msgconv
-    # This automatically creates EventMessageUserMetadata from NvDsObjectMeta
-    pipeline.attach("osd", "add_message_meta_probe", "metadata generator")
-
-    pipeline.start().wait()
-```
-
-**Reference**: `deepstream_test4_app` sample
-(`/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test4_app/deepstream_test4.py`)
-
-##### Option B: Custom EventMessageGenerator (Multi-Camera / Custom Sensor Mappings)
-
-For multi-camera pipelines where you need control over sensor IDs and URIs:
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator, SensorInfo
-
-class EventMessageGenerator(BatchMetadataOperator):
-    """
-    Generate EventMessageUserMetadata for downstream nvmsgconv.
-    Required when msg2p-newapi=0 (legacy API).
-
-    Uses pyservicemaker API:
-        batch_meta.acquire_event_message_meta()  -> acquire from pool
-        event_msg.generate(obj, frame, sensor_id, uri, labels)  -> populate
-        frame_meta.append(event_msg)  -> attach to frame
-    """
-
-    def __init__(self, sensor_map, labels):
-        super().__init__()
-        self._sensor_map = sensor_map  # dict: source_id (int) -> SensorInfo
-        self._labels = labels          # list of class label strings
-
-    def handle_metadata(self, batch_meta, frame_interval=1):
-        for frame_meta in batch_meta.frame_items:
-            frame_num = frame_meta.frame_number
-            for object_meta in frame_meta.object_items:
-                if not (frame_num % frame_interval):
-                    event_msg = batch_meta.acquire_event_message_meta()
-                    if event_msg:
-                        source_id = frame_meta.source_id
-                        sensor_info = self._sensor_map.get(source_id)
-                        sensor_id = sensor_info.sensor_id if sensor_info else "N/A"
-                        uri = sensor_info.uri if sensor_info else "N/A"
-                        event_msg.generate(
-                            object_meta, frame_meta, sensor_id, uri, self._labels
-                        )
-                        frame_meta.append(event_msg)
-
-
-def kafka_legacy_custom_generator(video_paths, infer_config, kafka_config, labels):
-    """
-    Multi-camera Kafka pipeline using msg2p-newapi=0 with custom EventMessageGenerator.
-    """
-    pipeline = Pipeline("kafka-legacy-custom")
-
-    # Build sensor map from video paths
-    sensor_map = {}
-    for i, uri in enumerate(video_paths):
-        sensor_map[i] = SensorInfo(
-            sensor_id=f"Camera{i+1}",
-            sensor_name=f"cam{i+1}",
-            uri=uri
-        )
-
-    # ... (add sources, inference, tracker, tee, msgconv with msg2p-newapi=0, etc.)
-
-    # Attach custom EventMessageGenerator probe UPSTREAM of nvmsgconv
-    pipeline.attach(
-        "tracker",
-        Probe("event_msg_gen", EventMessageGenerator(sensor_map, labels))
-    )
-
-    pipeline.start().wait()
-```
-
-**Key API calls**:
-- `batch_meta.acquire_event_message_meta()` -- acquires `EventMessageUserMetadata` from the pool
-- `event_msg.generate(object_meta, frame_meta, sensor_id, uri, labels)` -- populates the metadata
-- `frame_meta.append(event_msg)` -- attaches it to the frame for downstream nvmsgconv
-
-**Reference**: `deepstream_test5_app` sample
-(`/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test5_app/deepstream_test5.py`)
-
----
-
-#### Kafka Broker Configuration File
-
-**kafka_broker_config.txt**:
-```
-[broker]
-enable=1
-broker-ip-port=localhost:9092
-topic=deepstream-analytics
-# Optional: SSL/TLS configuration
-# enable-tls=1
-# ca-file=/path/to/ca-cert
-# client-cert-file=/path/to/client-cert
-# client-key-file=/path/to/client-key
-```
-
-#### Message Converter Configuration File
-
-**msgconv_config.txt**:
-```
-[message-converter]
-enable=1
-# Message format: deepstream or custom
-msg-format=deepstream
-# Schema file for custom format
-schema-file=/path/to/schema.json
-# Payload type: 0=deepstream full schema, 1=deepstream minimal, 257=custom
-payload-type=0
-```
-
-### Approach 2: Using Python Kafka Producer (Custom Probe)
-
-This approach uses Python's `kafka-python` library in a custom probe for more control.
-
-#### Custom Kafka Producer Probe
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator
-from kafka import KafkaProducer
-from kafka.errors import KafkaError
-import json
-import sys
-import platform
-
-class KafkaMetadataSender(BatchMetadataOperator):
-    """
-    Custom probe to send metadata to Kafka
-
-    Sends object detection and tracking metadata to Kafka topics
-    """
-    def __init__(self, kafka_config):
-        """
-        Initialize Kafka producer
-
-        Args:
-            kafka_config: Dict with Kafka configuration
-                - bootstrap_servers: Kafka broker addresses
-                - topic: Topic name
-                - security_config: Optional security config
-        """
-        super().__init__()
-
-        # Kafka producer configuration
-        producer_config = {
-            "bootstrap_servers": kafka_config["bootstrap_servers"],
-            "value_serializer": lambda v: json.dumps(v).encode('utf-8'),
-            "key_serializer": lambda k: str(k).encode('utf-8') if k else None,
-            "acks": "all",  # Wait for all replicas
-            "retries": 3,
-            "max_in_flight_requests_per_connection": 1,
-            "enable_idempotence": True
-        }
-
-        # Add security configuration if provided
-        if "security_config" in kafka_config:
-            security = kafka_config["security_config"]
-            if security.get("use_ssl"):
-                producer_config.update({
-                    "security_protocol": "SSL",
-                    "ssl_cafile": security.get("ca_file"),
-                    "ssl_certfile": security.get("cert_file"),
-                    "ssl_keyfile": security.get("key_file")
-                })
-            elif security.get("use_sasl"):
-                producer_config.update({
-                    "security_protocol": "SASL_SSL",
-                    "sasl_mechanism": security.get("sasl_mechanism", "PLAIN"),
-                    "sasl_plain_username": security.get("username"),
-                    "sasl_plain_password": security.get("password")
-                })
-
-        self.producer = KafkaProducer(**producer_config)
-        self.topic = kafka_config["topic"]
-        self.send_frame_metadata = kafka_config.get("send_frame_metadata", True)
-        self.send_object_metadata = kafka_config.get("send_object_metadata", True)
-        self.batch_size = kafka_config.get("batch_size", 1)  # Send every N frames
-
-        self.frame_count = 0
-        self.error_count = 0
-
-    def handle_metadata(self, batch_meta):
-        """Process batch metadata and send to Kafka"""
-        for frame_meta in batch_meta.frame_items:
-            self.frame_count += 1
-
-            # Send metadata every N frames (if batch_size > 1)
-            if self.frame_count % self.batch_size != 0:
-                continue
-
-            try:
-                # Prepare message
-                message = self._prepare_message(frame_meta)
-
-                # Send to Kafka
-                future = self.producer.send(
-                    topic=self.topic,
-                    key=str(frame_meta.frame_number),  # Use frame number as key
-                    value=message
-                )
-
-                # Optional: Add callback for success/failure
-                future.add_callback(self._on_send_success)
-                future.add_errback(self._on_send_error)
-
-            except Exception as e:
-                print(f"Error sending message to Kafka: {e}")
-                self.error_count += 1
-
-    def _prepare_message(self, frame_meta):
-        """Prepare message from frame metadata"""
-        message = {
-            "frame_number": frame_meta.frame_number,
-            # Note: Use buffer_pts for PTS timestamp, ntp_timestamp for NTP timestamp
-            "buffer_pts": frame_meta.buffer_pts,
-            "ntp_timestamp": frame_meta.ntp_timestamp,
-            "pad_index": frame_meta.pad_index,
-            "source_id": frame_meta.source_id  # Use source_id property
-        }
-
-        # Add frame-level metadata
-        if self.send_frame_metadata:
-            message["frame_metadata"] = {
-                "source_width": frame_meta.source_width,
-                "source_height": frame_meta.source_height,
-                "pipeline_width": frame_meta.pipeline_width,
-                "pipeline_height": frame_meta.pipeline_height
-            }
-
-        # Add object metadata
-        if self.send_object_metadata:
-            objects = []
-            for obj_meta in frame_meta.object_items:
-                obj_data = {
-                    "class_id": obj_meta.class_id,
-                    "confidence": float(obj_meta.confidence),
-                    # Use object_id to get the tracker-assigned tracking ID
-                    "object_id": obj_meta.object_id,
-                    "bbox": {
-                        "left": float(obj_meta.rect_params.left),
-                        "top": float(obj_meta.rect_params.top),
-                        "width": float(obj_meta.rect_params.width),
-                        "height": float(obj_meta.rect_params.height)
-                    }
-                }
-
-                # Add secondary inference results if available
-                # (stored in obj_meta.obj_user_meta_list)
-                if hasattr(obj_meta, 'obj_user_meta_list'):
-                    obj_data["attributes"] = self._extract_attributes(obj_meta)
-
-                objects.append(obj_data)
-
-            message["objects"] = objects
-            message["object_count"] = len(objects)
-
-        return message
-
-    def _extract_attributes(self, obj_meta):
-        """Extract secondary inference attributes from object metadata"""
-        attributes = {}
-        # Process obj_user_meta_list to extract classification results
-        # This depends on how secondary inference stores results
-        return attributes
-
-    def _on_send_success(self, record_metadata):
-        """Callback for successful message send"""
-        pass  # Can add logging here
-
-    def _on_send_error(self, exception):
-        """Callback for failed message send"""
-        print(f"Failed to send message to Kafka: {exception}")
-        self.error_count += 1
-
-    def flush(self):
-        """Flush pending messages"""
-        self.producer.flush()
-
-    def close(self):
-        """Close Kafka producer"""
-        self.producer.flush()
-        self.producer.close()
-        print(f"Kafka producer closed. Processed {self.frame_count} frames, {self.error_count} errors")
-
-def kafka_custom_probe_pipeline(video_path, infer_config, kafka_config):
-    """Pipeline with custom Kafka probe"""
-    pipeline = Pipeline("kafka-custom-probe")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference
-    pipeline.add("nvinfer", "pgie", {"config-file-path": infer_config})
-
-    # Tracker
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-
-    # OSD and sink
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nvvideoconvert", "converter")
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {"sync": 1})
-
-    # Link pipeline
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "pgie", "tracker", "osd", "converter", "sink")
-
-    # Attach Kafka probe
-    kafka_sender = KafkaMetadataSender(kafka_config)
-    pipeline.attach("tracker", Probe("kafka-sender", kafka_sender))
-
-    try:
-        pipeline.start().wait()
-    finally:
-        kafka_sender.close()
-
-if __name__ == "__main__":
-    kafka_config = {
-        "bootstrap_servers": "localhost:9092",
-        "topic": "deepstream-analytics",
-        "send_frame_metadata": True,
-        "send_object_metadata": True,
-        "batch_size": 1  # Send every frame
-    }
-    kafka_custom_probe_pipeline(sys.argv[1], sys.argv[2], kafka_config)
-```
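-
-The message-building logic can be exercised without a GPU or a broker by feeding it stand-in objects. The sketch below is an illustration under assumptions: `build_message` is a hypothetical free function mirroring a subset of `_prepare_message`, and the `SimpleNamespace` objects only imitate the attribute names used above (`frame_number`, `source_id`, `object_items`, `rect_params`); they are not pyservicemaker types.
-
-```python
-import json
-from types import SimpleNamespace
-
-def build_message(frame_meta):
-    """Build a JSON-serializable dict from frame-like metadata."""
-    objects = [
-        {
-            "class_id": obj.class_id,
-            "object_id": obj.object_id,
-            "confidence": float(obj.confidence),
-            "bbox": {
-                "left": float(obj.rect_params.left),
-                "top": float(obj.rect_params.top),
-                "width": float(obj.rect_params.width),
-                "height": float(obj.rect_params.height),
-            },
-        }
-        for obj in frame_meta.object_items
-    ]
-    return {
-        "frame_number": frame_meta.frame_number,
-        "source_id": frame_meta.source_id,
-        "objects": objects,
-        "object_count": len(objects),
-    }
-
-# Stand-in metadata: one frame with one detected object
-fake_obj = SimpleNamespace(
-    class_id=0, object_id=7, confidence=0.9,
-    rect_params=SimpleNamespace(left=10, top=20, width=30, height=40),
-)
-fake_frame = SimpleNamespace(frame_number=1, source_id=0, object_items=iter([fake_obj]))
-
-payload = json.dumps(build_message(fake_frame)).encode("utf-8")
-```
-
-Serializing with `json.dumps` here mirrors the producer's `value_serializer`, so a field that is not JSON-serializable shows up in the test rather than in the streaming thread.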
-
-### Approach 3: Multi-Topic Kafka Integration
-
-Send different types of metadata to different Kafka topics.
-
-```python
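-import json
-
-from kafka import KafkaProducer
-from pyservicemaker import BatchMetadataOperator
-
-class MultiTopicKafkaSender(BatchMetadataOperator):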
-    """Send different metadata types to different Kafka topics"""
-    def __init__(self, kafka_configs):
-        """
-        Args:
-            kafka_configs: Dict mapping topic names to Kafka configs
-                {
-                    "object-detections": {...},
-                    "tracking-events": {...},
-                    "frame-metadata": {...}
-                }
-        """
-        super().__init__()
-        self.producers = {}
-        self.topics = {}
-
-        for topic_name, config in kafka_configs.items():
-            producer = KafkaProducer(
-                bootstrap_servers=config["bootstrap_servers"],
-                value_serializer=lambda v: json.dumps(v).encode('utf-8')
-            )
-            self.producers[topic_name] = producer
-            self.topics[topic_name] = config.get("topic", topic_name)
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Send object detections
-            if "object-detections" in self.producers:
-                detections = self._prepare_detections(frame_meta)
-                self.producers["object-detections"].send(
-                    topic=self.topics["object-detections"],
-                    value=detections
-                )
-
-            # Send tracking events (new tracks, lost tracks)
-            if "tracking-events" in self.producers:
-                events = self._prepare_tracking_events(frame_meta)
-                if events:
-                    self.producers["tracking-events"].send(
-                        topic=self.topics["tracking-events"],
-                        value=events
-                    )
-
-            # Send frame metadata
-            if "frame-metadata" in self.producers:
-                frame_data = self._prepare_frame_metadata(frame_meta)
-                self.producers["frame-metadata"].send(
-                    topic=self.topics["frame-metadata"],
-                    value=frame_data
-                )
-
-    def _prepare_detections(self, frame_meta):
-        """Prepare object detection message"""
-        # Build detections list by iterating (object_items is an iterator)
-        detections = [
-            {
-                "class_id": obj.class_id,
-                "confidence": float(obj.confidence),
-                "bbox": {
-                    "left": float(obj.rect_params.left),
-                    "top": float(obj.rect_params.top),
-                    "width": float(obj.rect_params.width),
-                    "height": float(obj.rect_params.height)
-                }
-            }
-            for obj in frame_meta.object_items
-        ]
-        return {
-            "frame_number": frame_meta.frame_number,
-            "buffer_pts": frame_meta.buffer_pts,  # Use buffer_pts for timestamp
-            "ntp_timestamp": frame_meta.ntp_timestamp,
-            "detections": detections
-        }
-
-    def _prepare_tracking_events(self, frame_meta):
-        """Prepare tracking event message"""
-        # Detect new tracks, lost tracks, etc.
-        # This requires maintaining state across frames
-        return {}  # Implement tracking event detection
-
-    def _prepare_frame_metadata(self, frame_meta):
-        """Prepare frame metadata message"""
-        # Note: object_items is an ITERATOR, not a list - cannot use len() directly
-        # Count objects by iterating
-        obj_count = sum(1 for _ in frame_meta.object_items)
-        return {
-            "frame_number": frame_meta.frame_number,
-            "buffer_pts": frame_meta.buffer_pts,  # Use buffer_pts for timestamp
-            "ntp_timestamp": frame_meta.ntp_timestamp,
-            "object_count": obj_count
-        }
-
-    def close(self):
-        """Close all producers"""
-        for producer in self.producers.values():
-            producer.flush()
-            producer.close()
-```
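-
-`_prepare_tracking_events` above is left as a stub because it needs state across frames. One way to fill it in, sketched under assumptions: `TrackEventDetector` is a hypothetical helper, it treats an ID as lost only after it has been absent for `max_age` consecutive frames, and it assumes one detector instance per source so that tracker IDs do not collide.
-
-```python
-class TrackEventDetector:
-    """Report new and lost track IDs by comparing consecutive frames."""
-
-    def __init__(self, max_age=30):
-        self.max_age = max_age
-        self._missing = {}  # track ID -> consecutive frames without a detection
-
-    def update(self, object_ids):
-        """Take the IDs seen in this frame; return {'new': [...], 'lost': [...]}."""
-        current = set(object_ids)
-        new = sorted(current.difference(self._missing))
-        lost = []
-        for track_id in list(self._missing):
-            if track_id in current:
-                continue
-            self._missing[track_id] += 1
-            if self._missing[track_id] >= self.max_age:
-                lost.append(track_id)
-                del self._missing[track_id]
-        for track_id in current:
-            self._missing[track_id] = 0  # Seen now, so reset the age
-        return {"new": new, "lost": sorted(lost)}
-```
-
-Inside the probe, `update(obj.object_id for obj in frame_meta.object_items)` would return the events for that frame, and the sender can skip the Kafka call when both lists are empty.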
-
-## Error Handling and Resilience
-
-### Retry Logic and Error Handling
-
-```python
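-import json
-
-from kafka import KafkaProducer
-from pyservicemaker import BatchMetadataOperator
-
-class ResilientKafkaSender(BatchMetadataOperator):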
-    """Kafka sender with retry logic and error handling"""
-    def __init__(self, kafka_config):
-        super().__init__()
-        self.config = kafka_config
-        self.max_retries = kafka_config.get("max_retries", 3)
-        self.retry_delay = kafka_config.get("retry_delay", 1.0)
-        self.message_queue = []  # Queue for failed messages
-        self._init_producer()
-
-    def _init_producer(self):
-        """Initialize or reinitialize producer"""
-        try:
-            self.producer = KafkaProducer(
-                bootstrap_servers=self.config["bootstrap_servers"],
-                value_serializer=lambda v: json.dumps(v).encode('utf-8'),
-                retries=self.max_retries,
-                max_in_flight_requests_per_connection=1,
-                enable_idempotence=True
-            )
-            self.connected = True
-        except Exception as e:
-            print(f"Failed to initialize Kafka producer: {e}")
-            self.connected = False
-
-    def handle_metadata(self, batch_meta):
-        # Build plain-dict messages now; batch_meta should not be held after the callback returns
-        messages = [self._prepare_message(f) for f in batch_meta.frame_items]
-        self.message_queue.extend(messages)
-
-        if not self.connected:
-            self._init_producer()
-            if not self.connected:
-                return  # Messages stay queued for a later retry
-
-        try:
-            # Send oldest first; a message leaves the queue only after delivery
-            while self.message_queue:
-                self._send_message(self.message_queue[0])
-                self.message_queue.pop(0)
-        except Exception as e:
-            print(f"Error sending to Kafka: {e}")
-            # Try to reconnect on the next callback
-            self.connected = False
-
-    def _send_message(self, message):
-        """Send one prepared message to Kafka"""
-        future = self.producer.send(
-            topic=self.config["topic"],
-            value=message
-        )
-        # Wait for delivery (synchronous for reliability)
-        future.get(timeout=10)
-```
-
-## Performance Optimization
-
-### Batching Messages
-
-```python
-class BatchedKafkaSender(BatchMetadataOperator):
-    """Batch multiple frames before sending to Kafka"""
-    def __init__(self, kafka_config, batch_size=10):
-        super().__init__()
-        self.producer = KafkaProducer(
-            bootstrap_servers=kafka_config["bootstrap_servers"],
-            value_serializer=lambda v: json.dumps(v).encode('utf-8'),
-            batch_size=16384,  # Kafka batch size in bytes
-            linger_ms=100  # Wait up to 100ms to batch
-        )
-        self.topic = kafka_config["topic"]
-        self.batch_size = batch_size
-        self.frame_buffer = []
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Convert now rather than buffering frame_meta, which should not be held after the callback returns
-            self.frame_buffer.append(self._prepare_message(frame_meta))
-
-            if len(self.frame_buffer) >= self.batch_size:
-                self._send_batch()
-
-    def _send_batch(self):
-        """Send batched frames"""
-        batch_message = {"frames": list(self.frame_buffer)}
-        self.producer.send(topic=self.topic, value=batch_message)
-        self.frame_buffer.clear()
-
-    def flush(self):
-        """Flush remaining frames"""
-        if self.frame_buffer:
-            self._send_batch()
-        self.producer.flush()
-```
-
-## Testing and Validation
-
-### Test Kafka Consumer
-
-```python
-from kafka import KafkaConsumer
-import json
-
-def test_kafka_consumer(bootstrap_servers, topic):
-    """Test consumer to verify messages are being sent"""
-    consumer = KafkaConsumer(
-        topic,
-        bootstrap_servers=bootstrap_servers,
-        value_deserializer=lambda m: json.loads(m.decode('utf-8')),
-        auto_offset_reset='earliest',
-        enable_auto_commit=True
-    )
-
-    print(f"Consuming messages from topic: {topic}")
-    for message in consumer:
-        print(f"Received: {message.value}")
-```
-
-## Common Patterns
-
-### Pattern 1: Real-time Analytics Dashboard
-- Send object counts and statistics to Kafka
-- Dashboard consumes and displays in real-time
-
-### Pattern 2: Data Lake Ingestion
-- Send all metadata to Kafka
-- Kafka Connect streams to data lake (S3, HDFS)
-
-### Pattern 3: Alert System
-- Send only significant events (intrusions, anomalies)
-- Alert service consumes and triggers notifications
-
-### Pattern 4: Multi-Tenant Analytics
-- Use different topics for different customers/streams
-- Enable topic-based access control
-
----
-
-# Part 2: Message Broker and Converter Configuration Reference
-
-## Architecture
-
-```
-Pipeline -> nvmsgconv -> nvmsgbroker -> External Broker
-              |              |
-              |              +-- Protocol Adaptor Library
-              |                   (libnvds_kafka_proto.so, etc.)
-              |
-              +-- Config File (sensor, place, analytics metadata)
-```
-
-**IMPORTANT**: `nvmsgbroker` is a **SINK component** that terminates the pipeline branch. It cannot have downstream components.
-
----
-
-## nvmsgconv Plugin
-
-### Purpose
-
-Converts DeepStream metadata (NvDsEventMsgMeta or NvDsFrameMeta/NvDsObjectMeta) to message payload format.
-
-### GStreamer Properties
-
-| Property | Type | Description | Default |
-|----------|------|-------------|---------|
-| `config` | string | Path to message converter configuration file | None |
-| `payload-type` | int | Payload schema type (see below) | 0 |
-| `comp-id` | uint | Component ID for filtering metadata | All |
-| `msg2p-lib` | string | Path to custom payload generation library | None |
-| `frame-interval` | uint | Generate payload every N frames | 30 |
-| `msg2p-newapi` | bool | **IMPORTANT**: Use new message-to-payload API (see below) | false |
-| `debug-payload-dir` | string | Directory to dump payloads for debugging | None |
-| `multiple-payloads` | bool | Generate multiple payloads per buffer | false |
-
-### CRITICAL: msg2p-newapi Property
-
-**Problem**: By default (`msg2p-newapi: false`), `nvmsgconv` requires `NvDsEventMsgMeta` (exposed as `EventMessageUserMetadata` in pyservicemaker) to be attached to the buffer. This metadata is **NOT automatically generated** by inference or tracker plugins. Without explicitly handling this, nvmsgconv silently produces **zero messages**.
-
-**Two Solutions** (pick one):
-
-#### Solution A: Set msg2p-newapi=True (Simple, Recommended for Most Cases)
-
-Uses the new API that reads directly from `NvDsFrameMeta` and `NvDsObjectMeta` without requiring `NvDsEventMsgMeta`:
-
-```python
-# CORRECT - Uses object metadata directly, no NvDsEventMsgMeta needed
-pipeline.add("nvmsgconv", "msgconv", {
-    "config": msgconv_config,
-    "payload-type": 0,
-    "msg2p-newapi": True,      # Use new API - reads from NvDsObjectMeta directly
-})
-```
-
-#### Solution B: Keep msg2p-newapi=False and Attach an EventMessageUserMetadata Probe
-
-Required when using custom `msg2p-lib` payload libraries that expect legacy `NvDsEventMsgMeta`, or when you need fine-grained control over per-object message generation.
-
-**Option B1: Built-in probe** (simplest):
-```python
-pipeline.add("nvmsgconv", "msgconv", {
-    "config": msgconv_config,
-    "payload-type": 0,
-    # msg2p-newapi defaults to False (legacy API)
-})
-
-# Built-in probe auto-generates EventMessageUserMetadata from NvDsObjectMeta
-pipeline.attach("osd", "add_message_meta_probe", "metadata generator")
-```
-
-**Option B2: Custom EventMessageGenerator** (for multi-camera / custom sensor mappings):
-```python
-from pyservicemaker import Probe, BatchMetadataOperator, SensorInfo
-
-class EventMessageGenerator(BatchMetadataOperator):
-    def __init__(self, sensor_map, labels):
-        super().__init__()
-        self._sensor_map = sensor_map  # dict: source_id -> SensorInfo
-        self._labels = labels          # list of class label strings
-
-    def handle_metadata(self, batch_meta, frame_interval=1):
-        for frame_meta in batch_meta.frame_items:
-            for object_meta in frame_meta.object_items:
-                event_msg = batch_meta.acquire_event_message_meta()
-                if event_msg:
-                    source_id = frame_meta.source_id
-                    sensor_info = self._sensor_map.get(source_id)
-                    sensor_id = sensor_info.sensor_id if sensor_info else "N/A"
-                    uri = sensor_info.uri if sensor_info else "N/A"
-                    event_msg.generate(
-                        object_meta, frame_meta, sensor_id, uri, self._labels
-                    )
-                    frame_meta.append(event_msg)
-
-# Attach UPSTREAM of nvmsgconv (e.g., on tracker or osd element)
-sensor_map = {0: SensorInfo("Camera1", "cam1", "file:///video.mp4")}
-labels = ["car", "bicycle", "person", "roadsign"]
-pipeline.attach("tracker", Probe("event_msg_gen", EventMessageGenerator(sensor_map, labels)))
-```
-
-For complete pipeline examples using the legacy API, see Part 1 above (Example 3).
-
-#### Common Mistake
-
-```python
-# WRONG - Without msg2p-newapi=True AND without EventMessageUserMetadata probe,
-# nvmsgconv has no input and produces ZERO messages silently!
-pipeline.add("nvmsgconv", "msgconv", {
-    "config": msgconv_config,
-    "payload-type": 0
-})
-```
-
-**Reference samples**:
-- Built-in probe: `/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test4_app/deepstream_test4.py`
-- Custom generator: `/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test5_app/deepstream_test5.py`
-
-### Payload Types
-
-| Value | Name | Description |
-|-------|------|-------------|
-| 0 | `PAYLOAD_DEEPSTREAM` | Full DeepStream schema - separate JSON payload per object |
-| 1 | `PAYLOAD_DEEPSTREAM_MINIMAL` | Minimal schema - multiple objects in single JSON payload |
-| 2 | `PAYLOAD_DEEPSTREAM_PROTOBUF` | Protobuf encoded - multiple objects in single payload |
-| 256 | `PAYLOAD_CUSTOM` | Custom schema using msg2p-lib |
-
-### Pipeline Usage
-
-```python
-# Using pyservicemaker Pipeline API
-pipeline.add("nvmsgconv", "msgconv", {
-    "config": "/path/to/msgconv_config.txt",
-    "payload-type": 0,  # Full DeepStream schema
-    "msg2p-newapi": True  # Or attach an EventMessageUserMetadata probe (see above)
-})
-```
-
----
-
-## nvmsgconv Configuration File
-
-The configuration file defines metadata about sensors, places, and analytics that gets embedded in the message payload.
-
-### Supported Formats
-
-- **INI-style format** (`.txt`) - Recommended
-- **YAML format** (`.yml`)
-
-### Configuration Sections
-
-#### [sensor0], [sensor1], ... - Sensor/Camera Information
-
-| Parameter | Type | Description | Required |
-|-----------|------|-------------|----------|
-| `enable` | int | Enable this sensor (0/1) | Yes |
-| `type` | string | Sensor type (e.g., "Camera", "Lidar") | Yes |
-| `id` | string | Unique sensor identifier | Yes |
-| `location` | string | GPS coordinates "lat;lon;alt" | No |
-| `description` | string | Human-readable description | No |
-| `coordinate` | string | Local coordinates "x;y;z" | No |
-
-#### [place0], [place1], ... - Location/Place Information
-
-| Parameter | Type | Description | Required |
-|-----------|------|-------------|----------|
-| `enable` | int | Enable this place (0/1) | Yes |
-| `id` | string/int | Place identifier | Yes |
-| `type` | string | Place type (e.g., "garage", "intersection/road") | Yes |
-| `name` | string | Place name | Yes |
-| `location` | string | GPS coordinates "lat;lon;alt" | No |
-| `coordinate` | string | Local coordinates "x;y;z" | No |
-| `place-sub-field1` | string | Custom sub-field 1 | No |
-| `place-sub-field2` | string | Custom sub-field 2 | No |
-| `place-sub-field3` | string | Custom sub-field 3 | No |
-
-#### [analytics0], [analytics1], ... - Analytics Information
-
-| Parameter | Type | Description | Required |
-|-----------|------|-------------|----------|
-| `enable` | int | Enable this analytics config (0/1) | Yes |
-| `id` | string | Analytics identifier | Yes |
-| `description` | string | Analytics description | No |
-| `source` | string | Analytics source/algorithm name | No |
-| `version` | string | Analytics version | No |
-
-### Example Configuration (INI-style)
-
-```ini
-# msgconv_config.txt
-
-[sensor0]
-enable=1
-type=Camera
-id=CAMERA_001
-location=45.293701;-75.830391;48.155
-description=Entrance Camera
-coordinate=5.2;10.1;11.2
-
-[sensor1]
-enable=1
-type=Camera
-id=CAMERA_002
-location=45.293702;-75.830392;48.156
-description=Exit Camera
-coordinate=6.2;11.1;12.2
-
-[place0]
-enable=1
-id=1
-type=garage
-name=ParkingLot_A
-location=30.32;-40.55;100.0
-coordinate=1.0;2.0;3.0
-place-sub-field1=Zone_A
-place-sub-field2=Lane_1
-place-sub-field3=Level_P1
-
-[analytics0]
-enable=1
-id=ANALYTICS_001
-description=Vehicle Detection and Tracking
-source=ResNet18_TrafficCamNet
-version=1.0
-```
-
-### Example Configuration (YAML)
-
-```yaml
-# msgconv_config.yml
-
-sensor0:
-  enable: 1
-  type: Camera
-  id: CAMERA_001
-  location: 45.293701;-75.830391;48.155
-  description: Entrance Camera
-  coordinate: 5.2;10.1;11.2
-
-place0:
-  enable: 1
-  id: 1
-  type: garage
-  name: ParkingLot_A
-  location: 30.32;-40.55;100.0
-  coordinate: 1.0;2.0;3.0
-  place-sub-field1: Zone_A
-  place-sub-field2: Lane_1
-  place-sub-field3: Level_P1
-
-analytics0:
-  enable: 1
-  id: ANALYTICS_001
-  description: Vehicle Detection and Tracking
-  source: ResNet18_TrafficCamNet
-  version: 1.0
-```
-
-### Multi-Source Configuration
-
-For multi-source pipelines, create sensor/place entries for each source:
-
-```ini
-# Sensor entries map to source_id in the pipeline
-[sensor0]
-enable=1
-type=Camera
-id=STREAM_0
-description=Camera 0
-
-[sensor1]
-enable=1
-type=Camera
-id=STREAM_1
-description=Camera 1
-
-# Place entries map to source_id
-[place0]
-enable=1
-id=0
-type=intersection
-name=Location_0
-
-[place1]
-enable=1
-id=1
-type=intersection
-name=Location_1
-```
-
----
-
-## nvmsgbroker Plugin
-
-### Purpose
-
-Sends payload metadata to external message brokers using protocol adaptor libraries.
-
-### GStreamer Properties
-
-| Property | Type | Description | Default |
-|----------|------|-------------|---------|
-| `proto-lib` | string | Path to protocol adaptor library | **Required** |
-| `conn-str` | string | Connection string for broker | **Required** |
-| `config` | string | Path to protocol-specific config file | None |
-| `topic` | string | Message topic name | None |
-| `comp-id` | uint | Component ID for filtering payloads | All |
-| `sync` | int | Synchronous (1) or async (0) message sending | 0 |
-| `async` | int | **CRITICAL**: Set to 0 for dynamic sources/tee pipelines | 1 |
-| `new-api` | bool | Use new nvmsgbroker API | false |
-| `sleep-time` | uint | Sleep time in ms between do_work calls | 0 |
-
-**CRITICAL: async=0 for Dynamic Sources and Tee Splits**
-
-When using `nvmsgbroker` in a pipeline with:
-- Dynamic sources (nvmultiurisrcbin)
-- Tee splits (multiple branches with different sinks)
-
-You **MUST** set `async: 0` on nvmsgbroker AND all other sinks. Otherwise, the pipeline will be stuck in PAUSED state.
-
-```python
-# CORRECT - async=0 for tee/dynamic source pipelines
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    "conn-str": "localhost;9092",
-    "sync": 0,   # Async message sending
-    "async": 0,  # CRITICAL: Required for tee/dynamic sources!
-})
-
-# WRONG - missing async=0 causes pipeline stuck in PAUSED
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    "conn-str": "localhost;9092",
-    "sync": 0,
-    # async defaults to 1, causing state transition deadlock!
-})
-```
-
-### Protocol Adaptor Libraries
-
-Located at `/opt/nvidia/deepstream/deepstream/lib/`:
-
-| Protocol | Library | Connection String Format |
-|----------|---------|-------------------------|
-| Kafka | `libnvds_kafka_proto.so` | `host;port` (semicolon-separated) |
-| MQTT | `libnvds_mqtt_proto.so` | `host;port` (semicolon-separated) |
-| Redis | `libnvds_redis_proto.so` | `host;port` (semicolon-separated) |
-| AMQP | `libnvds_amqp_proto.so` | `host;port;username;password` (semicolon-separated) |
-| Azure IoT | `libnvds_azure_proto.so` | Full Azure connection string |
-| Azure IoT Edge | `libnvds_azure_edge_proto.so` | - |
-
-**CRITICAL: Connection String Format**
-
-DeepStream message broker uses **semicolon (`;`)** as separator, NOT colon (`:`).
-
-```python
-# CORRECT - semicolon separator
-"conn-str": "localhost;9092"
-
-# WRONG - colon separator (will fail to connect)
-"conn-str": "localhost:9092"
-```
-
-### Pipeline Usage
-
-```python
-# Using pyservicemaker Pipeline API
-# For simple pipelines (single source, no tee):
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    "conn-str": "localhost;9092",  # IMPORTANT: Use semicolon, not colon!
-    "topic": "deepstream-analytics",
-    "sync": 0,
-    "config": "/path/to/kafka_config.txt"
-})
-
-# For pipelines with dynamic sources OR tee splits:
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    "conn-str": "localhost;9092",  # IMPORTANT: Use semicolon, not colon!
-    "topic": "deepstream-analytics",
-    "sync": 0,
-    "async": 0,  # CRITICAL: Required for tee/dynamic sources!
-    "config": "/path/to/kafka_config.txt"
-})
-```
-
----
-
-## Protocol Adaptor Configurations
-
-### Kafka Protocol Adaptor
-
-#### Dependencies Installation
-
-```bash
-# Add Confluent repository
-sudo mkdir -p /etc/apt/keyrings
-wget -qO - https://packages.confluent.io/deb/7.8/archive.key | gpg \
-  --dearmor | sudo tee /etc/apt/keyrings/confluent.gpg > /dev/null
-
-CP_DIST=$(lsb_release -cs)
-echo "Types: deb
-URIs: https://packages.confluent.io/deb/8.0
-Suites: stable
-Components: main
-Architectures: $(dpkg --print-architecture)
-Signed-By: /etc/apt/keyrings/confluent.gpg
-
-Types: deb
-URIs: https://packages.confluent.io/clients/deb/
-Suites: ${CP_DIST}
-Components: main
-Architectures: $(dpkg --print-architecture)
-Signed-By: /etc/apt/keyrings/confluent.gpg" | sudo tee /etc/apt/sources.list.d/confluent-platform.sources > /dev/null
-
-# Install dependencies
-sudo apt-get update
-sudo apt-get install librdkafka-dev libglib2.0-dev libjansson-dev libssl-dev
-```
-
-#### Configuration File (cfg_kafka.txt)
-
-```ini
-[message-broker]
-# Consumer group ID for Kafka consumer
-#consumer-group-id = mygroup
-
-# Generic librdkafka configuration (applies to both producer and consumer)
-# Semicolon-separated key=value pairs
-#proto-cfg = "message.max.bytes=200000;log_level=6"
-
-# Producer-specific librdkafka configuration
-#producer-proto-cfg = "queue.buffering.max.messages=200000;message.send.max.retries=3"
-
-# Consumer-specific librdkafka configuration
-#consumer-proto-cfg = "max.poll.interval.ms=20000"
-
-# Partition key field name in JSON message
-# Use "sensor.id" for full schema, "sensorId" for minimal schema
-#partition-key = sensor.id
-
-# Enable connection sharing within same process
-#share-connection = 1
-```
-
-#### Connection String
-
-Format: `hostname;port`
-
-Example: `localhost;9092` or `kafka-broker.example.com;9092`
-
-#### TLS/SSL Configuration
-
-For secure connections, refer to `/opt/nvidia/deepstream/deepstream/sources/libs/kafka_protocol_adaptor/Security_Setup.md`
-
----
-
-### MQTT Protocol Adaptor
-
-#### Dependencies Installation
-
-```bash
-# Install dependencies
-sudo apt-get install libglib2.0-dev libcjson-dev libssl-dev
-
-# Add Mosquitto PPA and install
-sudo apt-add-repository ppa:mosquitto-dev/mosquitto-ppa
-sudo apt-get update
-sudo apt-get install libmosquitto-dev mosquitto
-```
-
-#### Configuration File (cfg_mqtt.txt)
-
-```ini
-[message-broker]
-# Username for broker authentication (deprecated - use env var)
-#username = user
-
-# Password for broker authentication (deprecated - use env var)
-#password = password
-
-# Unique client ID (empty = random)
-client-id = deepstream-client
-
-# TLS Configuration
-#enable-tls = 1
-#tls-cafile = /path/to/ca-cert.pem
-#tls-capath = /path/to/ca-certs-dir/
-#tls-certfile = /path/to/client-cert.pem
-#tls-keyfile = /path/to/client-key.pem
-
-# Connection sharing
-#share-connection = 1
-
-# Mosquitto loop timeout in ms
-#loop-timeout = 2000
-
-# Keep-alive interval in seconds
-#keep-alive = 60
-
-# Enable threaded mode (required for nvmsgbroker plugin)
-#set-threaded = 1
-```
-
-#### User Authentication via Environment Variables
-
-```bash
-export USER_MQTT=username
-export PASSWORD_MQTT=password
-```
-
-#### Connection String
-
-Format: `hostname;port`
-
-Example: `localhost;1883`
-
-#### Running Mosquitto Broker
-
-```bash
-# Add mosquitto user
-sudo adduser --system mosquitto
-
-# Run broker
-mosquitto
-
-# Or with config file
-mosquitto -c /etc/mosquitto/mosquitto.conf
-```
-
-#### Verify Messages
-
-```bash
-# Subscribe to topic
-mosquitto_sub -t deepstream-analytics -v
-
-# Publish test message
-mosquitto_pub -t deepstream-analytics -m 'test message'
-```
-
----
-
-### Redis Protocol Adaptor
-
-#### Dependencies Installation
-
-```bash
-# Install dependencies
-sudo apt-get install libglib2.0-dev libssl-dev libhiredis-dev
-```
-
-#### Configuration File (cfg_redis.txt)
-
-```ini
-[message-broker]
-# Redis server hostname
-#hostname=localhost
-
-# Redis server port
-#port=6379
-
-# Password for Redis AUTH (deprecated - use env var)
-#password=password
-
-# Redis stream key for payload
-#payloadkey=metadata
-
-# Consumer group name
-#consumergroup=mygroup
-
-# Consumer name
-#consumername=myname
-
-# Maximum stream size (for capped streams)
-#streamsize=10000
-
-# Connection sharing
-#share-connection = 1
-```
-
-#### User Authentication via Environment Variables
-
-```bash
-export PASSWORD_REDIS=password
-```
-
-#### Connection String
-
-Format: `hostname;port`
-
-Example: `localhost;6379`
-
-#### Running Redis Server
-
-```bash
-# Download and build Redis
-wget http://download.redis.io/releases/redis-6.0.8.tar.gz
-tar xzf redis-6.0.8.tar.gz
-cd redis-6.0.8
-make
-
-# Run server
-src/redis-server
-
-# Or with protected mode disabled (for external connections)
-src/redis-server --protected-mode no
-```
-
----
-
-### AMQP Protocol Adaptor (RabbitMQ)
-
-#### Dependencies Installation
-
-```bash
-# Install dependencies
-sudo apt-get install libglib2.0-dev librabbitmq-dev
-
-# Install RabbitMQ server (optional, for local testing)
-sudo apt-get install rabbitmq-server
-sudo service rabbitmq-server start
-```
-
-#### Configuration File (cfg_amqp.txt)
-
-```ini
-[message-broker]
-# RabbitMQ server hostname
-hostname = localhost
-
-# RabbitMQ server port
-port = 5672
-
-# Username (deprecated - use env var)
-username = guest
-
-# Password (deprecated - use env var)
-password = guest
-
-# AMQP exchange name
-exchange = amq.topic
-
-# Topic/routing key
-topic = deepstream-analytics
-
-# Maximum frame size
-amqp-framesize = 131072
-
-# Heartbeat interval in seconds (0 = disabled)
-#amqp-heartbeat = 0
-
-# Connection sharing
-#share-connection = 1
-```
-
-#### User Authentication via Environment Variables
-
-```bash
-export USER_AMQP=username
-export PASSWORD_AMQP=password
-```
-
-#### Connection String
-
-Format: `hostname;port;username;password`
-
-Example: `localhost;5672;guest;guest`
-
-#### Setup RabbitMQ Queue
-
-```bash
-# Enable management plugin
-sudo rabbitmq-plugins enable rabbitmq_management
-
-# Create queue
-sudo rabbitmqadmin -u guest -p guest -V / declare queue name=myqueue durable=false auto_delete=true
-
-# Bind queue to exchange
-rabbitmqadmin -u guest -p guest -V / declare binding source=amq.topic destination=myqueue routing_key=deepstream-analytics
-
-# List queues
-sudo rabbitmqctl list_queues
-```
-
-#### Consume Messages
-
-```bash
-# Install amqp-tools
-sudo apt-get install amqp-tools
-
-# Consume from queue
-amqp-consume -q "myqueue" -r "deepstream-analytics" -e "amq.topic" cat
-```
-
----
-
-### Azure IoT Protocol Adaptor
-
-#### Dependencies Installation
-
-```bash
-# Install dependencies
-sudo apt-get update
-sudo apt-get install -y libcurl4-openssl-dev libssl-dev uuid-dev libglib2.0-dev
-
-# Build Azure IoT SDK
-git clone https://github.com/Azure/azure-iot-sdk-c.git
-cd azure-iot-sdk-c
-git checkout tags/1.11.0
-git submodule update --init
-
-# Modify CMakeLists.txt:
-# - Line 61: set build_as_dynamic to ON
-# - Line 65: set use_edge_modules to ON
-
-mkdir cmake && cd cmake
-cmake ..
-cmake --build .
-sudo make install
-```
-
-#### Configuration File (cfg_azure.txt)
-
-```ini
-[message-broker]
-# Azure IoT Hub connection string
-#connection_str = HostName=<my-hub>.azure-devices.net;DeviceId=<device_id>;SharedAccessKey=<my-policy-key>
-
-# Custom message properties (key=value pairs)
-#custom_msg_properties = key1=value1;key2=value2;
-
-# Connection sharing
-#share-connection = 1
-
-# Cleanup timeout in seconds during disconnect
-#cleanup-timeout = 20
-```
-
-#### Connection String
-
-Full Azure IoT Hub connection string:
-```
-HostName=<my-hub>.azure-devices.net;DeviceId=<device_id>;SharedAccessKey=<my-policy-key>
-```
-
----
-
-## nvmsgbroker Library Configuration
-
-The nvmsgbroker library (wrapper around protocol adaptors) has its own configuration:
-
-### Configuration File (cfg_nvmsgbroker.txt)
-
-```ini
-[nvmsgbroker]
-# Enable auto-reconnection (0=disable, 1=enable)
-auto-reconnect=1
-
-# Reconnection retry interval in seconds
-retry-interval=1
-
-# Maximum retry limit in seconds
-max-retry-limit=3600
-
-# Work interval in microseconds
-work-interval=10000
-```
-
----
-
-## Message Payload Formats
-
-### Full Schema (payload-type=0)
-
-Generates separate JSON payload per object:
-
-```json
-{
-  "messageid": "unique-uuid",
-  "mdsversion": "1.0",
-  "@timestamp": "2024-01-15T10:30:00.000Z",
-  "place": {
-    "id": "1",
-    "name": "ParkingLot_A",
-    "type": "garage",
-    "location": {
-      "lat": 30.32,
-      "lon": -40.55,
-      "alt": 100.0
-    }
-  },
-  "sensor": {
-    "id": "CAMERA_001",
-    "type": "Camera",
-    "description": "Entrance Camera"
-  },
-  "analyticsModule": {
-    "id": "ANALYTICS_001",
-    "description": "Vehicle Detection",
-    "source": "ResNet18_TrafficCamNet",
-    "version": "1.0"
-  },
-  "object": {
-    "id": "1",
-    "speed": 0,
-    "direction": 0,
-    "orientation": 0,
-    "vehicle": {
-      "type": "car",
-      "make": "",
-      "model": "",
-      "color": "",
-      "license": ""
-    },
-    "bbox": {
-      "topleftx": 100,
-      "toplefty": 200,
-      "bottomrightx": 300,
-      "bottomrighty": 400
-    },
-    "location": {
-      "lat": 0,
-      "lon": 0,
-      "alt": 0
-    },
-    "coordinate": {
-      "x": 0,
-      "y": 0,
-      "z": 0
-    }
-  },
-  "event": {
-    "id": "event-uuid",
-    "type": "entry"
-  },
-  "videoPath": ""
-}
-```
-
-### Minimal Schema (payload-type=1)
-
-Multiple objects in single JSON payload:
-
-```json
-{
-  "messageid": "unique-uuid",
-  "mdsversion": "1.0",
-  "@timestamp": "2024-01-15T10:30:00.000Z",
-  "sensorId": "CAMERA_001",
-  "objects": [
-    {
-      "id": "1",
-      "type": "car",
-      "confidence": 0.95,
-      "bbox": {
-        "topleftx": 100,
-        "toplefty": 200,
-        "bottomrightx": 300,
-        "bottomrighty": 400
-      }
-    },
-    {
-      "id": "2",
-      "type": "person",
-      "confidence": 0.88,
-      "bbox": {
-        "topleftx": 400,
-        "toplefty": 150,
-        "bottomrightx": 450,
-        "bottomrighty": 350
-      }
-    }
-  ]
-}
-```
-
----
-
-## Troubleshooting
-
-### Common Issues
-
-1. **"Connection refused" error**
-   - Verify broker is running and accessible
-   - Check firewall rules
-   - Verify connection string format
-
-2. **"Library not found" error**
-   - Verify proto-lib path exists
-   - Check library dependencies: `ldd /opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so`
-
-3. **Messages not appearing in broker**
-   - Verify topic exists (or auto-create is enabled)
-   - Check broker logs for errors
-   - Enable DeepStream logging (see below)
-
-4. **TLS/SSL connection failures**
-   - Verify certificate paths
-   - Check certificate validity
-   - Ensure proper permissions on key files
-
-### Enable DeepStream Logging
-
-```bash
-# Setup logger
-chmod u+x /opt/nvidia/deepstream/deepstream/sources/tools/nvds_logger/setup_nvds_logger.sh
-sudo /opt/nvidia/deepstream/deepstream/sources/tools/nvds_logger/setup_nvds_logger.sh
-
-# View logs
-tail -f /tmp/nvds/ds.log
-```
-
----
-
-## Best Practices
-
-1. **Use async mode** (`sync=0`) for better performance
-2. **Tune payload frequency** with nvmsgconv's `frame-interval` (a payload is generated every N frames)
-3. **Use minimal schema** (`payload-type=1`) for lower bandwidth
-4. **Enable auto-reconnect** in nvmsgbroker config for resilience
-5. **Use environment variables** for credentials instead of config files
-6. **Monitor broker lag** to ensure consumers keep up
-7. **Use TLS/SSL** for production deployments
-8. **Implement retry logic**: Handle transient Kafka failures (see Part 1 above for Python examples)
-9. **Batch messages**: Reduce network overhead (see Part 1 above for batching patterns)
-10. **Use appropriate partitioning**: Use frame_number or source_id as key
-11. **Handle backpressure**: Pause pipeline if Kafka is slow
-12. **Monitor producer metrics**: Track send rates and errors
-13. **Clean shutdown**: Flush and close producers properly
-
----
-
-## Related Documentation
-
-- **GStreamer Plugins Overview**: `gstreamer_plugins.md`
-- **Service Maker Python API**: `service_maker_api.md`
diff --git a/skills/deepstream/deepstream-dev/references/media_extractor_advanced.md b/skills/deepstream/deepstream-dev/references/media_extractor_advanced.md
deleted file mode 100644
index 58576fb8..00000000
--- a/skills/deepstream/deepstream-dev/references/media_extractor_advanced.md
+++ /dev/null
@@ -1,911 +0,0 @@
-# Advanced Media Extraction with MediaExtractor, MediaChunk, and FrameSampler
-
-## Overview
-
-The `pyservicemaker.utils` module provides advanced utilities for extracting frames from media sources with precise control over timing, sampling, and batch processing. These utilities are particularly useful for:
-- Processing specific time segments (chunks) of video files
-- Frame sampling at precise intervals
-- Batch processing multiple video sources
-- Dynamic source addition during runtime
-- Seeking and timestamp-based frame extraction
-
-## Core Classes
-
-### MediaChunk
-
-A `MediaChunk` represents a specific time segment of a media source with sampling parameters.
-
-**Constructor**:
-```python
-from pyservicemaker.utils import MediaChunk
-
-chunk = MediaChunk(
-    source="path/to/video.mp4",
-    start_pts=0,           # Start timestamp in nanoseconds
-    duration=-1,           # Duration in nanoseconds (-1 = entire file)
-    interval=0             # Frame sampling interval in nanoseconds (0 = no skipping)
-)
-```
-
-**Parameters**:
-- `source` (str): File path or URL of media source
-- `start_pts` (int): Start timestamp in nanoseconds (default: 0)
-- `duration` (int): Duration in nanoseconds (default: -1 for entire file)
-- `interval` (int): Frame sampling interval in nanoseconds (default: 0 for no frame skipping)
-
-**Properties**:
-- `source`: Returns the media source path/URL
-- `start_pts`: Returns the start timestamp
-- `duration`: Returns the duration
-- `interval`: Returns the sampling interval
-
-**Example**:
-```python
-from pyservicemaker.utils import MediaChunk
-
-# Extract entire video
-chunk1 = MediaChunk(source="video1.mp4")
-
-# Extract 10 seconds starting from 5 seconds
-chunk2 = MediaChunk(
-    source="video2.mp4",
-    start_pts=5_000_000_000,   # 5 seconds in nanoseconds
-    duration=10_000_000_000     # 10 seconds in nanoseconds
-)
-
-# Extract with frame sampling every 0.5 seconds
-chunk3 = MediaChunk(
-    source="video3.mp4",
-    interval=500_000_000        # 0.5 seconds in nanoseconds
-)
-
-# Extract 30 seconds starting at 1 minute, sample every 2 seconds
-chunk4 = MediaChunk(
-    source="video4.mp4",
-    start_pts=60_000_000_000,   # 1 minute
-    duration=30_000_000_000,    # 30 seconds
-    interval=2_000_000_000      # 2 seconds
-)
-```
-
-### VideoFrame
-
-Represents a decoded video frame with timestamp information.
-
-**Constructor**:
-```python
-from pyservicemaker.utils import VideoFrame
-
-frame = VideoFrame(data=tensor, timestamp=pts)
-```
-
-**Parameters**:
-- `data` (Tensor): Frame data as DeepStream tensor
-- `timestamp` (int): Frame timestamp in nanoseconds (default: -1)
-
-**Properties**:
-- `timestamp`: Returns the frame timestamp
-- `tensor`: Returns the frame data tensor
-
-**Example**:
-```python
-# Typically created internally by FrameSampler.
-# output_queue is a queue.Queue, which is not iterable, so read it with get():
-while True:
-    frame = output_queue.get()
-    if frame is None:
-        break  # End of stream
-
-    print(f"Frame timestamp: {frame.timestamp} ns")
-    tensor_data = frame.tensor
-    # Process tensor_data...
-```
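-
-The sentinel-terminated read loop recurs throughout this document, so it can be wrapped in a small generator. The sketch below is illustrative only: `drain` is a hypothetical helper name (not part of `pyservicemaker`), and a plain `queue.Queue` filled by a thread stands in for an extractor output queue.
-
-```python
-import queue
-import threading
-
-def drain(q, sentinel=None):
-    """Yield items from q until the sentinel (None by default) arrives."""
-    while True:
-        item = q.get()
-        if item is sentinel:
-            return
-        yield item
-
-# Stand-in producer: three integers followed by the None end-of-stream marker.
-demo_queue = queue.Queue()
-producer = threading.Thread(
-    target=lambda: [demo_queue.put(x) for x in (0, 1, 2, None)]
-)
-producer.start()
-drained = list(drain(demo_queue))
-producer.join()
-print(drained)
-```
-
-With a helper like this, a consumer loop reduces to `for frame in drain(q): ...`, assuming the queue really does end with `None` as described above.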
-
-### FrameSampler
-
-Manages frame sampling logic based on MediaChunk specifications.
-
-**Constructor**:
-```python
-from pyservicemaker.utils import FrameSampler
-
-sampler = FrameSampler(chunk=media_chunk, seek_fn=None)
-```
-
-**Parameters**:
-- `chunk` (MediaChunk): Media chunk specification
-- `seek_fn` (Callable, optional): Function to call for seeking (default: None)
-
-**Properties**:
-- `done`: Returns True when chunk processing is complete
-
-**Methods**:
-
-#### `sample(buffer, pts)`
-Sample a frame based on chunk specifications.
-
-**Parameters**:
-- `buffer`: Buffer containing frame data
-- `pts` (int): Presentation timestamp in nanoseconds
-
-**Returns**: `VideoFrame` object if frame should be sampled, `None` otherwise
-
-**Example** (typically used internally):
-```python
-# Internal usage by MediaExtractor
-sampler = FrameSampler(chunk)
-frame = sampler.sample(buffer, pts)
-if frame is not None:
-    queue.put(frame)
-elif sampler.done:
-    print("Chunk processing complete")
-```
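-
-To make the roles of `start_pts`, `duration`, and `interval` concrete, the following pure-Python sketch selects timestamps from a list of frame PTS values. It is not the `FrameSampler` implementation: `select_pts` is a hypothetical function, and details such as treating the end of the chunk as exclusive and measuring the interval from the last kept frame are assumptions made for illustration.
-
-```python
-def select_pts(frame_pts, start_pts=0, duration=-1, interval=0):
-    """Return the timestamps (ns) that a chunk with these parameters would keep.
-
-    Assumes frame_pts is sorted in ascending order.
-    """
-    end_pts = None if duration < 0 else start_pts + duration
-    selected = []
-    next_pts = start_pts  # earliest timestamp eligible to be kept
-    for pts in frame_pts:
-        if pts < start_pts:
-            continue  # before the chunk starts
-        if end_pts is not None and pts >= end_pts:
-            break  # past the end of the chunk (assumed exclusive)
-        if pts >= next_pts:
-            selected.append(pts)
-            # interval == 0 means "keep every frame"
-            next_pts = pts + interval if interval > 0 else pts
-    return selected
-
-# Frames every 100 ms from 0.0 s to 2.0 s inclusive.
-frames = [i * 100_000_000 for i in range(21)]
-every_half_second = select_pts(frames, interval=500_000_000)
-one_second_window = select_pts(frames, start_pts=500_000_000, duration=1_000_000_000)
-print(every_half_second)
-print(len(one_second_window))
-```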
-
-### MediaExtractor
-
-High-level utility for extracting frames from media sources with advanced features.
-
-**Constructor**:
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-
-extractor = MediaExtractor(
-    chunks=[chunk1, chunk2, ...],  # List of MediaChunk objects
-    batch_size=0,                   # 0 = no batching, N = batch N sources
-    scaling=(1920, 1080),           # Target resolution (width, height)
-    n_thread=1,                     # Number of worker threads
-    q_size=1,                       # Output queue capacity
-    enable_seek=False,              # Enable seeking for frame retrieval
-    blocking=False                  # Block when queue is full
-)
-```
-
-**Parameters**:
-- `chunks` (List[MediaChunk], optional): List of media chunks to process
-- `batch_size` (int): Batch size for processing (0 = no batching, default: 0)
-- `scaling` (Tuple[int, int]): Target resolution (width, height), default: (1920, 1080)
-- `n_thread` (int): Number of worker threads (default: 1)
-- `q_size` (int): Output queue capacity (default: 1)
-- `enable_seek` (bool): Enable seeking for efficient frame retrieval (default: False)
-- `blocking` (bool): Block when output queue is full (default: False)
-
-**Methods**:
-
-#### `__call__()`
-Start extraction and return output queues.
-
-**Returns**: List of `queue.Queue` objects containing `VideoFrame` objects
-
-#### `append(chunk)`
-Dynamically add a new chunk during runtime (only if initialized without chunks).
-
-**Parameters**:
-- `chunk` (MediaChunk): Media chunk to add
-
-**Returns**: `queue.Queue` for the added chunk
-
-**Context Manager Support**:
-MediaExtractor supports context manager protocol for automatic cleanup.
-
-```python
-with MediaExtractor(chunks=[...]) as extractor:
-    queues = extractor()
-    # Process frames...
-# Automatic cleanup on exit
-```
-
-## Usage Patterns
-
-### Pattern 1: Extract Entire Video Files
-
-Extract all frames from multiple video files.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-import torch  # pip install torch torchvision (not in base DS container)
-
-def extract_all_frames(video_paths):
-    """Extract all frames from multiple videos"""
-    # Create chunks for each video
-    chunks = [MediaChunk(source=path) for path in video_paths]
-    
-    # Create extractor
-    with MediaExtractor(chunks=chunks, n_thread=len(video_paths), q_size=10) as extractor:
-        # Start extraction
-        queues = extractor()
-        
-        # Process frames from each video
-        for i, q in enumerate(queues):
-            print(f"Processing video {i}: {video_paths[i]}")
-            frame_count = 0
-            
-            while True:
-                frame = q.get()
-                if frame is None:
-                    break  # End of stream
-                
-                # Convert to PyTorch tensor
-                torch_tensor = torch.utils.dlpack.from_dlpack(frame.tensor)
-                
-                # Process frame
-                print(f"  Frame {frame_count}: timestamp={frame.timestamp} ns, shape={torch_tensor.shape}")
-                
-                frame_count += 1
-            
-            print(f"  Total frames: {frame_count}")
-
-# Example usage
-video_files = ["video1.mp4", "video2.mp4", "video3.mp4"]
-extract_all_frames(video_files)
-```
-
-### Pattern 2: Extract Time Segments
-
-Extract specific time segments from videos.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-
-def extract_time_segments(video_path, segments):
-    """
-    Extract specific time segments from a video
-    
-    Args:
-        video_path: Path to video file
-        segments: List of (start_time, duration) tuples in seconds
-    """
-    # Create chunks for each segment
-    chunks = [
-        MediaChunk(
-            source=video_path,
-            start_pts=int(start * 1e9),      # Convert to nanoseconds
-            duration=int(duration * 1e9)      # Convert to nanoseconds
-        )
-        for start, duration in segments
-    ]
-    
-    with MediaExtractor(chunks=chunks, n_thread=1, q_size=5) as extractor:
-        queues = extractor()
-        
-        for i, (q, (start, duration)) in enumerate(zip(queues, segments)):
-            print(f"Segment {i}: {start}s - {start+duration}s")
-            frames = []
-            
-            while True:
-                frame = q.get()
-                if frame is None:
-                    break
-                frames.append(frame)
-            
-            print(f"  Extracted {len(frames)} frames")
-            
-            # Process frames for this segment
-            for frame in frames:
-                # Your processing logic here
-                pass
-
-# Example: Extract three 10-second segments
-segments = [
-    (0, 10),      # First 10 seconds
-    (30, 10),     # 10 seconds starting at 30s
-    (60, 10)      # 10 seconds starting at 1 minute
-]
-extract_time_segments("long_video.mp4", segments)
-```
-
-### Pattern 3: Frame Sampling at Intervals
-
-Extract frames at specific intervals (e.g., every N seconds).
-
-```python
-import os
-
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-import cv2  # pip install opencv-python-headless (not in base DS container)
-import torch  # pip install torch torchvision (not in base DS container)
-
-def sample_frames_at_interval(video_path, interval_sec=1.0, output_dir="./sampled"):
-    """
-    Sample frames at regular intervals
-
-    Args:
-        video_path: Path to video file
-        interval_sec: Sampling interval in seconds
-        output_dir: Directory to save sampled frames
-    """
-    os.makedirs(output_dir, exist_ok=True)
-    
-    # Create chunk with sampling interval
-    chunk = MediaChunk(
-        source=video_path,
-        interval=int(interval_sec * 1e9)  # Convert to nanoseconds
-    )
-    
-    with MediaExtractor(chunks=[chunk], q_size=10) as extractor:
-        queues = extractor()
-        q = queues[0]
-        
-        frame_idx = 0
-        while True:
-            frame = q.get()
-            if frame is None:
-                break
-            
-            # Convert to numpy for saving
-            torch_tensor = torch.utils.dlpack.from_dlpack(frame.tensor)
-            frame_np = torch_tensor.cpu().numpy()
-            
-            # Convert RGB to BGR for OpenCV
-            frame_bgr = cv2.cvtColor(frame_np, cv2.COLOR_RGB2BGR)
-            
-            # Save frame
-            timestamp_sec = frame.timestamp / 1e9
-            filename = f"{output_dir}/frame_{frame_idx:06d}_t{timestamp_sec:.3f}s.jpg"
-            cv2.imwrite(filename, frame_bgr)
-            
-            print(f"Saved: {filename}")
-            frame_idx += 1
-        
-        print(f"Total sampled frames: {frame_idx}")
-
-# Sample frames every 2 seconds
-sample_frames_at_interval("video.mp4", interval_sec=2.0)
-```
-
-### Pattern 4: Batch Processing Multiple Sources
-
-Process multiple video sources in batches with scaling.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-import torch  # pip install torch torchvision (not in base DS container)
-
-def batch_process_videos(video_paths, batch_size=4, target_resolution=(1280, 720)):
-    """
-    Process multiple videos in batches with scaling
-    
-    Args:
-        video_paths: List of video file paths
-        batch_size: Number of videos to process in parallel
-        target_resolution: Target (width, height) for scaling
-    """
-    # Create chunks
-    chunks = [MediaChunk(source=path) for path in video_paths]
-    
-    # Create extractor with batching
-    with MediaExtractor(
-        chunks=chunks,
-        batch_size=batch_size,
-        scaling=target_resolution,
-        n_thread=1,
-        q_size=10
-    ) as extractor:
-        queues = extractor()
-        
-        # Process each batch queue
-        for batch_idx, q in enumerate(queues):
-            print(f"Processing batch {batch_idx}")
-            frame_count = 0
-            
-            while True:
-                frame = q.get()
-                if frame is None:
-                    break
-                
-                # Frame is already scaled to target resolution
-                torch_tensor = torch.utils.dlpack.from_dlpack(frame.tensor)
-                print(f"  Batch {batch_idx}, Frame {frame_count}: shape={torch_tensor.shape}")
-                
-                # Process batched frame
-                # ... your processing logic ...
-                
-                frame_count += 1
-            
-            print(f"  Batch {batch_idx} complete: {frame_count} frames")
-
-# Process 12 videos in batches of 4
-videos = [f"video_{i}.mp4" for i in range(12)]
-batch_process_videos(videos, batch_size=4, target_resolution=(1280, 720))
-```
-
-### Pattern 5: Dynamic Source Addition
-
-Add video sources dynamically during runtime.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-import threading
-import time
-
-def dynamic_extraction_system(n_threads=2):
-    """
-    System that accepts video processing requests dynamically
-    """
-    # Create extractor without initial chunks (for dynamic addition)
-    with MediaExtractor(chunks=None, n_thread=n_threads, q_size=5) as extractor:
-        # Start extractor threads
-        extractor()
-        
-        def process_chunk(chunk, queue):
-            """Process frames from a chunk"""
-            print(f"Processing: {chunk.source}")
-            frame_count = 0
-            
-            while True:
-                frame = queue.get()
-                if frame is None:
-                    break
-                
-                # Process frame
-                frame_count += 1
-            
-            print(f"Completed: {chunk.source} ({frame_count} frames)")
-        
-        # Simulate dynamic requests
-        video_requests = [
-            ("video1.mp4", 0, 10),    # (path, start_sec, duration_sec)
-            ("video2.mp4", 5, 15),
-            ("video3.mp4", 0, 20),
-            ("video4.mp4", 10, 10),
-        ]
-        
-        threads = []
-        for path, start_sec, duration_sec in video_requests:
-            # Create chunk
-            chunk = MediaChunk(
-                source=path,
-                start_pts=int(start_sec * 1e9),
-                duration=int(duration_sec * 1e9)
-            )
-            
-            # Add to extractor (returns queue for this chunk)
-            q = extractor.append(chunk)
-            
-            # Process in separate thread
-            t = threading.Thread(target=process_chunk, args=(chunk, q))
-            t.start()
-            threads.append(t)
-            
-            # Simulate delay between requests
-            time.sleep(0.5)
-        
-        # Wait for all processing to complete
-        for t in threads:
-            t.join()
-        
-        print("All requests processed")
-
-# Run dynamic extraction system
-dynamic_extraction_system(n_threads=2)
-```
-
-### Pattern 6: Frame Extraction with Seeking
-
-Enable seeking for efficient frame retrieval with large intervals.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-
-def extract_keyframes_with_seeking(video_path, keyframe_interval_sec=10.0):
-    """
-    Extract keyframes efficiently using seeking
-    
-    Args:
-        video_path: Path to video file
-        keyframe_interval_sec: Interval between keyframes in seconds
-    """
-    # Create chunk with large interval
-    chunk = MediaChunk(
-        source=video_path,
-        interval=int(keyframe_interval_sec * 1e9)
-    )
-    
-    # Enable seeking for efficient frame retrieval
-    with MediaExtractor(
-        chunks=[chunk],
-        enable_seek=True,  # Enable seeking
-        q_size=5
-    ) as extractor:
-        queues = extractor()
-        q = queues[0]
-        
-        keyframes = []
-        while True:
-            frame = q.get()
-            if frame is None:
-                break
-            
-            keyframes.append(frame)
-            print(f"Keyframe {len(keyframes)}: timestamp={frame.timestamp/1e9:.2f}s")
-        
-        print(f"Extracted {len(keyframes)} keyframes")
-        return keyframes
-
-# Extract keyframes every 10 seconds
-keyframes = extract_keyframes_with_seeking("long_video.mp4", keyframe_interval_sec=10.0)
-```
-
-### Pattern 7: Blocking Mode for Controlled Processing
-
-Use blocking mode to control frame processing rate.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-import time
-
-def controlled_frame_processing(video_path, processing_delay=0.1):
-    """
-    Process frames with controlled rate using blocking mode
-    
-    Args:
-        video_path: Path to video file
-        processing_delay: Simulated processing delay per frame
-    """
-    chunk = MediaChunk(source=video_path)
-    
-    # Use blocking mode with small queue
-    with MediaExtractor(
-        chunks=[chunk],
-        q_size=2,          # Small queue
-        blocking=True      # Block when queue is full
-    ) as extractor:
-        queues = extractor()
-        q = queues[0]
-        
-        frame_count = 0
-        while True:
-            frame = q.get()
-            if frame is None:
-                break
-            
-            # Simulate slow processing
-            print(f"Processing frame {frame_count}...")
-            time.sleep(processing_delay)
-            
-            frame_count += 1
-        
-        print(f"Processed {frame_count} frames")
-
-# Process with controlled rate
-controlled_frame_processing("video.mp4", processing_delay=0.1)
-```
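-
-The back-pressure idea behind `blocking=True` can be seen with the standard library alone. The sketch below uses only `queue.Queue` and does not call `pyservicemaker`; what `MediaExtractor` does with a frame when its queue is full in non-blocking mode is not stated here, so dropping the item is an assumption made purely for the demonstration. `offer` is a hypothetical helper.
-
-```python
-import queue
-import threading
-
-def offer(q, items, blocking):
-    """Put items on q and return how many were accepted."""
-    accepted = 0
-    for item in items:
-        try:
-            q.put(item, block=blocking)  # block=False raises queue.Full at once
-            accepted += 1
-        except queue.Full:
-            pass  # the item is dropped
-    return accepted
-
-# Non-blocking producer, no consumer: only maxsize items fit.
-full_queue = queue.Queue(maxsize=2)
-accepted_nonblocking = offer(full_queue, range(5), blocking=False)
-
-# Blocking producer with a consumer thread: every item is delivered in order.
-paced_queue = queue.Queue(maxsize=2)
-received = []
-consumer = threading.Thread(
-    target=lambda: received.extend(paced_queue.get() for _ in range(5))
-)
-consumer.start()
-accepted_blocking = offer(paced_queue, range(5), blocking=True)
-consumer.join()
-print(accepted_nonblocking, accepted_blocking, received)
-```
-
-In the blocking case the producer is paced by the consumer, which is the behavior Pattern 7 relies on when per-frame processing is slow.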
-
-## Advanced Usage
-
-### Multi-Threaded Parallel Extraction
-
-Process multiple videos in parallel using multiple threads.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-from concurrent.futures import ThreadPoolExecutor
-import torch  # pip install torch torchvision (not in base DS container)
-
-def parallel_video_analysis(video_paths, n_workers=4):
-    """
-    Analyze multiple videos in parallel
-    
-    Args:
-        video_paths: List of video file paths
-        n_workers: Number of parallel workers
-    """
-    # Create chunks
-    chunks = [MediaChunk(source=path) for path in video_paths]
-    
-    # Create extractor with multiple threads
-    with MediaExtractor(
-        chunks=chunks,
-        n_thread=n_workers,
-        q_size=10
-    ) as extractor:
-        queues = extractor()
-        
-        def analyze_video(video_idx, queue, video_path):
-            """Analyze a single video"""
-            print(f"Analyzing: {video_path}")
-            
-            frame_stats = {
-                'count': 0,
-                'total_intensity': 0.0,
-                'timestamps': []
-            }
-            
-            while True:
-                frame = queue.get()
-                if frame is None:
-                    break
-                
-                # Analyze frame
-                torch_tensor = torch.utils.dlpack.from_dlpack(frame.tensor)
-                mean_intensity = torch_tensor.float().mean().item()
-                
-                frame_stats['count'] += 1
-                frame_stats['total_intensity'] += mean_intensity
-                frame_stats['timestamps'].append(frame.timestamp)
-            
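-            # Compute statistics (guard against videos that yielded no frames)
-            count = frame_stats['count']
-            timestamps = frame_stats['timestamps']
-            avg_intensity = frame_stats['total_intensity'] / count if count else 0.0
-            duration_sec = (timestamps[-1] - timestamps[0]) / 1e9 if timestamps else 0.0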
-            
-            return {
-                'video': video_path,
-                'frames': frame_stats['count'],
-                'avg_intensity': avg_intensity,
-                'duration': duration_sec
-            }
-        
-        # Process all videos in parallel
-        with ThreadPoolExecutor(max_workers=n_workers) as executor:
-            futures = [
-                executor.submit(analyze_video, i, q, path)
-                for i, (q, path) in enumerate(zip(queues, video_paths))
-            ]
-            
-            results = [f.result() for f in futures]
-        
-        # Print results
-        for result in results:
-            print(f"\nVideo: {result['video']}")
-            print(f"  Frames: {result['frames']}")
-            print(f"  Duration: {result['duration']:.2f}s")
-            print(f"  Avg Intensity: {result['avg_intensity']:.2f}")
-
-# Analyze 8 videos with 4 workers
-videos = [f"video_{i}.mp4" for i in range(8)]
-parallel_video_analysis(videos, n_workers=4)
-```
-
-### Combining with Inference Pipeline
-
-Extract frames and run inference on them.
-
-```python
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-from pyservicemaker import Pipeline, Flow
-import torch  # pip install torch torchvision (not in base DS container)
-
-def extract_and_infer(video_path, model_config, segment_duration=30):
-    """
-    Extract video segments and run inference on each
-    
-    Args:
-        video_path: Path to video file
-        model_config: Path to inference model config
-        segment_duration: Duration of each segment in seconds
-    """
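-    import math
-    import cv2  # pip install opencv-python-headless (not in base DS container)
-
-    # Get video duration (simplified - use actual video metadata in production)
-    cap = cv2.VideoCapture(video_path)
-    fps = cap.get(cv2.CAP_PROP_FPS)
-    total_frames = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
-    cap.release()
-    if fps <= 0 or total_frames <= 0:
-        raise ValueError(f"Could not read duration metadata from {video_path}")
-    total_duration = total_frames / fps
-
-    # Create chunks for each segment; ceil avoids an empty trailing segment
-    # when the duration is an exact multiple of segment_duration
-    n_segments = math.ceil(total_duration / segment_duration)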
-    chunks = [
-        MediaChunk(
-            source=video_path,
-            start_pts=int(i * segment_duration * 1e9),
-            duration=int(segment_duration * 1e9)
-        )
-        for i in range(n_segments)
-    ]
-    
-    # Extract frames
-    with MediaExtractor(chunks=chunks, n_thread=2, q_size=10) as extractor:
-        queues = extractor()
-        
-        for seg_idx, q in enumerate(queues):
-            print(f"Processing segment {seg_idx}...")
-            
-            # Collect frames from segment
-            frames = []
-            while True:
-                frame = q.get()
-                if frame is None:
-                    break
-                frames.append(frame)
-            
-            print(f"  Segment {seg_idx}: {len(frames)} frames")
-            
-            # Run inference on frames (simplified example)
-            for frame in frames:
-                torch_tensor = torch.utils.dlpack.from_dlpack(frame.tensor)
-                # Run your inference model here
-                # results = model(torch_tensor)
-                pass
-
-# Extract and infer on 30-second segments
-extract_and_infer("long_video.mp4", "model_config.yml", segment_duration=30)
-```
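-
-The segment arithmetic in the example above can be isolated and checked without any media file. The sketch below is a hypothetical helper (`split_into_segments` is not a `pyservicemaker` function) that returns `(start_ns, duration_ns)` pairs suitable for `MediaChunk(start_pts=..., duration=...)`. It works in integer nanoseconds and shortens the last segment so that it does not run past the end of the video; whether the extractor itself tolerates an overlong final chunk is not covered here.
-
-```python
-NS_PER_SEC = 1_000_000_000
-
-def split_into_segments(total_sec, segment_sec):
-    """Return (start_ns, duration_ns) pairs covering total_sec without overshoot."""
-    total_ns = round(total_sec * NS_PER_SEC)
-    seg_ns = round(segment_sec * NS_PER_SEC)
-    if total_ns <= 0 or seg_ns <= 0:
-        return []
-    n_segments = (total_ns + seg_ns - 1) // seg_ns  # integer ceiling division
-    return [
-        (i * seg_ns, min(seg_ns, total_ns - i * seg_ns))
-        for i in range(n_segments)
-    ]
-
-segments_65s = split_into_segments(65, 30)
-segments_60s = split_into_segments(60, 30)
-print(segments_65s)
-print(segments_60s)
-```
-
-A 65-second video split into 30-second pieces yields three segments with a 5-second tail, while a 60-second video yields exactly two.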
-
-## Best Practices
-
-### 1. Timestamp Conversion
-Always use nanoseconds for timestamps:
-```python
-# Convert seconds to nanoseconds
-seconds = 10.5
-nanoseconds = int(seconds * 1e9)
-
-# Convert nanoseconds to seconds
-nanoseconds = 10_500_000_000
-seconds = nanoseconds / 1e9
-```
-
-### 2. Queue Size Management
-Choose appropriate queue size based on memory and processing speed:
-```python
-# Small queue for memory-constrained systems
-extractor = MediaExtractor(chunks=[...], q_size=2)
-
-# Larger queue for smooth processing
-extractor = MediaExtractor(chunks=[...], q_size=20)
-
-# Use blocking mode if processing is slow
-extractor = MediaExtractor(chunks=[...], q_size=5, blocking=True)
-```
-
-### 3. Thread Count Selection
-```python
-# Single thread for sequential processing
-extractor = MediaExtractor(chunks=[...], n_thread=1)
-
-# Multiple threads for parallel processing
-extractor = MediaExtractor(chunks=[...], n_thread=4)
-
-# Match thread count to CPU cores
-import os
-n_cores = os.cpu_count() or 1  # cpu_count() can return None
-extractor = MediaExtractor(chunks=[...], n_thread=n_cores)
-```
-
-### 4. Seeking Optimization
-Enable seeking for large sampling intervals:
-```python
-# Enable seeking when interval > 1 second
-if interval_sec > 1.0:
-    extractor = MediaExtractor(chunks=[...], enable_seek=True)
-else:
-    extractor = MediaExtractor(chunks=[...], enable_seek=False)
-```
-
-### 5. Context Manager Usage
-Always use context manager for automatic cleanup:
-```python
-# Good: Automatic cleanup
-with MediaExtractor(chunks=[...]) as extractor:
-    queues = extractor()
-    # Process frames...
-# Cleanup happens automatically
-
-# Avoid: Manual cleanup required
-extractor = MediaExtractor(chunks=[...])
-queues = extractor()
-# Must manually clean up
-```
-
-### 6. Error Handling
-```python
-import queue
-
-from pyservicemaker.utils import MediaExtractor, MediaChunk
-
-def safe_extraction(video_paths):
-    """Extract frames with error handling"""
-    chunks = [MediaChunk(source=path) for path in video_paths]
-
-    try:
-        with MediaExtractor(chunks=chunks, q_size=10) as extractor:
-            queues = extractor()
-
-            for i, q in enumerate(queues):
-                try:
-                    while True:
-                        frame = q.get(timeout=30)  # Timeout to detect stalls
-                        if frame is None:
-                            break
-
-                        # Process frame
-                        # ...
-
-                except queue.Empty:
-                    # q.get(timeout=...) raises queue.Empty, whose message is blank
-                    print(f"Timed out waiting for frames from video {i}")
-                    continue
-                except Exception as e:
-                    print(f"Error processing video {i}: {e}")
-                    continue
-    
-    except Exception as e:
-        print(f"Extraction error: {e}")
-```
-
-## Performance Tips
-
-### 1. Batch Processing
-Use batching for multiple sources:
-```python
-# Process 12 videos in batches of 4
-extractor = MediaExtractor(
-    chunks=chunks,
-    batch_size=4,  # Process 4 at a time
-    scaling=(1280, 720)
-)
-```
-
-### 2. Memory Management
-Control memory usage with queue size:
-```python
-# Low memory: small queue
-extractor = MediaExtractor(chunks=[...], q_size=2)
-
-# High throughput: larger queue
-extractor = MediaExtractor(chunks=[...], q_size=20)
-```
-
-### 3. Parallel Processing
-Use multiple threads for I/O-bound tasks:
-```python
-# Process 8 videos with 4 threads
-extractor = MediaExtractor(
-    chunks=chunks,
-    n_thread=4
-)
-```
-
-## Common Use Cases
-
-### 1. Video Thumbnail Generation
-Extract keyframes at regular intervals for thumbnails.
-
-### 2. Video Segmentation
-Split long videos into processable segments.
-
-### 3. Frame Sampling for Training Data
-Extract frames at intervals for ML training datasets.
-
-### 4. Video Quality Analysis
-Sample frames to analyze video quality metrics.
-
-### 5. Event Detection
-Extract frames around specific timestamps for event analysis.
-
-### 6. Multi-Video Synchronization
-Process multiple synchronized video sources in batches.
-
-## Troubleshooting
-
-### Issue 1: Frames Not Extracted
-**Solution**: Check that the source path is valid and that `start_pts`, `duration`, and `interval` are given in nanoseconds.
-
-### Issue 2: Memory Issues
-**Solution**: Reduce `q_size`, process frames as soon as they are dequeued, and use smaller batches.
-
-### Issue 3: Slow Extraction
-**Solution**: Enable seeking for large intervals, increase the thread count, or use batching.
-
-### Issue 4: Queue Timeout
-**Solution**: Increase the queue size, enable blocking mode, and check the video file's integrity.
-
-## Related APIs
-
-- **BufferProvider/Feeder**: See `buffer_apis.md`
-- **BufferRetriever/Receiver**: See `buffer_apis.md`
-- **Pipeline API**: See `service_maker_api.md`
-
-## Summary
-
-The MediaExtractor, MediaChunk, and FrameSampler utilities provide powerful capabilities for advanced frame extraction:
-
-1. **MediaChunk**: Define time segments and sampling parameters
-2. **FrameSampler**: Intelligent frame sampling based on timestamps
-3. **MediaExtractor**: High-level extraction with batching, threading, and seeking
-4. **VideoFrame**: Container for extracted frames with timestamps
-
-Key features:
-- Precise timestamp-based extraction
-- Frame sampling at intervals
-- Batch processing multiple sources
-- Dynamic source addition
-- Seeking optimization
-- Multi-threaded parallel processing
-- Context manager support for cleanup
-
-These utilities are ideal for video analysis, training data preparation, thumbnail generation, and any application requiring precise frame extraction from video sources.
-
diff --git a/skills/deepstream/deepstream-dev/references/metamux_config.md b/skills/deepstream/deepstream-dev/references/metamux_config.md
deleted file mode 100644
index bb65ef47..00000000
--- a/skills/deepstream/deepstream-dev/references/metamux_config.md
+++ /dev/null
@@ -1,373 +0,0 @@
-# nvdsmetamux Configuration Reference
-
-## Overview
-
-The `nvdsmetamux` GStreamer plugin performs batch metadata multiplexing for the same source and the "same" frame. This plugin is essential for pipelines where multiple inference models process the same video stream in parallel and their metadata needs to be merged.
-
-### Key Concepts
-
-- **Same Frame Matching**: The "same" frame is determined based on the frame PTS (Presentation Timestamp). The plugin searches for the nearest frame PTS of the same source.
-- **PTS Tolerance**: There is a configurable PTS difference tolerance for matching frames. If the PTS difference exceeds this tolerance, frames are not considered the same.
-- **Active Pad Selection**: Applications can select which sink pad's video frame will be passed to the source pad.
-- **Metadata Merging**: The plugin merges metadata from multiple inference models, allowing you to combine results from different GIEs.
-- **Metadata Filtering**: Applications can filter each model's metadata by source ID, so that only selected sources are output for a given model.
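-
-The frame-matching rule described above can be sketched in a few lines of Python. This is a conceptual illustration, not the plugin's code: `match_frame` is a hypothetical function, PTS values are assumed to be in nanoseconds, the tolerance is taken in microseconds as in the `pts-tolerance` property, and treating a difference exactly equal to the tolerance as a match is an assumption.
-
-```python
-def match_frame(base_pts_ns, branch_pts_ns, tolerance_us=60_000):
-    """Return the branch PTS nearest to base_pts_ns, or None if outside tolerance."""
-    if not branch_pts_ns:
-        return None
-    nearest = min(branch_pts_ns, key=lambda pts: abs(pts - base_pts_ns))
-    if abs(nearest - base_pts_ns) > tolerance_us * 1_000:  # microseconds to nanoseconds
-        return None
-    return nearest
-
-# Base frame at 1.000 s; branch frames at 0.900 s, 1.033 s, and 1.100 s.
-matched = match_frame(1_000_000_000, [900_000_000, 1_033_000_000, 1_100_000_000])
-# The only candidate is 100 ms away, which exceeds the 60 ms default tolerance.
-unmatched = match_frame(1_000_000_000, [1_100_000_000])
-print(matched, unmatched)
-```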
-
----
-
-## GStreamer Element Properties
-
-The `nvdsmetamux` element exposes the following GStreamer properties:
-
-### Core Properties
-
-| Property | Type | Description | Default |
-|----------|------|-------------|---------|
-| `active-pad` | string | Active sink pad whose buffer will transfer to source pad | null |
-| `config-file` | string | Path to the nvdsmetamux configuration file | null |
-| `pts-tolerance` | int64 | Time difference tolerance when searching for the same frame of the same source ID (in microseconds) | 60000 |
-| `name` | string | The name of the GStreamer object | "nvdsmetamux0" |
-| `parent` | GstObject | The parent of the GStreamer object | - |
-
-### Latency Properties
-
-| Property | Type | Description | Default |
-|----------|------|-------------|---------|
-| `latency` | uint64 | Additional latency in live mode to allow upstream to take longer to produce buffers (in nanoseconds) | 0 |
-| `min-upstream-latency` | uint64 | Override minimum latency for dynamically plugged sources with higher latency (in nanoseconds) | 0 |
-
-### Start Time Properties
-
-| Property | Type | Description | Default |
-|----------|------|-------------|---------|
-| `start-time` | uint64 | Start time to use if `start-time-selection=set` | 18446744073709551615 |
-| `start-time-selection` | enum | Decides which start time is output | 0 (zero) |
-
-**start-time-selection Values**:
-| Value | Name | Description |
-|-------|------|-------------|
-| 0 | zero | Start at 0 running time (default) |
-| 1 | first | Start at first observed input running time |
-| 2 | set | Set start time with `start-time` property |
-
----
-
-## Configuration File Reference
-
-The `nvdsmetamux` plugin uses a configuration file (specified via `config-file` property) to define metadata muxing behavior.
-
-### Configuration File Format
-
-The configuration file uses INI-style format with the following structure:
-
-```ini
-[property]
-enable=1
-# name of the sink pad whose data is passed to the src pad.
-active-pad=sink_0
-# default pts-tolerance is 60 ms.
-pts-tolerance=60000
-
-[user-configs]
-
-[group-0]
-# src-ids-model-<model unique ID>=<source ids>
-# metadata from all sources is muxed if this key is not set.
-src-ids-model-1=0;1;2
-src-ids-model-2=1;2;3
-```
-
-### Property Section
-
-The `[property]` section contains core configuration parameters.
-
-| Config Key | Type | Description | Default |
-|------------|------|-------------|---------|
-| `enable` | int | Enable the functions of MetaMux (0=disabled, 1=enabled) | 1 |
-| `active-pad` | string | Sink pad name whose data will be passed to source pad. Used to synchronize the sources from the branches. | - |
-| `pts-tolerance` | int64 | Maximum PTS difference, in microseconds, between a branch frame and the base (active pad) frame. If the difference exceeds this value, metamux does not merge the branch metadata into the current output. | 60000 |
-
-### User-Configs Section
-
-The `[user-configs]` section is a placeholder for user-defined configurations. This section can be empty or contain custom settings.
-
-### Group Section
-
-The `[group-0]` section (and additional `[group-N]` sections) configures source ID filtering for specific GIE models.
-
-| Config Key Pattern | Type | Description |
-|--------------------|------|-------------|
-| `src-ids-model-<unique-id>` | string | Semicolon-separated list of source IDs whose metadata is output for the specified GIE. The key suffix is the GIE's `unique-id`. If the key is not set, metadata from all sources of that GIE is muxed. |
-
-**Example**:
-```ini
-[group-0]
-src-ids-model-1=0;1;2
-src-ids-model-2=1;2;3
-```
-This means:
-- Output source 0, source 1, and source 2 inference results from the GIE with `unique-id=1`
-- Output source 1, source 2, and source 3 inference results from the GIE with `unique-id=2`
-
-**Note**: If `src-ids-model-<unique-id>` is not set for a particular GIE, the metadata of all sources from the GIE will be muxed by default.
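-
-As a quick sanity check of a configuration file before launching a pipeline, the group section can be read with Python's standard `configparser`. This is only a sketch: the plugin has its own parser, `read_source_filters` is a hypothetical helper, and the sample text simply mirrors the format shown above.
-
-```python
-import configparser
-
-SAMPLE = """
-[property]
-enable=1
-active-pad=sink_0
-pts-tolerance=60000
-
-[user-configs]
-
-[group-0]
-src-ids-model-1=0;1;2
-src-ids-model-2=1;2;3
-"""
-
-def read_source_filters(text, group="group-0"):
-    """Map each GIE unique-id to the list of source IDs configured for it."""
-    parser = configparser.ConfigParser()
-    parser.read_string(text)
-    prefix = "src-ids-model-"
-    filters = {}
-    if not parser.has_section(group):
-        return filters
-    for key, value in parser.items(group):
-        if key.startswith(prefix):
-            source_ids = [int(s) for s in value.split(";") if s.strip()]
-            filters[int(key[len(prefix):])] = source_ids
-    return filters
-
-filters = read_source_filters(SAMPLE)
-print(filters)
-```
-
-A GIE whose `unique-id` does not appear in the returned mapping has no filter, which per the note above means metadata from all of its sources is muxed.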
-
----
-
-## Complete Configuration Examples
-
-### Example 1: Basic MetaMux Configuration
-
-```ini
-# config_metamux.txt
-[property]
-enable=1
-# Name of the sink pad whose data is passed to the src pad.
-active-pad=sink_0
-# default pts-tolerance is 60 ms.
-pts-tolerance=60000
-
-[user-configs]
-
-[group-0]
-# src-ids-model-<model unique ID>=<source ids>
-# If not set, metadata from all sources is muxed.
-src-ids-model-1=0;1;2;3
-src-ids-model-3=0;1;3
-```
-
-### Example 2: Configuration with Larger PTS Tolerance
-
-```ini
-# config_metamux_large_tolerance.txt
-[property]
-enable=1
-active-pad=sink_0
-# Increased tolerance for high-latency pipelines
-pts-tolerance=100000
-
-[user-configs]
-
-[group-0]
-src-ids-model-1=0;1;2;3
-src-ids-model-2=0;1;2;3
-```
-
-### Example 3: Multiple GIE Source Filtering
-
-```ini
-# config_metamux_multi_gie.txt
-[property]
-enable=1
-active-pad=sink_0
-pts-tolerance=60000
-
-[user-configs]
-
-[group-0]
-# Primary detector (unique-id=1): output sources 0, 1, 2
-src-ids-model-1=0;1;2
-# Primary detector (unique-id=2): output sources 1, 2, 3
-src-ids-model-2=1;2;3
-# Primary detector (unique-id=3): output all sources
-src-ids-model-3=0;1;2;3
-```
-
----
-
-## Pipeline Examples
-
-This example uses an `nvstreamdemux` element to split the batched stream into individual streams, then re-muxes selected streams so that two models can run inference in parallel:
-- Primary object detector (ResNet18 TrafficCamNet)
-- YOLO26s detection model
-
-Pipeline Architecture:
-```
-4 video streams → nvstreammux → tee
-  ├─ Path 0 (Video): queue → nvdsmetamux sink_0
-  └─ Path 1 (Inference): queue → nvstreamdemux
-       ├─ Stream 0: queue → tee_0
-       ├─ Stream 1: queue → tee_1
-       ├─ Stream 2: queue → tee_2
-       └─ Stream 3: queue → tee_3
-            │
-            ├─ Branch 1: tee_0,1,2 → nvstreammux → nvinfer(ResNet18) → tracker → metamux sink_1
-            └─ Branch 2: tee_1,2,3 → nvstreammux → nvinfer(YOLO26s) → tracker → metamux sink_2
-                 │
-                 └─ nvdsmetamux → nvmultistreamtiler → nvdsosd → display
-```
-
-**Key Implementation Notes**:
-
-1. **Pad naming conventions**:
-   - `nvstreamdemux`: Use `"src_%u"` for output pads (auto-assigned in order)
-   - `nvdsmetamux`: Use `"sink_%u"` for input pads (auto-assigned in order)
-   - `nvstreammux`: Use `"sink_%u"` for input pads
-   - `tee`: Use `"src_%u"` for output pads
-
-2. **Linking order matters for `nvdsmetamux`**:
-   - First link → `sink_0` (should match `active-pad` in config)
-   - Second link → `sink_1`
-   - Third link → `sink_2`
-
-3. **`nvstreammux`**: Set `batched-push-timeout` to `40000` (microseconds).
-
-4. **Adaptive batching (process environment)**: Set the `NVSTREAMMUX_ADAPTIVE_BATCHING=yes` environment variable before the pipeline starts. Adaptive batching dynamically adjusts the batch size when a stream finishes early, avoiding empty slots in the batch.
-
-5. **`nvstreamdemux`**: Set `per-stream-eos: True` on `nvstreamdemux` so that each stream sends EOS independently upon completion, rather than waiting for all streams to finish. This prevents the pipeline from hanging while other streams are still active.
-
-**Code Pattern**:
-```python
-pipeline.add("nvstreammux", "mux", {
-    "batch-size": NUM_SOURCES,
-    "width": 1920,
-    "height": 1080,
-    "batched-push-timeout": 40000,
-})
-
-pipeline.add("nvstreamdemux", "demux", {"per-stream-eos": True})
-
-# Add queue and tee after demux for each stream
-for i in range(NUM_SOURCES):
-    pipeline.add("queue", f"queue_demux_{i}", {"max-size-buffers": 100})
-    pipeline.add("tee", f"tee_stream_{i}")
-
-# Link demux outputs - uses src_%u template
-for i in range(NUM_SOURCES):
-    pipeline.link(("demux", f"queue_demux_{i}"), ("src_%u", ""))
-    pipeline.link(f"queue_demux_{i}", f"tee_stream_{i}")
-
-# Link to metamux - use sink_%u template, order determines pad assignment
-pipeline.link(("queue_video_path", "metamux"), ("", "sink_%u"))  # → sink_0
-pipeline.link(("queue_branch1_out", "metamux"), ("", "sink_%u"))  # → sink_1
-pipeline.link(("queue_branch2_out", "metamux"), ("", "sink_%u"))  # → sink_2
-```
-
-**Configuration File** (`config_metamux.txt`):
-```ini
-[property]
-enable=1
-active-pad=sink_0
-pts-tolerance=60000
-
-[user-configs]
-
-[group-0]
-src-ids-model-1=0;1;2
-src-ids-model-2=1;2;3
-```
-
----
-
-## Common Use Cases
-
-### Use Case 1: Multi-Model Inference
-
-Combine results from multiple inference models (e.g., object detection + YOLO26s) into a single output stream.
-
-### Use Case 2: Selective Source Output
-
-Filter which source streams should have their inference results included in the final output using `src-ids-model-<model unique ID>=<source ids>` configuration.
-
----
-
-## Common Pitfalls
-
-### Pitfall 1: PTS Tolerance Too Small
-
-**Problem**: Frames are not being matched correctly, resulting in missing metadata.
-
-**❌ Wrong (1 ms is too small for variable latency)**:
-```ini
-[property]
-pts-tolerance=1000
-```
-
-**✅ Correct (60 ms tolerance, the default)**:
-```ini
-[property]
-pts-tolerance=60000
-```
-
-### Pitfall 2: Incorrect Active Pad
-
-**Problem**: Wrong video frame is being output to the source pad.
-
-**Solution**: Ensure `active-pad` matches one of your sink pad names (e.g., `sink_0`, `sink_1`).
-
-```ini
-[property]
-active-pad=sink_0
-```
-
-### Pitfall 3: Missing GIE Unique ID in src-ids-model
-
-**Problem**: Source ID filtering not working for a specific model.
-
-**❌ Wrong (missing `unique-id` suffix)**:
-```ini
-[group-0]
-src-ids-model=0;1;2;3
-```
-
-**✅ Correct (key includes the GIE `unique-id`, here 1)**:
-```ini
-[group-0]
-src-ids-model-1=0;1;2;3
-```
-
-### Pitfall 4: Missing Required Sections
-
-**Problem**: Configuration file missing required sections.
-
-**❌ Wrong**:
-```ini
-[property]
-enable=1
-active-pad=sink_0
-
-# Missing [user-configs] and [group-0] sections
-```
-
-**✅ Correct**:
-```ini
-[property]
-enable=1
-active-pad=sink_0
-pts-tolerance=60000
-
-[user-configs]
-
-[group-0]
-src-ids-model-1=0;1;2;3
-```
-
-### Pitfall 5: PTS Synchronization Issues
-
-**Problem**: When using separate nvstreammux instances, frames may have different PTS values.
-
-**Solution**:
-- Use the `tee` approach when possible to ensure consistent PTS across branches
-- Increase `pts-tolerance` if using separate streammux instances
-- Set `sync-inputs=0` on nvstreammux for live sources
-
----
-
-## Best Practices
-
-1. **Use tee for Single Source**: When processing the same streams through multiple models, use a `tee` element after the first nvstreammux to ensure consistent PTS values.
-
-2. **Set Appropriate PTS Tolerance**: Start with the default (60000 microseconds = 60ms) and adjust based on your pipeline's latency characteristics.
-
-3. **Configure Source IDs Explicitly**: Always specify which source IDs should be output from each model using `src-ids-model-<model unique ID>=<source ids>` to avoid unexpected metadata merging.
-
-4. **Use Queues**: Add `queue` elements before and after inference elements to prevent pipeline stalls.
-
-5. **Match Batch Sizes**: Ensure batch sizes are consistent across all branches feeding into nvdsmetamux.
-
----
-
-## Related Documentation
-
-- **GStreamer Plugins Overview**: `gstreamer_plugins.md`
-- **Use Cases and Pipelines**: `use_cases_pipelines.md`
-- **nvinfer Configuration Reference**: `nvinfer_config.md`
-- **Best Practices**: `best_practices.md`
diff --git a/skills/deepstream/deepstream-dev/references/nvinfer_config.md b/skills/deepstream/deepstream-dev/references/nvinfer_config.md
deleted file mode 100644
index bcf29df1..00000000
--- a/skills/deepstream/deepstream-dev/references/nvinfer_config.md
+++ /dev/null
@@ -1,656 +0,0 @@
-# nvinfer Configuration File Reference
-
-## Overview
-
-The `nvinfer` GStreamer plugin uses a configuration file to define model parameters, preprocessing settings, and postprocessing options. This document provides a complete reference for all configuration parameters.
-
-## Configuration File Formats
-
-nvinfer supports **two configuration file formats**:
-
-### Format 1: YAML Format (`.yml` or `.yaml`) - Recommended
-
-```yaml
-property:
-  gpu-id: 0
-  net-scale-factor: 0.00392156862745098
-  onnx-file: /path/to/model.onnx
-  batch-size: 1
-  # ... more properties
-
-class-attrs-all:
-  topk: 20
-  pre-cluster-threshold: 0.2
-```
-
-### Format 2: INI-style Text Format (`.txt`)
-
-```ini
-[property]
-gpu-id=0
-net-scale-factor=0.00392156862745098
-onnx-file=/path/to/model.onnx
-batch-size=1
-# ... more properties
-
-[class-attrs-all]
-topk=20
-pre-cluster-threshold=0.2
-```
-
-### Key Syntax Differences
-
-| Aspect | YAML Format | INI Format |
-|--------|-------------|------------|
-| File extension | `.yml` or `.yaml` | `.txt` |
-| Section headers | `property:` (no brackets) | `[property]` (with brackets) |
-| Key-value separator | `: ` (colon + space) | `=` (equals) |
-| Indentation | Required for nested values | Not used |
-| Comments | `#` at start of line | `#` at start of line |
-
----
-
-## Property Section Reference
-
-The `property` section contains core inference configuration.
-
-### Model Definition
-
-| Parameter | Type | Description | Default |
-|-----------|------|-------------|---------|
-| `onnx-file` | string | Path to ONNX model file | - |
-| `model-engine-file` | string | Path to a pre-built TensorRT engine file. When set, nvinfer loads this engine directly instead of regenerating it from the ONNX file on every run. The engine filename encodes the batch size, GPU index, and precision (see naming convention below). | - |
-| `custom-network-config` | string | Path to custom network config file | - |
-| `custom-lib-path` | string | Path to custom parsing library (.so) | - |
-| `labelfile-path` | string | Path to class labels text file | - |
-| `int8-calib-file` | string | Path to INT8 calibration file | - |
-| `tlt-model-key` | string | Encryption key for TAO/TLT models | - |
-
-**Usage Example (YAML)**:
-```yaml
-property:
-  onnx-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-  model-engine-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx_b1_gpu0_fp16.engine
-  labelfile-path: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-```
-
-#### model-engine-file — Purpose and Naming Convention
-
-**Purpose:** The first time nvinfer runs with an ONNX model, TensorRT builds an optimized engine and serializes it to a file. This build step can take **minutes**. By specifying `model-engine-file`, you tell nvinfer to load an already-built engine directly, **skipping the ONNX-to-engine conversion** on subsequent runs and dramatically reducing startup time.
-
-> **Agent guidance:** When generating nvinfer config files, **always include `model-engine-file`** alongside `onnx-file`. This avoids expensive re-compilation every time the pipeline starts. The engine file is specific to the batch size, GPU, and precision — if any of these change, a new engine must be generated (i.e. the first run without a matching engine file will trigger generation automatically).
-
-**Naming convention:** TensorRT engine files follow the pattern:
-
-```
-<onnx-filename>_b<batch-size>_gpu<gpu-id>_<precision>.engine
-```
-
-| Component | Meaning | Example |
-|-----------|---------|---------|
-| `<onnx-filename>` | Full ONNX filename including `.onnx` extension | `resnet18_trafficcamnet_pruned.onnx` |
-| `b<batch-size>` | Batch size the engine was built for | `b1`, `b4`, `b16` |
-| `gpu<gpu-id>` | GPU device index | `gpu0`, `gpu1` |
-| `<precision>` | Network precision mode | `fp32`, `int8`, `fp16` |
-
-**Examples by batch size:**
-
-```yaml
-# batch-size: 1
-property:
-  batch-size: 1
-  model-engine-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx_b1_gpu0_fp16.engine
-
-# batch-size: 4
-property:
-  batch-size: 4
-  model-engine-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx_b4_gpu0_fp16.engine
-
-# batch-size: 16 (e.g. secondary classifier)
-property:
-  batch-size: 16
-  model-engine-file: /opt/nvidia/deepstream/deepstream/samples/models/Secondary_VehicleMake/resnet18_vehiclemakenet_pruned.onnx_b16_gpu0_fp16.engine
-```
-
-**INI-style equivalent:**
-```ini
-[property]
-batch-size=4
-model-engine-file=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx_b4_gpu0_fp16.engine
-```
-
-### Processing Configuration
-
-| Parameter | Type | Values | Description | Default |
-|-----------|------|--------|-------------|---------|
-| `gpu-id` | int | 0, 1, 2... | GPU device ID | 0 |
-| `batch-size` | int | 1-32 | Maximum batch size | 1 |
-| `process-mode` | int | 1=Primary, 2=Secondary | Inference mode | 1 |
-| `network-mode` | int | 0=FP32, 1=INT8, 2=FP16 | Precision mode | 0 |
-| `network-type` | int | 0=Detector, 1=Classifier, 2=Segmentation, 3=Instance Segmentation | Network type. Use instead of the legacy `is-classifier` key. | 0 |
-| `interval` | int | 0-N | Skip N consecutive batches | 0 |
-| `gie-unique-id` | int | 1-N | Unique ID for this GIE | 1 |
-
-**Usage Example (YAML)**:
-```yaml
-property:
-  gpu-id: 0
-  batch-size: 4
-  process-mode: 1
-  network-mode: 2  # FP16
-  interval: 0
-  gie-unique-id: 1
-```
-
-### Network Input Configuration
-
-| Parameter | Type | Description | Default |
-|-----------|------|-------------|---------|
-| `net-scale-factor` | float | Input normalization scale factor | 1.0 |
-| `offsets` | string | Channel offsets (semicolon-separated) | - |
-| `model-color-format` | int | 0=RGB, 1=BGR, 2=GRAY | 0 |
-| `network-input-order` | int | 0=NCHW, 1=NHWC | 0 |
-| `infer-dims` | string | Input tensor dimensions in C;H;W format (semicolon-separated). **Required** when the ONNX model has dynamic input shapes (e.g., exported with `dynamic=True`). Tells TensorRT the concrete dimensions to use for the optimization profile. | Inferred from ONNX (only works for static shapes) |
-| `maintain-aspect-ratio` | int | 0=disabled, 1=enabled | 0 |
-| `symmetric-padding` | int | 0=disabled, 1=enabled | 0 |
-| `force-implicit-batch-dim` | int | 0=disabled, 1=enabled | 0 |
-
-> **Agent guidance — `infer-dims` and dynamic ONNX models:** Many popular model frameworks (Ultralytics YOLO, HuggingFace, etc.) export ONNX models with dynamic axes by default. These models have symbolic dimension names (e.g., `batch`, `height`, `width`) instead of fixed integers, which TensorRT reads as `-1`. Without `infer-dims`, TensorRT's `setDimensions` call fails because all dimensions must be >= 0. **Always add `infer-dims` when the ONNX model has dynamic input shapes.**
-
-**Usage Example (YAML)** — static-shape model (infer-dims optional):
-```yaml
-property:
-  net-scale-factor: 0.00392156862745098  # 1/255
-  offsets: 0;0;0
-  model-color-format: 0  # RGB
-  maintain-aspect-ratio: 1
-```
-
-**Usage Example (YAML)** — dynamic-shape ONNX model (infer-dims required):
-```yaml
-property:
-  net-scale-factor: 0.00392156862745098  # 1/255
-  model-color-format: 0  # RGB
-  infer-dims: 3;640;640  # REQUIRED for dynamic ONNX models
-  maintain-aspect-ratio: 1
-```
-
-**Usage Example (INI)** — dynamic-shape ONNX model:
-```ini
-[property]
-net-scale-factor=0.00392156862745098
-model-color-format=0
-infer-dims=3;640;640
-maintain-aspect-ratio=1
-```
-
-### Detection Configuration
-
-| Parameter | Type | Description | Default |
-|-----------|------|-------------|---------|
-| `num-detected-classes` | int | Number of classes in model | - |
-| `cluster-mode` | int | 0=GroupRectangles, 1=DBSCAN, 2=NMS, 3=DBSCAN+NMS, 4=None | 2 |
-| `parse-bbox-func-name` | string | Custom bbox parsing function name | - |
-| `output-blob-names` | string | Model output layer names (semicolon-separated) | - |
-
-**Usage Example (YAML)**:
-```yaml
-property:
-  num-detected-classes: 4
-  cluster-mode: 2  # NMS
-```
-
-> **Oriented bounding boxes (OBB) — `rotation_angle`:** `nvinfer` supports oriented bounding boxes via `NvDsInferObjectDetectionInfo.rotation_angle`. **If you are using an OBB model**, the angle output by the model can be **directly assigned** to `rotation_angle` in your custom bbox parser. **If you are not using an OBB model**, set `rotation_angle = 0`. In C++, `NvDsInferObjectDetectionInfo obj{};` value-initializes the struct and zero-initializes all fields, including `rotation_angle`; plain `NvDsInferObjectDetectionInfo obj;` does **not** and can leave rotated-box metadata uninitialized.
->
-> Example (C++):
-> ```cpp
-> NvDsInferObjectDetectionInfo obj{};
-> // ... fill classId, confidence, left/top/width/height ...
-> obj.rotation_angle = is_obb_model ? angle_from_model : 0.0f;
-> ```
-
-### Secondary GIE Configuration (process-mode: 2)
-
-| Parameter | Type | Description | Default |
-|-----------|------|-------------|---------|
-| `operate-on-gie-id` | int | GIE ID to operate on | -1 (all) |
-| `operate-on-class-ids` | string | Class IDs to process (semicolon-separated) | - |
-| `classifier-async-mode` | int | 0=sync, 1=async | 0 |
-| `classifier-threshold` | float | Classification confidence threshold | 0.0 |
-| `classifier-type` | string | Classifier label type (e.g., `vehicletype`, `vehiclemake`, `color`). Used to label classification results in metadata. | - |
-| `input-object-min-width` | int | Minimum object width to classify | 0 |
-| `input-object-min-height` | int | Minimum object height to classify | 0 |
-| `input-object-max-width` | int | Maximum object width to classify | INT_MAX |
-| `input-object-max-height` | int | Maximum object height to classify | INT_MAX |
-
-**Usage Example (YAML)** - Secondary classifier:
-```yaml
-property:
-  gpu-id: 0
-  onnx-file: /path/to/classifier.onnx
-  batch-size: 16
-  process-mode: 2
-  network-mode: 2
-  network-type: 1
-  gie-unique-id: 2
-  operate-on-gie-id: 1
-  operate-on-class-ids: 0
-  classifier-async-mode: 1
-  classifier-threshold: 0.51
-  classifier-type: vehicletype
-```
-
-### Tensor Output Configuration
-
-| Parameter | Type | Description | Default |
-|-----------|------|-------------|---------|
-| `output-tensor-meta` | int | 0=disabled, 1=enabled | 0 |
-| `output-instance-mask` | int | 0=disabled, 1=enabled | 0 |
-| `input-tensor-meta` | int | 0=disabled, 1=enabled | 0 |
-
-**Usage Example (YAML)**:
-```yaml
-property:
-  output-tensor-meta: 1  # Enable tensor output for custom postprocessing
-```
-
-### Scaling Configuration
-
-| Parameter | Type | Description | Default |
-|-----------|------|-------------|---------|
-| `scaling-filter` | int | Scaling filter type (0-5) | 0 |
-| `scaling-compute-hw` | int | 0=default, 1=GPU, 2=VIC | 0 |
-
----
-
-## Class Attributes Sections
-
-Class attributes sections configure detection parameters per class or for all classes.
-
-### class-attrs-all (All Classes)
-
-Applies to all detected classes.
-
-> **IMPORTANT — camelCase key**: The DBSCAN minimum cluster size parameter is `minBoxes` (camelCase). Do NOT use `min-boxes` (kebab-case) — it is not recognized and will produce an "unknown key" warning at runtime.
-
-| Parameter | Type | Description | Default |
-|-----------|------|-------------|---------|
-| `topk` | int | Maximum detections to keep after NMS | 20 |
-| `nms-iou-threshold` | float | NMS IoU threshold (0.0-1.0) | 0.3 |
-| `pre-cluster-threshold` | float | Confidence threshold before clustering | 0.4 |
-| `eps` | float | DBSCAN epsilon parameter | 0.0 |
-| `dbscan-min-score` | float | DBSCAN minimum confidence | 0.0 |
-| `minBoxes` | int | DBSCAN minimum cluster size (camelCase, NOT `min-boxes`) | 0 |
-| `roi-top-offset` | int | ROI top offset in pixels | 0 |
-| `roi-bottom-offset` | int | ROI bottom offset in pixels | 0 |
-| `detected-min-w` | int | Minimum detection width | 0 |
-| `detected-min-h` | int | Minimum detection height | 0 |
-| `detected-max-w` | int | Maximum detection width | INT_MAX |
-| `detected-max-h` | int | Maximum detection height | INT_MAX |
-
-**Usage Example (YAML)** - NMS clustering:
-```yaml
-class-attrs-all:
-  topk: 20
-  nms-iou-threshold: 0.5
-  pre-cluster-threshold: 0.2
-```
-
-**Usage Example (YAML)** - DBSCAN clustering:
-```yaml
-class-attrs-all:
-  detected-min-w: 4
-  detected-min-h: 4
-  minBoxes: 3
-  eps: 0.7
-  dbscan-min-score: 0.5
-```
-
-### class-attrs-N (Per-Class)
-
-Override attributes for specific class ID N.
-
-```yaml
-class-attrs-0:
-  topk: 30
-  nms-iou-threshold: 0.4
-  pre-cluster-threshold: 0.3
-
-class-attrs-1:
-  topk: 10
-  nms-iou-threshold: 0.6
-  pre-cluster-threshold: 0.5
-```
-
----
-
-## Complete Configuration Examples
-
-### Example 1: Primary Detector (YAML)
-
-```yaml
-# Primary detector using ResNet18 TrafficCamNet
-property:
-  gpu-id: 0
-  net-scale-factor: 0.00392156862745098
-  onnx-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-  model-engine-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx_b1_gpu0_fp16.engine
-  labelfile-path: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-  batch-size: 1
-  process-mode: 1
-  model-color-format: 0
-  network-mode: 2
-  num-detected-classes: 4
-  interval: 0
-  gie-unique-id: 1
-  cluster-mode: 2
-
-class-attrs-all:
-  topk: 20
-  nms-iou-threshold: 0.5
-  pre-cluster-threshold: 0.2
-
-class-attrs-0:
-  topk: 20
-  nms-iou-threshold: 0.5
-  pre-cluster-threshold: 0.4
-```
-
-### Example 2: Primary Detector (INI-style)
-
-```ini
-# Primary detector using ResNet18 TrafficCamNet
-[property]
-gpu-id=0
-net-scale-factor=0.00392156862745098
-onnx-file=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-model-engine-file=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx_b1_gpu0_fp16.engine
-labelfile-path=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-batch-size=1
-process-mode=1
-model-color-format=0
-network-mode=2
-num-detected-classes=4
-interval=0
-gie-unique-id=1
-cluster-mode=2
-
-[class-attrs-all]
-topk=20
-nms-iou-threshold=0.5
-pre-cluster-threshold=0.2
-
-[class-attrs-0]
-topk=20
-nms-iou-threshold=0.5
-pre-cluster-threshold=0.4
-```
-
-### Example 3: Secondary Classifier (YAML)
-
-```yaml
-# Secondary classifier for vehicle make
-property:
-  gpu-id: 0
-  net-scale-factor: 1.0
-  onnx-file: /opt/nvidia/deepstream/deepstream/samples/models/Secondary_VehicleMake/resnet18_vehiclemakenet_pruned.onnx
-  model-engine-file: /opt/nvidia/deepstream/deepstream/samples/models/Secondary_VehicleMake/resnet18_vehiclemakenet_pruned.onnx_b16_gpu0_fp16.engine
-  labelfile-path: /opt/nvidia/deepstream/deepstream/samples/models/Secondary_VehicleMake/labels.txt
-  batch-size: 16
-  process-mode: 2
-  model-color-format: 1
-  network-mode: 2
-  network-type: 1
-  gie-unique-id: 2
-  operate-on-gie-id: 1
-  operate-on-class-ids: 0
-  classifier-async-mode: 1
-  classifier-threshold: 0.51
-  classifier-type: vehiclemake
-```
-
-### Example 4: Tensor Output for Custom Postprocessing (YAML)
-
-```yaml
-# Enable tensor output for custom postprocessing
-property:
-  gpu-id: 0
-  net-scale-factor: 0.00392156862745098
-  onnx-file: /path/to/custom_model.onnx
-  batch-size: 1
-  process-mode: 1
-  model-color-format: 0
-  network-mode: 2
-  num-detected-classes: 4
-  gie-unique-id: 1
-  output-tensor-meta: 1
-  cluster-mode: 4  # No clustering, use custom postprocessing
-
-class-attrs-all:
-  pre-cluster-threshold: 0.1
-```
-
----
-
-## Common Pitfalls
-
-### Pitfall 1: Wrong Section Name
-
-**❌ Wrong (using `model:` instead of `property:`)**:
-```yaml
-model:
-  onnx-file: /path/to/model.onnx
-  batch-size: 1
-```
-
-**✅ Correct**:
-```yaml
-property:
-  onnx-file: /path/to/model.onnx
-  batch-size: 1
-```
-
-### Pitfall 2: Missing Colons in YAML
-
-**❌ Wrong**:
-```yaml
-property
-  gpu-id: 0
-```
-
-**✅ Correct**:
-```yaml
-property:
-  gpu-id: 0
-```
-
-### Pitfall 3: Wrong Indentation
-
-**❌ Wrong**:
-```yaml
-property:
-gpu-id: 0
-batch-size: 1
-```
-
-**✅ Correct**:
-```yaml
-property:
-  gpu-id: 0
-  batch-size: 1
-```
-
-### Pitfall 4: Using YAML syntax in INI file
-
-**❌ Wrong (YAML in .txt file)**:
-```ini
-property:
-  gpu-id: 0
-```
-
-**✅ Correct (INI format in .txt file)**:
-```ini
-[property]
-gpu-id=0
-```
-
-### Pitfall 5: Incorrect process-mode for Secondary GIE
-
-**❌ Wrong (using process-mode=1 for secondary)**:
-```yaml
-property:
-  process-mode: 1
-  operate-on-gie-id: 1  # Won't work with process-mode=1
-```
-
-**✅ Correct**:
-```yaml
-property:
-  process-mode: 2  # Must be 2 for secondary GIE
-  operate-on-gie-id: 1
-```
-
-### Pitfall 6: Missing `infer-dims` for Dynamic ONNX Models
-
-**❌ Wrong (no `infer-dims` with a dynamic-shape ONNX model)**:
-```yaml
-# Model exported with dynamic=True (e.g., Ultralytics YOLO)
-# ONNX input shape: [batch, 3, height, width] — all symbolic
-property:
-  onnx-file: yolo_model.onnx
-  net-scale-factor: 0.00392156862745098
-  # Missing infer-dims → TensorRT sees -1 for dynamic dims → engine build fails
-```
-
-**Error**: `IOptimizationProfile::setDimensions: Error Code 3: API Usage Error (Parameter check failed, condition: std::all_of(dims.d, dims.d + dims.nbDims, [](int32_t x) noexcept { return x >= 0; }))`
-
-**✅ Correct**:
-```yaml
-property:
-  onnx-file: yolo_model.onnx
-  net-scale-factor: 0.00392156862745098
-  infer-dims: 3;640;640  # C;H;W — tells TensorRT the concrete input dimensions
-```
-
-**When to add `infer-dims`**: Whenever the ONNX model was exported with dynamic axes (e.g., `dynamic=True` in Ultralytics, dynamic batch in other frameworks). If unsure, inspect the model with `python -c "import onnx; m = onnx.load('model.onnx'); print(m.graph.input)"` and check for symbolic dimension names.
-
-### Pitfall 7: Using Legacy `is-classifier` Instead of `network-type`
-
-**❌ Wrong (legacy key, produces deprecation warning)**:
-```yaml
-property:
-  is-classifier: 1
-```
-
-**✅ Correct (use `network-type` in YAML configs)**:
-```yaml
-property:
-  network-type: 1  # 0=Detector, 1=Classifier, 2=Segmentation, 3=Instance Segmentation
-```
-
-For primary detectors, simply omit both keys — the default is detector (`network-type: 0`).
-
-### Pitfall 8: Using `min-boxes` Instead of `minBoxes`
-
-**❌ Wrong (kebab-case — not recognized, produces "unknown key" warning)**:
-```yaml
-class-attrs-all:
-  min-boxes: 3
-```
-
-**✅ Correct (camelCase)**:
-```yaml
-class-attrs-all:
-  minBoxes: 3
-```
-
-Unlike most nvinfer config keys, which use kebab-case, `minBoxes` uses camelCase. This is a legacy naming exception in the parser.
-
----
-
-## DeepStream 9.0 Sample Model Paths
-
-DeepStream 9.0 includes sample models at:
-
-```
-/opt/nvidia/deepstream/deepstream/samples/models/
-├── Primary_Detector/
-│   ├── resnet18_trafficcamnet_pruned.onnx
-│   ├── labels.txt
-│   └── cal_trt.bin (INT8 calibration)
-├── Secondary_VehicleMake/
-│   ├── resnet18_vehiclemakenet_pruned.onnx
-│   └── labels.txt
-├── Secondary_VehicleTypes/
-│   ├── resnet18_vehicletypenet_pruned.onnx
-│   └── labels.txt
-└── SONYC_Audio_Classifier/
-    └── ...
-```
-
-**Primary Detector Labels** (4 classes):
-- 0: Car
-- 1: TwoWheeler
-- 2: Person
-- 3: RoadSign
-
----
-
-## GObject Properties vs Config File Parameters
-
-Some parameters can be set via GObject properties on the `nvinfer` element:
-
-```python
-pipeline.add("nvinfer", "infer", {
-    "config-file-path": "/path/to/config.yml",  # Required
-    "batch-size": 4,                             # Overrides config file
-    "unique-id": 1,                              # Overrides config file
-    "output-tensor-meta": 1,                     # Overrides config file
-    "interval": 2                                # Overrides config file
-})
-```
-
-**Properties settable via GObject** (override config file):
-- `batch-size`
-- `unique-id`
-- `process-mode`
-- `interval`
-- `output-tensor-meta`
-- `input-tensor-meta`
-- `output-instance-mask`
-- `model-engine-file`
-
-**Properties only in config file**:
-- `net-scale-factor`
-- `onnx-file`
-- `infer-dims`
-- `labelfile-path`
-- `num-detected-classes`
-- `cluster-mode`
-- All `class-attrs-*` parameters
-
----
-
-## Validation Checklist
-
-Before running your pipeline, verify:
-
-- [ ] Config file extension matches format (`.yml` for YAML, `.txt` for INI)
-- [ ] Section name is `property:` (YAML) or `[property]` (INI)
-- [ ] Model file path exists and is accessible
-- [ ] `model-engine-file` is set and its name matches the current `batch-size`, `gpu-id`, and `network-mode` (precision)
-- [ ] `infer-dims` is set if the ONNX model has dynamic input shapes (e.g., exported with `dynamic=True`)
-- [ ] `num-detected-classes` matches your model
-- [ ] `batch-size` <= number of streams
-- [ ] `process-mode` is correct (1=Primary, 2=Secondary)
-- [ ] Secondary GIE has `operate-on-gie-id` set correctly
-- [ ] `gie-unique-id` is unique across all nvinfer instances
-
----
-
-## Related Documentation
-
-- **GStreamer Plugins Overview**: `gstreamer_plugins.md`
-- **Service Maker Python API**: `service_maker_api.md`
-- **Use Cases & Pipelines**: `use_cases_pipelines.md`
-- **Best Practices**: `best_practices.md`
diff --git a/skills/deepstream/deepstream-dev/references/rest_api_dynamic.md b/skills/deepstream/deepstream-dev/references/rest_api_dynamic.md
deleted file mode 100644
index a4c5e9cb..00000000
--- a/skills/deepstream/deepstream-dev/references/rest_api_dynamic.md
+++ /dev/null
@@ -1,391 +0,0 @@
-# REST API and Dynamic Source Management
-
-## Overview
-
-DeepStream supports dynamic addition and removal of video sources at runtime through REST APIs. This capability is built into `nvmultiurisrcbin`, which integrates an HTTP REST server, multiple `nvurisrcbin` instances, and `nvstreammux` into a single GStreamer bin.
-
-**CRITICAL: Always use the built-in REST server in nvmultiurisrcbin. Do NOT implement a separate Flask/FastAPI server for stream management.**
-
----
-
-## Architecture
-
-```
-┌─────────────────────────────────────────────────────────────┐
-│                    nvmultiurisrcbin                         │
-│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────┐  │
-│  │ nvds_rest_   │  │ nvurisrcbin  │  │   nvstreammux    │  │
-│  │ server       │  │ (multiple)   │  │                  │  │
-│  │ Port: 9000   │  │              │  │                  │  │
-│  └──────────────┘  └──────────────┘  └──────────────────┘  │
-└─────────────────────────────────────────────────────────────┘
-```
-
----
-
-## Critical Configuration for Dynamic Sources
-
-### Sink Element Configuration
-
-**⚠️ CRITICAL: When using dynamic sources, the sink element MUST have `async=0`**
-
-```python
-# ✅ CORRECT - Required for dynamic source state transitions
-pipeline.add("nveglglessink", "sink", {
-    "sync": 0,   # Don't sync to clock (required for live sources)
-    "qos": 0,    # Disable QoS events
-    "async": 0   # CRITICAL: Synchronous state changes for dynamic streams
-})
-
-# ❌ WRONG - Will cause state transition deadlock
-pipeline.add("nveglglessink", "sink", {"sync": 0})  # Missing async=0
-```
-
-**Why `async=0` is required:**
-- Without it, the sink waits for preroll (first buffer) before allowing state transitions
-- With dynamic streams, this creates a deadlock: source waits for sink, sink waits for data
-- Setting `async=0` makes state changes synchronous, allowing proper transitions
-
-### nvmultiurisrcbin Configuration
-
-```python
-source_props = {
-    # REST API Server
-    "ip-address": "0.0.0.0",        # Listen on all interfaces
-    "port": 9000,                    # REST API port (0 to disable)
-    
-    # Batching
-    "max-batch-size": 16,            # Maximum number of sources
-    "batched-push-timeout": 33333,   # Push batch after 33ms even if not full
-    "width": 1920,
-    "height": 1080,
-    
-    # Dynamic source handling
-    "live-source": 1,                # REQUIRED for dynamic streams
-    "drop-pipeline-eos": 1,          # Keep pipeline alive when sources removed
-    "async-handling": 1,             # Handle async state changes
-    
-    # RTSP settings
-    "select-rtp-protocol": 0,        # 0=UDP+TCP auto, 4=TCP only
-    "latency": 100,                  # Jitterbuffer size in ms
-}
-
-pipeline.add("nvmultiurisrcbin", "src", source_props)
-```
-
----
-
-## REST API Endpoints
-
-The built-in REST server provides these endpoints:
-
-| Endpoint | Method | Description |
-|----------|--------|-------------|
-| `/api/v1/stream/add` | POST | Add a new stream |
-| `/api/v1/stream/remove` | POST | Remove a stream |
-| `/api/v1/stream/get-stream-info` | GET | Get current stream info |
-| `/api/v1/health/get-dsready-state` | GET | Check pipeline readiness |
-
-### Add Stream Payload
-
-```json
-{
-    "key": "sensor",
-    "value": {
-        "camera_id": "unique_sensor_id",
-        "camera_name": "human_readable_name",
-        "camera_url": "rtsp://camera-ip/stream",
-        "change": "camera_add"
-    }
-}
-```
-
-**Mandatory fields:**
-- `value/camera_id` - Unique identifier
-- `value/camera_url` - Stream URI
-- `value/change` - Must contain the substring "add"
-
-### Remove Stream Payload
-
-```json
-{
-    "key": "sensor",
-    "value": {
-        "camera_id": "unique_sensor_id",
-        "camera_url": "rtsp://camera-ip/stream",
-        "change": "camera_remove"
-    }
-}
-```
-
-**Note:** The `change` field must contain the substring "remove".
-
-### Example curl Commands
-
-```bash
-# Add a stream
-curl -X POST 'http://localhost:9000/api/v1/stream/add' -d '{
-    "key": "sensor",
-    "value": {
-        "camera_id": "cam_001",
-        "camera_name": "Front Door",
-        "camera_url": "rtsp://192.168.1.100/stream",
-        "change": "camera_add"
-    }
-}'
-
-# Remove a stream
-curl -X POST 'http://localhost:9000/api/v1/stream/remove' -d '{
-    "key": "sensor",
-    "value": {
-        "camera_id": "cam_001",
-        "camera_url": "rtsp://192.168.1.100/stream",
-        "change": "camera_remove"
-    }
-}'
-
-# Get stream info
-curl -X GET 'http://localhost:9000/api/v1/stream/get-stream-info'
-
-# Check pipeline readiness
-curl -X GET 'http://localhost:9000/api/v1/health/get-dsready-state'
-```
-
----
-
-## Complete Pipeline Example
-
-```python
-from pyservicemaker import (
-    Pipeline, Probe, BatchMetadataOperator,
-    StateTransitionMessage, DynamicSourceMessage
-)
-import platform
-
-def run_dynamic_source_pipeline():
-    """Pipeline with dynamic source management via REST API."""
-    
-    def on_message(message):
-        """Handle pipeline messages for dynamic sources."""
-        if isinstance(message, DynamicSourceMessage):
-            if message.source_added:
-                print(f"Camera ADDED: {message.sensor_name} "
-                      f"(id={message.sensor_id}, source_id={message.source_id})")
-            else:
-                print(f"Camera REMOVED: source_id={message.source_id}")
-        
-        elif isinstance(message, StateTransitionMessage):
-            state_name = str(message.new_state).split('.')[-1]
-            print(f"{message.origin} -> {state_name}")
-    
-    pipeline = Pipeline("dynamic-source-pipeline")
-    
-    # Source with built-in REST server
-    pipeline.add("nvmultiurisrcbin", "src", {
-        "ip-address": "0.0.0.0",
-        "port": 9000,                    # REST API on port 9000
-        "max-batch-size": 16,
-        "batched-push-timeout": 33333,
-        "width": 1920,
-        "height": 1080,
-        "live-source": 1,                # Required for dynamic sources
-        "drop-pipeline-eos": 1,
-        "async-handling": 1,
-        "select-rtp-protocol": 0,
-        "latency": 100,
-    })
-    
-    # Inference
-    pipeline.add("nvinfer", "pgie", {
-        "config-file-path": "/path/to/pgie_config.yml",
-        "batch-size": 16
-    })
-    
-    # Tiler for multi-stream display
-    pipeline.add("nvmultistreamtiler", "tiler", {
-        "width": 1920,
-        "height": 1080,
-        "rows": 4,
-        "columns": 4
-    })
-    
-    # OSD
-    pipeline.add("nvosdbin", "osd")
-    
-    # Sink - CRITICAL: async=0 for dynamic sources
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {
-        "sync": 0,
-        "qos": 0,
-        "async": 0  # CRITICAL for dynamic source state transitions
-    })
-    
-    # Link pipeline
-    pipeline.link("src", "pgie", "tiler", "osd", "sink")
-    
-    # Prepare and activate
-    pipeline.prepare(on_message)
-    pipeline.activate()
-    
-    print("Pipeline started. REST API available at http://localhost:9000")
-    print("Add streams with: POST /api/v1/stream/add")
-    
-    pipeline.wait()
-
-if __name__ == "__main__":
-    from multiprocessing import Process
-    process = Process(target=run_dynamic_source_pipeline)
-    process.start()
-    process.join()
-```
-
----
-
-## Handling DynamicSourceMessage
-
-When streams are added or removed, the pipeline emits `DynamicSourceMessage`:
-
-```python
-from pyservicemaker import DynamicSourceMessage
-
-def on_message(message):
-    if isinstance(message, DynamicSourceMessage):
-        source_id = message.source_id      # Internal source ID (int)
-        sensor_id = message.sensor_id      # Your camera_id from REST API
-        sensor_name = message.sensor_name  # Your camera_name from REST API
-        
-        if message.source_added:
-            # Stream successfully added
-            # Map source_id to your camera tracking
-            print(f"Added: {sensor_name} (sensor_id={sensor_id})")
-        else:
-            # Stream removed
-            print(f"Removed: source_id={source_id}")
-```
-
----
-
-## Common Errors and Solutions
-
-### Error: Stream added but no video displayed
-
-**Symptom:** REST API returns success, `DynamicSourceMessage` received, but elements stuck in PAUSED state.
-
-**Cause:** Missing `async=0` on sink element.
-
-**Solution:**
-```python
-# Add async=0 to sink
-pipeline.add("nveglglessink", "sink", {
-    "sync": 0,
-    "qos": 0,
-    "async": 0  # This is the fix
-})
-```
-
-### Error: No data from source, reconnection attempts
-
-**Symptom:**
-```
-WARNING from dsnvurisrcbin0: No data from source since last 10 sec. Trying reconnection
-Could not send message. (Received end-of-file)
-```
-
-**Cause:** RTSP source issue - invalid URL, authentication required, or network problem.
-
-**Solution:**
-1. Test RTSP URL with ffplay: `ffplay rtsp://camera-ip/stream`
-2. Include credentials: `rtsp://user:password@camera-ip/stream`
-3. Try different RTP protocol: `select-rtp-protocol: 4` (TCP only)
-
-### Error: Pipeline EOS when stream removed
-
-**Symptom:** Pipeline stops when the last stream is removed.
-
-**Solution:** Set `drop-pipeline-eos: 1` on nvmultiurisrcbin.
-
-### Anti-Pattern: Implementing Custom REST Server
-
-**❌ WRONG - Do not implement a separate Flask/FastAPI server:**
-```python
-# DON'T DO THIS
-from flask import Flask
-app = Flask(__name__)
-
-@app.route('/add-camera', methods=['POST'])
-def add_camera():
-    # Custom REST server adds complexity and potential bugs
-    pass
-```
-
-**✅ CORRECT - Use the built-in REST server:**
-```python
-# Just configure the port on nvmultiurisrcbin
-pipeline.add("nvmultiurisrcbin", "src", {
-    "port": 9000,  # Built-in REST server on port 9000
-    # ... other properties
-})
-# REST API is automatically available at http://localhost:9000/api/v1/
-```
-
-If you need a proxy API for simplified requests, make HTTP calls to the built-in server instead of reimplementing stream management.
-
----
-
-## Headless Operation
-
-For headless (no display) operation, use `fakesink`:
-
-```python
-import os
-
-if "DISPLAY" not in os.environ:
-    # Headless mode
-    pipeline.add("fakesink", "sink", {
-        "sync": 0,
-        "async": 0
-    })
-else:
-    # Display mode
-    pipeline.add("nveglglessink", "sink", {
-        "sync": 0,
-        "qos": 0,
-        "async": 0
-    })
-```
-
----
-
-## RTSP URL Formats
-
-Common RTSP URL formats by manufacturer:
-
-| Manufacturer | URL Format |
-|--------------|------------|
-| Hikvision | `rtsp://user:pass@ip:554/Streaming/Channels/101` |
-| Dahua | `rtsp://user:pass@ip:554/cam/realmonitor?channel=1&subtype=0` |
-| Axis | `rtsp://user:pass@ip/axis-media/media.amp` |
-| Generic | `rtsp://user:pass@ip:554/stream1` |
-| NVIDIA Demo | `rtsp://nv-wowza-pdc.nvidia.com:1935/vod/concat_wh_52.mp4` |
-
----
-
-## Quick Reference
-
-| Requirement | Property | Value |
-|-------------|----------|-------|
-| Enable REST API | `port` | 9000 (or any port, 0 to disable) |
-| Dynamic sources | `live-source` | 1 |
-| Keep pipeline alive | `drop-pipeline-eos` | 1 |
-| Async state changes | `async-handling` | 1 |
-| **Sink async** | `async` | **0 (CRITICAL)** |
-| Sink sync | `sync` | 0 |
-
----
-
-## Related Documentation
-
-- **GStreamer Plugins**: `gstreamer_plugins.md`
-- **Service Maker API**: `service_maker_api.md`
-- **Troubleshooting**: `troubleshooting.md`
-- **Configuration Classes**: `utilities_config.md`
diff --git a/skills/deepstream/deepstream-dev/references/service_maker_api.md b/skills/deepstream/deepstream-dev/references/service_maker_api.md
deleted file mode 100644
index 9abf3ae4..00000000
--- a/skills/deepstream/deepstream-dev/references/service_maker_api.md
+++ /dev/null
@@ -1,1790 +0,0 @@
-# DeepStream Service Maker for Python (pyservicemaker) API Reference
-
-## Introduction
-
-The DeepStream Service Maker provides a high-level Python API (`pyservicemaker`) for building DeepStream applications. It abstracts away the complexity of the GStreamer C API and provides a more intuitive, Pythonic interface for constructing video analytics pipelines.
-
-## Installation
-
-The `pyservicemaker` package ships with the DeepStream SDK as a wheel and is installed with:
-```bash
-pip install /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl pyyaml
-```
-
-**Inside a virtual environment**: `pyservicemaker` is installed system-wide but is NOT accessible from a standard venv. If the application uses a virtual environment, you must install it inside the venv:
-```bash
-python3 -m venv venv
-source venv/bin/activate
-pip install /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl pyyaml
-```
-
-## Two API Approaches
-
-Service Maker provides two APIs for building pipelines:
-
-1. **Pipeline API**: Low-level, element-by-element pipeline construction
-2. **Flow API**: High-level, declarative pipeline construction
-
----
-
-## Pipeline API
-
-The Pipeline API provides fine-grained control over pipeline construction, similar to the GStreamer C API but with Python syntax.
-
-### Core Classes
-
-#### Pipeline
-Main class for creating and managing DeepStream pipelines.
-
-**Constructor**:
-```python
-from pyservicemaker import Pipeline
-
-# Create empty pipeline
-pipeline = Pipeline("pipeline-name")
-
-# Create pipeline from YAML config
-pipeline = Pipeline("pipeline-name", "/path/to/config.yml")
-```
-
-**Methods**:
-
-##### `add(element_type, name, properties=None)`
-Add a GStreamer element to the pipeline.
-
-**Parameters**:
-- `element_type` (str): GStreamer element factory name (e.g., "nvinfer", "nvstreammux")
-- `name` (str): Unique name for the element
-- `properties` (dict, optional): Element properties as key-value pairs
-
-**Returns**: Pipeline instance (for method chaining)
-
-**Example**:
-```python
-pipeline.add("filesrc", "src", {"location": "/path/to/video.h264"})
-pipeline.add("h264parse", "parser")
-pipeline.add("nvv4l2decoder", "decoder")
-pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-pipeline.add("nvinfer", "infer", {"config-file-path": "/path/to/config.yml"})
-```
-
-##### `link(*element_names)`
-Link elements in sequence. Elements are connected in the order specified.
-
-**Parameters**:
-- `*element_names`: Variable number of element names or tuples for request pads
-
-**Returns**: Pipeline instance (for method chaining)
-
-**Example**:
-```python
-# Simple linear linking
-pipeline.link("src", "parser", "decoder", "mux", "infer", "sink")
-
-# Linking with request pads (for nvstreammux)
-pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-# This connects decoder src pad to mux sink_0 pad
-```
-
-**Request Pad Linking**:
-For elements with dynamic pads (like nvstreammux), use tuple syntax:
-```python
-# Format: (source_element, sink_element), (source_pad, sink_pad_template)
-pipeline.link(("decoder1", "mux"), ("", "sink_%u"))  # Connects to sink_0
-pipeline.link(("decoder2", "mux"), ("", "sink_%u"))  # Connects to sink_1
-```
-
-**CRITICAL: Always use "sink_%u" pad template, NOT "sink_0", "sink_1", or f"sink_{i}"**
-- `"sink_%u"` is a GStreamer pad template that automatically assigns sink pads (sink_0, sink_1, sink_2, etc.)
-- Using literal pad names like `"sink_0"` or `f"sink_{i}"` will FAIL because these pads don't exist until requested
-- The `%u` format specifier tells GStreamer to automatically assign the next available sink pad index
-
-**Examples with different source types**:
-```python
-# With nvv4l2decoder (decoded video source)
-pipeline.link((f"decoder{i}", "mux"), ("", "sink_%u"))  # CORRECT
-
-# With nvurisrcbin (RTSP/file source with dynamic pads)
-pipeline.link((f"src{i}", "mux"), ("", "sink_%u"))  # CORRECT - nvurisrcbin has dynamic src pad
-
-# WRONG - DO NOT USE:
-pipeline.link((f"src{i}", "mux"), ("", f"sink_{i}"))  # INCORRECT - will fail!
-pipeline.link((f"src{i}", "mux"), ("", "sink_0"))     # INCORRECT - pad doesn't exist!
-```
-
-##### `attach(target, what, name='', tips='', properties=None)`
-Attach a probe (or other custom object) to a named element in the pipeline.
-
-**Parameters**:
-- `target` (str): Name of the pipeline element to attach to
-- `what`: Probe instance or name of a built-in probe module (e.g. `"measure_fps_probe"`)
-- `name` (str, optional): Name for the probe. Not needed when `what` is an explicitly created Probe object.
-- `tips` (str, optional): Extra information for the custom object
-- `properties` (dict, optional): Properties to set on the object. Not applicable for explicitly created Probe objects.
-
-**CRITICAL**: The parameter is **`name`**, NOT `probe_name`. Using `probe_name` will raise `TypeError`.
-
-**Returns**: Pipeline instance (for method chaining)
-
-**Example**:
-```python
-from pyservicemaker import Probe, BatchMetadataOperator
-
-class MyProbe(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        # Process metadata
-        pass
-
-pipeline.attach("infer", Probe("my-probe", MyProbe()))
-# Or attach built-in probe by module name, giving it a name
-pipeline.attach("infer", "measure_fps_probe", name="fps-probe")
-```
-
-##### `start()`
-Start the pipeline (set to PLAYING state).
-
-**Returns**: Pipeline instance (for method chaining)
-
-**Example**:
-```python
-pipeline.start()
-```
-
-##### `wait()`
-Wait for pipeline to finish (blocking call until EOS or error).
-
-**Returns**: None
-
-**Example**:
-```python
-pipeline.start().wait()
-```
-
-##### `set(properties)`
-Set properties on an element (when element is accessed via indexing).
-
-**Parameters**:
-- `properties` (dict): Properties to set
-
-**Example**:
-```python
-pipeline["infer"].set({"batch-size": 4})
-```
-
-##### Element Access via Indexing
-Access elements by name to get/set properties:
-
-```python
-# Get element
-infer_element = pipeline["infer"]
-
-# Set properties
-pipeline["infer"].set({"batch-size": 4})
-
-# Get properties
-batch_size = pipeline["infer"].get("batch-size")
-```
-
-### Complete Pipeline API Example
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator
-import platform
-
-PIPELINE_NAME = "my-pipeline"
-CONFIG_FILE = "/path/to/inference_config.txt"  # Must be INI-style text format, NOT YAML
-VIDEO_FILE = "/path/to/video.h264"
-
-class ObjectCounter(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # IMPORTANT: object_items returns an ITERATOR, not a list
-            # You cannot use len() directly - iterate and count instead
-            obj_count = 0
-            for obj in frame_meta.object_items:
-                obj_count += 1
-            print(f"Frame {frame_meta.frame_number}: {obj_count} objects")
-
-# Create pipeline
-pipeline = (Pipeline(PIPELINE_NAME)
-    .add("filesrc", "src", {"location": VIDEO_FILE})
-    .add("h264parse", "parser")
-    .add("nvv4l2decoder", "decoder")
-    .add("nvstreammux", "mux", {
-        "batch-size": 1,
-        "width": 1920,
-        "height": 1080
-    })
-    .add("nvinfer", "infer", {"config-file-path": CONFIG_FILE})
-    .add("nvosdbin", "osd")
-    .add("nv3dsink" if platform.processor() == "aarch64" else "nveglglessink", "sink")
-    .link("src", "parser", "decoder")
-    .link(("decoder", "mux"), ("", "sink_%u"))
-    .link("mux", "infer", "osd", "sink")
-    .attach("infer", Probe("counter", ObjectCounter()))
-    .start()
-    .wait())
-```
-
----
-
-## Flow API
-
-The Flow API provides a high-level, declarative interface for common pipeline patterns.
-
-### Core Classes
-
-#### Flow
-High-level API for building pipelines using method chaining.
-
-**Constructor**:
-```python
-from pyservicemaker import Flow, Pipeline
-
-pipeline = Pipeline("pipeline-name")
-flow = Flow(pipeline)
-```
-
-**Methods**:
-
-##### `batch_capture(sources, record_config=None, **kwargs)`
-Configure batch capture from multiple sources.
-
-**Parameters**:
-- `sources` (list or str): List of source file paths or URIs, or the path to a YAML source list file
-- `record_config` (RecordConfig, optional): Smart recording configuration (see the full table in the **`record_config` details** section below). If **`None`**, no smart recording is configured on sources.
-- `kwargs` (dict): Optional overrides merged into mux and/or source properties (see the **`kwargs` dict details** section below).
-
-**`record_config` details**:
-A `RecordConfig` instance should be constructed as described in the **`record_config` Construction examples** section. The following `RecordConfig` fields configure smart recording.
-| Field | Type | Default | Used when | Meaning |
-|-------|------|---------|-----------|---------|
-| **`recording_type`** | **str** | **`"local"`** | Always | **`"local"`** or **`"cloud"`** (case-insensitive check in validation). |
-| **`proto_lib`** | **Optional[str]** | **`None`** | **`recording_type == "cloud"`** (required) | Path to the protocol library (e.g. Kafka proto **`libnvds_kafka_proto.so`**). Set on the smart-recording controller as **`proto-lib`**. |
-| **`conn_str`** | **Optional[str]** | **`None`** | Cloud (required) | Broker connection string (e.g. **`"localhost;9092"`**). Property **`conn-str`**. |
-| **`msgconv_config_file`** | **Optional[str]** | **`None`** | Cloud (required) | Message converter config file path. Property **`msgconv-config-file`**. |
-| **`proto_config_file`** | **Optional[str]** | **`None`** | Cloud (required) | Protocol adaptor config file path. Property **`proto-config-file`**. |
-| **`topic_list`** | **Optional[str]** | **`None`** | Cloud (required) | Comma-separated topic list. Property **`topic-list`**. |
-| **`rec_cache`** | **int** | **20** | **`record_config` is set** | Maps to **`smart-rec-cache`** on each source (cache size in seconds). |
-| **`rec_container`** | **int** | **0** | **`record_config` is set** | Maps to **`smart-rec-container`** (**0**: MP4, **1**: MKV). |
-| **`rec_dir_path`** | **str** | **`"."`** | **`record_config` is set** | Maps to **`smart-rec-dir-path`** (output directory for recordings). |
-| **`rec_mode`** | **int** | **0** | **`record_config` is set** | Maps to **`smart-rec-mode`**. Docstring: **0** both, **1** video-only, **2** audio-only. |
-
-**`record_config` Construction examples**:
-```python
-from pyservicemaker import RecordConfig
-
-# Local smart recording (minimal)
-rec_local = RecordConfig()  # recording_type defaults to "local"
-
-# Local with explicit paths and cache
-rec_local = RecordConfig(
-    recording_type="local",
-    rec_cache=20,
-    rec_container=0,
-    rec_dir_path="/data/recordings",
-    rec_mode=0,
-)
-
-# Cloud smart recording (all cloud fields required)
-rec_cloud = RecordConfig(
-    recording_type="cloud",
-    proto_lib="/path/to/broker_library.so",
-    conn_str="localhost;9092",
-    msgconv_config_file="/path/to/dstest5_msgconv_sample_config.txt",
-    proto_config_file="/path/to/cfg_kafka.txt",
-    topic_list="sr-test",
-    rec_cache=20,
-    rec_dir_path=".",
-    rec_mode=0,
-)
-```
-
-**`kwargs` dict details**:
-Each supported key in the merged **`kwargs`** dict overrides the default value of the corresponding **hyphenated** property. The following keys are supported:
-- `gpu_id` (int): Used as the `gpu-id` property of **`nvstreammux`** and as `gpu-id` on each **`nvurisrcbin`**.
-- `width` (int): Used as the `width` property of **`nvstreammux`**, default value is 1920.
-- `height` (int): Used as the `height` property of **`nvstreammux`**, default value is 1080.
-- `batch_size` (int): Used as the `batch-size` property of **`nvstreammux`**, default value is the number of URIs (if non-empty).
-- `batched_push_timeout` (int): Used as the `batched-push-timeout` property of **`nvstreammux`**, default value is 33000.
-- `buffer_pool_size` (int): Used as the `buffer-pool-size` property of **`nvstreammux`**, default value is 4.
-- `drop_pipeline_eos` (bool): Used as the `drop-pipeline-eos` property of **`nvstreammux`**, default value is False.
-- `live_source` (bool): Used as the `live-source` property of **`nvstreammux`**, default value is False.
-- `file_loop` (bool): Used as the `file-loop` property of each **`nvurisrcbin`**, default value is False.
-
-**Returns**: Flow instance (for method chaining)
-
-**Example**:
-```python
-flow.batch_capture([
-    "/path/to/video1.h264",
-    "/path/to/video2.h264",
-    "rtsp://camera-ip/stream"
-])
-
-# Mux resolution and batching setting
-flow.batch_capture(uris, width=1280, height=720, batch_size=4)
-
-# GPU and file loop for file sources
-flow.batch_capture(uris, gpu_id=0, file_loop=True)
-
-# Combine with YAML: kwargs supply keys missing from source-config.properties
-flow.batch_capture("/path/to/sources.yaml", width=1920, height=1080, live_source=True)
-```
-**Important**:
-`batch_capture` sets the nvstreammux `batch-size` to the number of input streams by default, so you do not need to pass `batch_size` unless you want to support dynamically adding or removing sources.
-
-
-##### `infer(config_file_path, with_triton=False, **kwargs)`
-Add inference stage to the pipeline.
-
-**Parameters**:
-- `config_file_path` (str): Path to inference configuration file
-- `with_triton` (bool): If **`False`** (default), adds **`nvinfer`**. If **`True`**, adds **`nvinferserver`** for Triton-based inference.
-- `kwargs` (dict): Optional properties passed to gst-nvinfer or gst-nvinferserver plugin of DeepStream. Underscores in keyword names are converted to hyphens for GStreamer properties (e.g. **`batch_size`** → **`batch-size`**). Common overrides include **`batch_size`**, **`unique_id`**, **`model_engine_file`**, **`gpu_id`**, and other keys supported by **nvinfer** / **nvinferserver** for your install.
-
-**Returns**: Flow instance (for method chaining)
-
-**Notes**: When running inference on multiple streams, set `batch_size` to the number of streams.
-
-**Examples**:
-```python
-flow.infer("/path/to/pgie_config.yml")
-
-# Set nvinfer/nvinferserver properties through Flow.infer
-flow.infer("/path/to/pgie_config.yml", unique_id=5, batch_size=4)
-```
-
-##### `track(**kwargs)`
-Add tracker for object tracking. Must be used after primary inference.
-
-**Parameters**:
-The following keyword arguments (kwargs) are passed to **nvtracker** as properties.
-| Property            | Type | Description |
-|---------------------|------|-------------|
-| **`ll_config_file`** | str  | Path to the low-level tracker config file (e.g. NvDCF, NvSORT, IOU). |
-| **`ll_lib_file`**    | str  | Path to the tracker library (e.g. `libnvds_nvmultiobjecttracker.so`). |
-| **`gpu_id`**         | int  | GPU device id (default 0). |
-
-**Notes**:
-Example tracker configs (paths may vary by installation):
-- NvDCF (performance): `config_tracker_NvDCF_perf.yml`
-- NvDCF (accuracy): `config_tracker_NvDCF_accuracy.yml`
-- NvSORT: `config_tracker_NvSORT.yml`
-- IOU: `config_tracker_IOU.yml`
-- NvDeepSORT: `config_tracker_NvDeepSORT.yml`
-
-**Example**:
-```python
-flow = flow.track(ll_config_file="/path/to/config_tracker_NvDCF_perf.yml", ll_lib_file="/path/to/libnvds_nvmultiobjecttracker.so")
-```
-
-##### `analyze(config_file_path, **kwargs)`
-Add analytics for region-of-interest (ROI), line-crossing, overcrowding and direction analytics. The result will be output as AnalyticsFrameMeta in frame meta and AnalyticsObjInfo in object meta.
-
-**Parameters**:
-- `config_file_path` (str): Path to analytics configuration file
-- `kwargs` (dict): Optional properties passed to gst-nvdsanalytics plugin of DeepStream
-
-**Notes**:
-The analytics stage MUST follow the tracker to work properly.
-
-**Example**:
-```python
-from pyservicemaker import Pipeline, Flow, BatchMetadataOperator, Probe, RenderMode
-
-PGIE_CONFIG = "/path/to/config_infer_primary.yml"
-TRACKER_LL_CONFIG = "/path/to/config_tracker_NvDCF_perf.yml"
-TRACKER_LL_LIB = "/path/to/libnvds_nvmultiobjecttracker.so"
-ANALYTICS_CONFIG = "/path/to/config_analytics.txt"  # nvdsanalytics config
-SOURCE = "/path/to/source_list.yaml"
-
-class AnalyticsProbe(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Frame-level analytics (ROI counts, line-cross counts)
-            for user_meta in frame_meta.nvdsanalytics_frame_items:
-                afm = user_meta.as_nvdsanalytics_frame()
-                if afm:
-                    print(f"Frame {frame_meta.frame_number}: unique_id={afm.unique_id} "
-                          f"obj_in_roi_cnt={afm.obj_in_roi_cnt} obj_lc_curr_cnt={afm.obj_lc_curr_cnt} "
-                          f"obj_cnt={afm.obj_cnt} oc_status={afm.oc_status}")
-
-            # Object-level analytics (which ROI/line each object is in)
-            for obj_meta in frame_meta.object_items:
-                for user_meta in obj_meta.nvdsanalytics_obj_items:
-                    aoi = user_meta.as_nvdsanalytics_obj()
-                    if aoi:
-                        print(f"  object_id={obj_meta.object_id} roi_status={aoi.roi_status} "
-                              f"lc_status={aoi.lc_status} dir_status={aoi.dir_status} obj_status={aoi.obj_status}")
-
-pipeline = Pipeline("analytics-demo")
-flow = Flow(pipeline).batch_capture(SOURCE, width=1920, height=1080)
-flow = flow.infer(PGIE_CONFIG)
-flow = flow.track(ll_config_file=TRACKER_LL_CONFIG, ll_lib_file=TRACKER_LL_LIB)
-flow = flow.analyze(ANALYTICS_CONFIG)
-flow = flow.attach(what=Probe("analytics_probe", AnalyticsProbe()))
-flow = flow.render(RenderMode.DISCARD, sync=False)
-flow()
-```
-
-##### `attach(what, name='', tips='', properties=None)`
-Attach a probe to the current flow.
-
-**Parameters**:
-- `what`: Probe instance or name of a built-in probe module (e.g. `"measure_fps_probe"`)
-- `name` (str, optional): Name for the probe. Not applicable when `what` is an explicitly created Probe object.
-- `tips` (str, optional): Extra information for the custom object
-- `properties` (dict, optional): Properties to set on the object.
-
-**Returns**: Flow instance (for method chaining)
-
-**Example**:
-```python
-from pyservicemaker import Probe
-# Attach a custom probe (name is embedded in the Probe object)
-flow.attach(Probe("my-probe", MyProbe()))
-
-# Attach built-in probe by module name and name the probe by 'name'
-flow = flow.attach(
-    what="measure_fps_probe",
-    name="fps_probe"
-)
-```
-
-##### `render()`
-Add rendering stage to the pipeline.
-
-**Returns**: Flow instance (for method chaining)
-
-**Example**:
-```python
-flow.render()
-```
-
-##### `__call__()` (Invocation)
-Execute the pipeline (start and wait).
-
-**Example**:
-```python
-flow()  # Starts and waits for completion
-```
-
-### Complete Flow API Example
-
-```python
-from pyservicemaker import Pipeline, Flow, Probe, BatchMetadataOperator
-
-class ObjectCounter(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # IMPORTANT: object_items is an ITERATOR - cannot use len()
-            obj_count = 0
-            for obj in frame_meta.object_items:
-                obj_count += 1
-            print(f"Frame {frame_meta.frame_number}: {obj_count} objects")
-
-def main():
-    pipeline = Pipeline("my-pipeline")
-    flow = Flow(pipeline)
-    
-    (flow.batch_capture(["/path/to/video.h264"])
-        .infer("/path/to/inference_config.txt")  # Must be INI-style text format
-        .attach(Probe("counter", ObjectCounter()))
-        .render()())
-    
-if __name__ == "__main__":
-    main()
-```
-
----
-
-## Metadata API
-
-### CRITICAL: Iterator Handling
-
-**⚠️ WARNING**: Properties like `frame_meta.object_items`, `frame_meta.tensor_items`, and `frame_meta.user_items` return **ITERATORS**, not lists!
-
-**Common Mistakes to Avoid**:
-```python
-# ❌ WRONG - Will crash with "TypeError: object of type 'iterator' has no len()"
-count = len(frame_meta.object_items)
-
-# ❌ WRONG - Iterator can only be consumed once
-for obj in frame_meta.object_items:
-    process(obj)
-for obj in frame_meta.object_items:  # This loop will be empty!
-    do_something(obj)
-```
-
-**Correct Patterns**:
-```python
-# ✅ CORRECT - Count by iterating
-obj_count = 0
-for obj in frame_meta.object_items:
-    obj_count += 1
-    process(obj)
-
-# ✅ CORRECT - If you need to iterate multiple times, convert to list first
-# (only if you actually need multiple iterations)
-object_list = list(frame_meta.object_items)
-count = len(object_list)
-for obj in object_list:
-    process(obj)
-```
-
----
-
-### BatchMetadataOperator
-Base class for implementing custom metadata processing.
-
-**Methods**:
-
-##### `handle_metadata(batch_meta)`
-Override this method to process batch metadata.
-
-**Parameters**:
-- `batch_meta`: BatchMetadata object containing frame and object metadata
-
-**Example**:
-```python
-class MyOperator(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Process each frame
-            # NOTE: object_items is an ITERATOR, not a list!
-            for object_meta in frame_meta.object_items:
-                # Process each object
-                pass
-```
-
-### BatchMetadata Object
-
-**Properties**:
-- `frame_items`: List of FrameMetadata objects
-- Methods for acquiring metadata objects
-
-**Methods**:
-- `acquire_object_meta()`: Create new object metadata
-- `acquire_display_meta()`: Create new display metadata
-- `acquire_user_meta()`: Create new user metadata
-- `acquire_event_message_meta()`: Create new `EventMessageUserMetadata` for nvmsgconv (see EventMessageUserMetadata section below)
-
-### FrameMetadata Object
-
-**Properties**:
-- `frame_number`: Frame number (int)
-- `pad_index`: Source pad index (int)
-- `batch_id`: Location of frame in the batch (int)
-- `source_id`: Source ID of the frame, e.g., camera ID (int)
-- `source_width`: Width of the frame at input to streammux (int)
-- `source_height`: Height of the frame at input to streammux (int)
-- `pipeline_width`: Width of the frame at output of streammux (int)
-- `pipeline_height`: Height of the frame at output of streammux (int)
-- `buffer_pts`: Presentation timestamp (PTS) of the frame in nanoseconds (int)
-- `ntp_timestamp`: NTP timestamp of the frame (int)
-- `object_items`: **ITERATOR** of ObjectMetadata objects (NOT a list - cannot use `len()`)
-- `tensor_items`: **ITERATOR** of TensorOutputUserMetadata objects (NOT a list - cannot use `len()`)
-- `segmentation_items`: **ITERATOR** of SegmentationUserMetadata objects (NOT a list - cannot use `len()`)
-- `nvdsanalytics_frame_items`: **ITERATOR** of AnalyticsFrameMeta objects (NOT a list - cannot use `len()`)
-
-**⚠️ IMPORTANT**: The `*_items` properties return iterators that can only be consumed once. See "CRITICAL: Iterator Handling" section above.
-
-**⚠️ NOTE**: There is no `timestamp` property. Use `buffer_pts` for PTS timestamp or `ntp_timestamp` for NTP timestamp.
-
-**Methods**:
-- `append(meta)`: Add metadata to frame
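-
-The one-shot behaviour of the `*_items` iterators can be illustrated without DeepStream. The sketch below uses a plain Python generator as a stand-in for `frame_meta.object_items` (the `fake_object_items` helper is hypothetical and not part of pyservicemaker). It shows why a second pass sees nothing, and why materializing with `list()` first is the safer pattern when a count or several passes are needed.
-
-```python
-def fake_object_items():
-    """Hypothetical stand-in for frame_meta.object_items: a one-shot iterator."""
-    yield from ("car", "person", "car")
-
-# Consuming the iterator directly: the second pass sees nothing.
-items = fake_object_items()
-first_pass = [obj for obj in items]
-second_pass = [obj for obj in items]
-
-# Materialize once, then reuse the list as often as needed.
-object_list = list(fake_object_items())
-object_count = len(object_list)
-car_count = sum(1 for obj in object_list if obj == "car")
-
-print(first_pass, second_pass, object_count, car_count)
-```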
-
-### ObjectMetadata Object
-
-**Properties**:
-- `class_id`: Class ID (int)
-- `confidence`: Confidence score (float)
-- `object_id`: Unique tracking ID assigned by tracker (int). Value is `0xFFFFFFFFFFFFFFFF` (UNTRACKED_OBJECT_ID) if object has not been tracked.
-- `tracker_confidence`: Confidence value from tracker (float). Set to -0.1 for KLT and IOU trackers.
-- `rect_params`: Rectangle parameters object
-  - `left`: Left coordinate (float)
-  - `top`: Top coordinate (float)
-  - `width`: Width (float)
-  - `height`: Height (float)
-  - `border_width`: Border width (int)
-  - `border_color`: Border color (Color object)
-- `label`: String describing the object class
-- `text_params`: Text parameters for OSD display (NvOSD_TextParams)
-- `mask_params`: Mask parameters for object overlay (NvOSD_MaskParams)
-- `classifier_items`: **ITERATOR** of ClassifierMetadata objects. (NOT a list - cannot use `len()`)
-- `tensor_items`: **ITERATOR** of TensorOutputUserMetadata objects. (NOT a list - cannot use `len()`)
-- `nvdsanalytics_obj_items`: **ITERATOR** of AnalyticsObjInfo objects. (NOT a list - cannot use `len()`)
-
-**Note**: The attribute is `object_id`, NOT `tracking_id`. This is the unique ID assigned by the tracker to track objects across frames.
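-
-A small helper can make the untracked sentinel explicit when filtering objects. This is a minimal sketch: the constant is defined locally from the value quoted above instead of being imported, and `is_tracked` and `tracked_ids` are hypothetical helper names, not pyservicemaker functions.
-
-```python
-# Sentinel value quoted in the object_id description, defined locally for this sketch.
-UNTRACKED_OBJECT_ID = 0xFFFFFFFFFFFFFFFF
-
-def is_tracked(object_id: int) -> bool:
-    """Return True when the tracker has assigned a real ID to the object."""
-    return object_id != UNTRACKED_OBJECT_ID
-
-def tracked_ids(object_ids):
-    """Keep only the IDs that belong to tracked objects."""
-    return [oid for oid in object_ids if is_tracked(oid)]
-
-sample_ids = [7, UNTRACKED_OBJECT_ID, 42]
-print(tracked_ids(sample_ids))
-```
-
-Inside `handle_metadata`, the same check would be applied to `object_meta.object_id` before using the ID as a dictionary key or sending it downstream.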
-
-### RectParams Object
-
-**Properties**:
-- `left`, `top`, `width`, `height`: Coordinates and dimensions
-- `border_width`: Border width
-- `border_color`: Border color (Color object)
-
-### TensorOutputUserMetadata Object
-
-**Methods**:
-- `as_tensor_output()`: Get tensor output object
-  - `get_layers()`: Get output layers dictionary
-
-**Example**:
-```python
-for user_meta in frame_meta.tensor_items:
-    tensor_output = user_meta.as_tensor_output()
-    layers = tensor_output.get_layers()
-    # layers is a dict: {"layer_name": tensor, ...}
-```
-
-### SegmentationUserMetadata Object
-
-**Properties**:
-- `unique_id`: Unique id of the component that generates the segmentation output.
-- `classes`: Number of classes in the segmentation output.
-- `width`, `height`: Width and height of the segmentation mask array.
-- `class_map`: Class map array of the segmentation output; shape `(height, width)`, dtype int. Each pixel holds the class index.
-- `class_probabilities_map`: Class probabilities map array; shape `(height, width, classes)`, dtype float. Optional; may be empty if not produced by the model.
-
-**Example**:
-```python
-from pyservicemaker import Pipeline, Flow, BatchMetadataOperator
-
-class MyOperator(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # frame_meta is FrameMetadata
-            for user_meta in frame_meta.segmentation_items:
-                # user_meta is UserMetadata (segmentation type)
-                seg_meta = user_meta.as_segmentation()
-                if seg_meta:  # cast is valid when meta type matches
-                    # Use SegmentationUserMetadata attributes
-                    print("unique_id:", seg_meta.unique_id)
-                    print("classes:", seg_meta.classes)
-                    print("width:", seg_meta.width, "height:", seg_meta.height)
-                    # class_map: (height, width) int array
-                    print("class_map shape:", seg_meta.class_map.shape)
-                    # class_probabilities_map: (height, width, classes) float array, if present
-                    if seg_meta.class_probabilities_map.size > 0:
-                        print("class_probabilities_map shape:", seg_meta.class_probabilities_map.shape)
-```
-
-### AnalyticsFrameMeta Object
-
-**Properties**:
-- `oc_status`: Map of overcrowding status per ROI (key = ROI label). Type: dict[str, bool]
-- `obj_in_roi_cnt`: Map of count of valid objects in each ROI (key = ROI label). Type: dict[str, int]
-- `obj_lc_curr_cnt`: Map of line-crossing count in the current frame per line (key = line/ROI label). Type: dict[str, int]
-- `obj_lc_cum_cnt`: Map of cumulative line-crossing count per line (key = line/ROI label). Type: dict[str, int]
-- `unique_id`: Unique identifier for the nvdsanalytics instance.
-- `obj_cnt`: Map of object count per class ID (key = class ID). Type: dict[int, int]
-
-**Example**:
-```python
-from pyservicemaker import Pipeline, Flow, BatchMetadataOperator
-
-class MyOperator(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # frame_meta is FrameMetadata
-            for user_meta in frame_meta.nvdsanalytics_frame_items:
-                # user_meta is UserMetadata (nvdsanalytics frame type)
-                analytics_frame_meta = user_meta.as_nvdsanalytics_frame()
-                if analytics_frame_meta:  # cast is valid when meta type matches
-                    # Use AnalyticsFrameMeta attributes
-                    print("Frame {0} component id: {1}".format(frame_meta.frame_number, analytics_frame_meta.unique_id))
-                    print("Frame {0} overcrowding status: {1}".format(frame_meta.frame_number, analytics_frame_meta.oc_status))
-                    print("Frame {0} object in ROI count: {1}".format(frame_meta.frame_number, analytics_frame_meta.obj_in_roi_cnt))
-                    print("Frame {0} object line crossing current count: {1}".format(frame_meta.frame_number, analytics_frame_meta.obj_lc_curr_cnt))
-                    print("Frame {0} object line crossing cumulative count: {1}".format(frame_meta.frame_number, analytics_frame_meta.obj_lc_cum_cnt))
-                    print("Frame {0} object count: {1}".format(frame_meta.frame_number, analytics_frame_meta.obj_cnt))
-```
-
-### AnalyticsObjInfo Object
-
-**Properties**:
-- `roi_status`: Array of ROI labels in which this object is present. Type: list[str].
-- `oc_status`: Array of OverCrowding labels in which this object is present. Type: list[str].
-- `lc_status`: Array of line-crossing labels which this object has crossed. Type: list[str].
-- `dir_status`: Direction string for the tracked object.
-- `unique_id`: Unique identifier for the nvdsanalytics instance.
-- `obj_status`: Status string for the tracked object.
-
-**Note**: AnalyticsObjInfo is stored as **user metadata** on the object. **ObjectMetadata** exposes an iterator **`nvdsanalytics_obj_items`** over user metadata of type **NVDS_USER_OBJ_META_NVDSANALYTICS**; each element is a **UserMetadata** instance, which you cast to **AnalyticsObjInfo** using **`as_nvdsanalytics_obj()`**.
-
-**Example**:
-```python
-from pyservicemaker import Pipeline, Flow, BatchMetadataOperator
-
-class MyOperator(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            for obj_meta in frame_meta.object_items:
-                # obj_meta is ObjectMetadata
-                for user_meta in obj_meta.nvdsanalytics_obj_items:
-                    # user_meta is UserMetadata (nvdsanalytics object type)
-                    analytics_obj = user_meta.as_nvdsanalytics_obj()
-                    if analytics_obj:  # cast is valid when meta type matches
-                        # Use AnalyticsObjInfo attributes
-                        print("Object {0} ROI status: {1}".format(obj_meta.object_id, analytics_obj.roi_status))
-                        print("Object {0} overcrowding status: {1}".format(obj_meta.object_id, analytics_obj.oc_status))
-                        print("Object {0} line crossing status: {1}".format(obj_meta.object_id, analytics_obj.lc_status))
-                        print("Object {0} moving in direction: {1}".format(obj_meta.object_id, analytics_obj.dir_status))
-                        print("Object {0} unique ID: {1}".format(obj_meta.object_id, analytics_obj.unique_id))
-                        print("Object {0} status: {1}".format(obj_meta.object_id, analytics_obj.obj_status))
-```
-
-### ClassifierMetadata Object
-
-**Properties**:
-- `n_labels`: Number of output labels of the classifier.
-- `unique_component_id`: Unique id of the component that generates the classifier metadata.
-
-**Methods**:
-- `get_n_label(n)`: Returns the nth label of the classifier (0-based index `n`).
-
-**Example**:
-```python
-from pyservicemaker import Pipeline, Flow, BatchMetadataOperator
-
-class MyOperator(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            for obj_meta in frame_meta.object_items:
-                for classifier_meta in obj_meta.classifier_items:
-                    # classifier_meta is ClassifierMetadata
-                    print("n_labels:", classifier_meta.n_labels)
-                    print("unique_component_id:", classifier_meta.unique_component_id)
-                    for i in range(classifier_meta.n_labels):
-                        label = classifier_meta.get_n_label(i)
-                        print(f"  label[{i}]:", label)
-```
-
----
-
-## OSD (On-Screen Display) API
-
-### osd Module
-
-Provides classes for creating OSD elements.
-
-#### Text
-Text display element.
-
-**Properties**:
-- `display_text`: Text content (bytes)
-- `x_offset`: X position (int)
-- `y_offset`: Y position (int)
-- `font`: Font object
-- `set_bg_color`: Enable background color (bool)
-- `bg_color`: Background color (Color object)
-
-#### Font
-Font specification.
-
-**Properties**:
-- `name`: Font family (FontFamily enum)
-- `size`: Font size (int)
-- `color`: Font color (Color object)
-
-#### FontFamily Enum
-- `Serif`
-- `Sans`
-- `Mono`
-
-#### Color
-Color specification (RGBA).
-
-**Properties**:
-- Red, Green, Blue, Alpha values (0.0 to 1.0)
-
-**Constructor**:
-```python
-color = osd.Color(1.0, 0.0, 0.0, 1.0)  # Red, fully opaque
-```
-
-### DisplayMeta Object
-
-**Methods**:
-- `add_text(text)`: Add text element
-- `add_rect(rect)`: Add rectangle element
-- `add_line(line)`: Add line element
-- `add_circle(circle)`: Add circle element
-
-### Example: Adding Text Overlay
-
-```python
-from pyservicemaker import osd
-
-display_meta = batch_meta.acquire_display_meta()
-text = osd.Text()
-text.display_text = b"Object Count: 5"
-text.x_offset = 10
-text.y_offset = 12
-text.font.name = osd.FontFamily.Serif
-text.font.size = 12
-text.font.color = osd.Color(1.0, 1.0, 1.0, 1.0)
-text.set_bg_color = True
-text.bg_color = osd.Color(0.0, 0.0, 0.0, 1.0)
-display_meta.add_text(text)
-frame_meta.append(display_meta)
-```
-
----
-
-## Postprocessing API
-
-### postprocessing Module
-
-Provides classes for custom postprocessing.
-
-#### ObjectDetectorOutputConverter
-Base class for converting tensor outputs to object detections.
-
-**Methods**:
-
-##### `__call__(output_layers)`
-Convert tensor outputs to list of bounding boxes.
-
-**Parameters**:
-- `output_layers` (dict): Dictionary of layer names to tensors
-
-**Returns**: List of bounding boxes `[class_id, confidence, x1, y1, x2, y2]`
-
-**Example**:
-```python
-from pyservicemaker import postprocessing
-import torch
-
-class MyConverter(postprocessing.ObjectDetectorOutputConverter):
-    def __call__(self, output_layers):
-        outputs = []
-        bbox_tensor = output_layers.get('bbox_layer')
-        conf_tensor = output_layers.get('conf_layer')
-        
-        # Compare against None explicitly; tensor truthiness can be ambiguous
-        if bbox_tensor is not None and conf_tensor is not None:
-            # Convert DLPack tensors to PyTorch
-            bbox = torch.utils.dlpack.from_dlpack(bbox_tensor)
-            conf = torch.utils.dlpack.from_dlpack(conf_tensor)
-            
-            # Process and convert to format: [class_id, confidence, x1, y1, x2, y2]
-            # ... processing logic ...
-            
-        return outputs
-```
-
-**Usage**:
-```python
-converter = MyConverter()
-objects = converter(output_layers)
-# objects is list of [class_id, confidence, x1, y1, x2, y2]
-```
-
----
-
-## Probe API
-
-### Probe Class
-
-Wrapper for attaching callback functions to pipeline elements.
-
-**Constructor** (two overloads):
-```python
-from pyservicemaker import Probe
-
-# Overload 1: Metadata-level probe (most common)
-probe = Probe("probe-name", BatchMetadataOperator())
-
-# Overload 2: Buffer-level probe (for raw buffer access)
-probe = Probe("probe-name", BufferOperator())
-```
-
-**Parameters**:
-- `name` (str): Name of the probe
-- `operator`: `BatchMetadataOperator` instance **or** `BufferOperator` instance
-
-**Built-in Probes**:
-- `"measure_fps_probe"`: Measures FPS
-- `"measure_latency_probe"`: Measures latency
-- `"add_message_meta_probe"`: Automatically generates `EventMessageUserMetadata` (NvDsEventMsgMeta) from object metadata for downstream `nvmsgconv` consumption. Use this when `msg2p-newapi=0` and you don't need custom control over sensor mappings.
-
-**Example**:
-```python
-# Custom probe
-probe = Probe("my-probe", MyOperator())
-
-# Built-in probe
-pipeline.attach("infer", "measure_fps_probe", "fps-probe")
-
-# Built-in message meta probe (for Kafka with msg2p-newapi=0)
-pipeline.attach("osd", "add_message_meta_probe", "metadata generator")
-```
-
-### BufferOperator Class
-
-Low-level probe interface for accessing raw `Buffer` objects flowing through a pad. Use `BufferOperator` instead of `BatchMetadataOperator` when you need to inspect or count raw buffers that do NOT carry batch metadata — e.g., on the `src` pad of `nvdsdynamicsrcbin` (before any `nvstreammux`).
-
-**Methods to Override**:
-
-##### `handle_buffer(buffer)`
-Called for every buffer that passes through the probed pad.
-
-**Parameters**:
-- `buffer` (Buffer): The buffer flowing through the pad
-
-**Returns**: `bool` — `True` to pass the buffer downstream (keep), `False` to drop it.
-
-**Buffer Object Properties/Methods** (available inside `handle_buffer`):
-- `buffer.timestamp` (int): PTS timestamp of the buffer
-- `buffer.get_chunk_id(batch_id)` (int): Chunk/source ID assigned by `nvdsdynamicsrcbin`. Always 0 for `uridecodebin`.
-- `buffer.extract(batch_id)` → `Tensor`: Extract frame data as a tensor
-
-**Example**:
-```python
-from pyservicemaker import Pipeline, Probe, BufferOperator
-
-class MyBufferProbe(BufferOperator):
-    def __init__(self):
-        super().__init__()
-        self.count = 0
-
-    def handle_buffer(self, buffer):
-        self.count += 1
-        print(f"Buffer #{self.count}  ts={buffer.timestamp}")
-        return True
-
-probe = MyBufferProbe()
-pipeline.attach("dynamicsrcbin", Probe("buf-probe", probe), tips="src")
-```
-
----
-
-## EventMessageUserMetadata
-
-`EventMessageUserMetadata` wraps `NvDsEventMsgMeta` and is **required** by `nvmsgconv` when `msg2p-newapi` is `0` (the default / legacy API). Without it, nvmsgconv silently produces zero messages.
-
-It is acquired from the `BatchMetadata` pool and must be populated and appended to the corresponding `FrameMetadata`.
-
-### Acquiring and Generating Event Message Metadata
-
-```python
-event_msg = batch_meta.acquire_event_message_meta()  # Acquire from pool
-event_msg.generate(object_meta, frame_meta, sensor_id, uri, labels)  # Populate
-frame_meta.append(event_msg)  # Attach to frame
-```
-
-**Parameters for `generate()`**:
-- `object_meta` (ObjectMetadata): The detected object to create a message for
-- `frame_meta` (FrameMetadata): The frame containing the object
-- `sensor_id` (str): Camera/sensor identifier string (e.g., `"Camera1"`)
-- `uri` (str): Source URI of the stream (e.g., `"file:///path/to/video.mp4"`)
-- `labels` (list[str]): List of class label strings matching class IDs (e.g., `["person", "bag", "face"]`)
-
-### Two Approaches
-
-#### Approach 1: Built-in Probe (Simple)
-
-Use the built-in `"add_message_meta_probe"` -- no custom Python class needed:
-
-```python
-# Attach AFTER inference/tracker, BEFORE nvmsgconv
-pipeline.attach("osd", "add_message_meta_probe", "metadata generator")
-```
-
-Reference: `deepstream_test4_app` sample
-(`/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test4_app/deepstream_test4.py`)
-
-#### Approach 2: Custom EventMessageGenerator (Full Control)
-
-For multi-camera pipelines where you need control over sensor mappings:
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator, SensorInfo
-
-class EventMessageGenerator(BatchMetadataOperator):
-    """Generate EventMessageUserMetadata for downstream nvmsgconv."""
-
-    def __init__(self, sensor_map, labels):
-        super().__init__()
-        self._sensor_map = sensor_map  # dict: source_id -> SensorInfo or str
-        self._labels = labels          # list of class label strings
-
-    def handle_metadata(self, batch_meta, frame_interval=1):
-        for frame_meta in batch_meta.frame_items:
-            frame_num = frame_meta.frame_number
-            for object_meta in frame_meta.object_items:
-                if not (frame_num % frame_interval):
-                    event_msg = batch_meta.acquire_event_message_meta()
-                    if event_msg:
-                        source_id = frame_meta.source_id
-                        sensor_info = self._sensor_map.get(source_id)
-                        sensor_id = sensor_info.sensor_id if sensor_info else "N/A"
-                        uri = sensor_info.uri if sensor_info else "N/A"
-                        event_msg.generate(
-                            object_meta, frame_meta, sensor_id, uri, self._labels
-                        )
-                        frame_meta.append(event_msg)
-
-# Attach probe upstream of nvmsgconv
-labels = ["car", "bicycle", "person", "roadsign"]
-sensor_map = {0: SensorInfo(sensor_id="Camera1", sensor_name="cam1", uri="file:///video1.mp4")}
-pipeline.attach("tracker", Probe("event_msg_gen", EventMessageGenerator(sensor_map, labels)))
-```
-
-Reference: `deepstream_test5_app` sample
-(`/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test5_app/deepstream_test5.py`)
-
-### SensorInfo Class
-
-Used to map source IDs to sensor metadata for `EventMessageGenerator`:
-
-```python
-from pyservicemaker import SensorInfo
-
-sensor_info = SensorInfo(
-    sensor_id="Camera1",       # Unique sensor identifier string
-    sensor_name="front_cam",   # Human-readable name
-    uri="rtsp://host/stream1"  # Source URI
-)
-```
-
----
-
-## YAML Configuration Support
-
-Pipelines can be created from YAML configuration files (for pipeline structure definition):
-
-```python
-pipeline = Pipeline("pipeline-name", "/path/to/pipeline_config.yml")
-```
-
-**Note**: This YAML config is for **pipeline structure** (elements, links, probes). The nvinfer `config-file-path` can point to either a YAML file (`.yml`) or INI-style text file (`.txt`) - both formats are supported.
-
-### YAML Structure Example (Pipeline Definition)
-
-```yaml
-pipeline:
-  name: my-pipeline
-  elements:
-    - name: src
-      type: filesrc
-      properties:
-        location: /path/to/video.h264
-    
-    - name: parser
-      type: h264parse
-    
-    - name: decoder
-      type: nvv4l2decoder
-    
-    - name: mux
-      type: nvstreammux
-      properties:
-        batch-size: 1
-        width: 1920
-        height: 1080
-    
-    - name: infer
-      type: nvinfer
-      properties:
-        # nvinfer supports both YAML (.yml) and INI-style (.txt) config formats
-        config-file-path: /path/to/pgie_config.yml
-    
-    - name: osd
-      type: nvosdbin
-    
-    - name: sink
-      type: nveglglessink
-  
-  links:
-    - [src, parser, decoder]
-    - [decoder, mux]
-    - [mux, infer, osd, sink]
-  
-  probes:
-    - element: infer
-      probe-name: my-probe
-      probe-type: custom
-      operator: MyOperator
-```
-
-### nvinfer Configuration (Both Formats Supported)
-
-The `config-file-path` for nvinfer supports **both YAML and INI-style text formats**:
-
-**YAML Format** (`.yml`) - Recommended:
-```yaml
-# pgie_config.yml - YAML format for nvinfer
-property:
-  gpu-id: 0
-  net-scale-factor: 0.00392156862745098
-  onnx-file: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-  labelfile-path: /opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-  batch-size: 1
-  process-mode: 1
-  model-color-format: 0
-  network-mode: 2
-  num-detected-classes: 4
-  cluster-mode: 2
-
-class-attrs-all:
-  topk: 20
-  pre-cluster-threshold: 0.2
-```
-
-**INI-style Format** (`.txt`):
-```ini
-# pgie_config.txt - INI-style format for nvinfer
-[property]
-gpu-id=0
-net-scale-factor=0.00392156862745098
-onnx-file=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx
-labelfile-path=/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/labels.txt
-batch-size=1
-process-mode=1
-model-color-format=0
-network-mode=2
-num-detected-classes=4
-cluster-mode=2
-
-[class-attrs-all]
-topk=20
-pre-cluster-threshold=0.2
-```
-
----
-
-## Common Patterns and Examples
-
-### Pattern 1: Single Stream with Detection
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator
-import platform
-
-def single_stream_detection(video_path, config_path):
-    pipeline = (Pipeline("single-stream")
-        .add("filesrc", "src", {"location": video_path})
-        .add("h264parse", "parser")
-        .add("nvv4l2decoder", "decoder")
-        .add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-        .add("nvinfer", "infer", {"config-file-path": config_path})
-        .add("nvosdbin", "osd")
-        .add("nv3dsink" if platform.processor() == "aarch64" else "nveglglessink", "sink")
-        .link("src", "parser", "decoder")
-        .link(("decoder", "mux"), ("", "sink_%u"))
-        .link("mux", "infer", "osd", "sink")
-        .start()
-        .wait())
-```
-
-### Pattern 2: Multi-Stream with Detection
-
-**Pattern 2a: Multi-Stream from Files**
-```python
-def multi_stream_detection(video_paths, config_path):
-    pipeline = Pipeline("multi-stream")
-    
-    # Add sources
-    for i, path in enumerate(video_paths):
-        pipeline.add("filesrc", f"src{i}", {"location": path})
-        pipeline.add("h264parse", f"parser{i}")
-        pipeline.add("nvv4l2decoder", f"decoder{i}")
-    
-    # Add muxer
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": len(video_paths),
-        "width": 1920,
-        "height": 1080
-    })
-    
-    # Add processing elements
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path})
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-    
-    # Link sources to muxer
-    for i in range(len(video_paths)):
-        pipeline.link(f"src{i}", f"parser{i}", f"decoder{i}")
-        pipeline.link((f"decoder{i}", "mux"), ("", "sink_%u"))  # CRITICAL: Use "sink_%u", NOT f"sink_{i}"
-    
-    # Link processing chain
-    pipeline.link("mux", "infer", "osd", "sink")
-    pipeline.start().wait()
-```
-
-**Pattern 2b: Multi-Stream RTSP with nvurisrcbin**
-```python
-def multi_rtsp_stream_detection(rtsp_urls, config_path):
-    """
-    Process multiple RTSP streams using nvurisrcbin.
-    
-    Args:
-        rtsp_urls: List of RTSP stream URLs (e.g., ["rtsp://...", "rtsp://..."])
-        config_path: Path to inference config file
-    """
-    pipeline = Pipeline("multi-rtsp-stream")
-    
-    # Add RTSP sources with nvurisrcbin (auto-detects codec and creates dynamic pads)
-    for i, url in enumerate(rtsp_urls):
-        pipeline.add("nvurisrcbin", f"src{i}", {"uri": url})
-    
-    # Add muxer for batching
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": len(rtsp_urls),
-        "width": 1920,
-        "height": 1080,
-        "batched-push-timeout": 40000,
-        "live-source": 1  # Important for RTSP streams
-    })
-    
-    # Add processing elements
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path, "batch-size": len(rtsp_urls)})
-    pipeline.add("nvmultistreamtiler", "tiler", {"rows": 2, "columns": 2})
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-    
-    # Link sources to muxer - CRITICAL: Use "sink_%u" pad template, NOT f"sink_{i}"
-    for i in range(len(rtsp_urls)):
-        # nvurisrcbin has dynamic src pad, so link directly to mux sink pad template
-        pipeline.link((f"src{i}", "mux"), ("", "sink_%u"))  # CORRECT - pad template auto-assigns sink_0, sink_1, etc.
-        # WRONG: pipeline.link((f"src{i}", "mux"), ("", f"sink_{i}"))  # This will FAIL!
-    
-    # Link processing chain
-    pipeline.link("mux", "infer", "tiler", "osd", "sink")
-    pipeline.start().wait()
-```
-
-### Pattern 3: Custom Metadata Processing
-
-```python
-from pyservicemaker import BatchMetadataOperator, Probe, osd
-
-class CustomProcessor(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Count objects by class
-            class_counts = {}
-            for obj in frame_meta.object_items:
-                class_id = obj.class_id
-                class_counts[class_id] = class_counts.get(class_id, 0) + 1
-            
-            # Add text overlay
-            display_meta = batch_meta.acquire_display_meta()
-            text = osd.Text()
-            text.display_text = f"Objects: {sum(class_counts.values())}".encode('ascii')
-            text.x_offset = 10
-            text.y_offset = 10
-            text.font.name = osd.FontFamily.Serif
-            text.font.size = 12
-            text.font.color = osd.Color(1.0, 1.0, 1.0, 1.0)
-            display_meta.add_text(text)
-            frame_meta.append(display_meta)
-
-# Attach probe
-pipeline.attach("infer", Probe("processor", CustomProcessor()))
-```
-
-### Pattern 4: Tensor-Based Custom Postprocessing
-
-```python
-import torch
-from pyservicemaker import BatchMetadataOperator, Probe, postprocessing
-
-class TensorConverter(postprocessing.ObjectDetectorOutputConverter):
-    def __call__(self, output_layers):
-        outputs = []
-        # Extract tensors
-        bbox_layer = output_layers.get('bbox')
-        conf_layer = output_layers.get('conf')
-        
-        # Compare against None explicitly; tensor truthiness can be ambiguous
-        if bbox_layer is not None and conf_layer is not None:
-            bbox = torch.utils.dlpack.from_dlpack(bbox_layer)
-            conf = torch.utils.dlpack.from_dlpack(conf_layer)
-            
-            # Process tensors and convert to [class_id, conf, x1, y1, x2, y2]
-            # ... processing logic ...
-            
-        return outputs
-
-class TensorProcessor(BatchMetadataOperator):
-    def __init__(self):
-        super().__init__()
-        self._converter = TensorConverter()
-    
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            for tensor_meta in frame_meta.tensor_items:
-                output_layers = tensor_meta.as_tensor_output().get_layers()
-                objects = self._converter(output_layers)
-                
-                # Create object metadata
-                for obj in objects:
-                    obj_meta = batch_meta.acquire_object_meta()
-                    obj_meta.class_id = obj[0]
-                    obj_meta.confidence = obj[1]
-                    obj_meta.rect_params.left = obj[2]
-                    obj_meta.rect_params.top = obj[3]
-                    obj_meta.rect_params.width = obj[4] - obj[2]
-                    obj_meta.rect_params.height = obj[5] - obj[3]
-                    frame_meta.append(obj_meta)
-
-# Enable tensor output in nvinfer
-pipeline["infer"].set({"output-tensor-meta": 1})
-pipeline.attach("infer", Probe("tensor-processor", TensorProcessor()))
-```
-
-### Pattern 5: Cloud Integration (Kafka)
-
-```python
-from kafka import KafkaProducer
-from pyservicemaker import BatchMetadataOperator, Probe
-import json
-
-class KafkaSender(BatchMetadataOperator):
-    def __init__(self, kafka_config):
-        super().__init__()
-        self.producer = KafkaProducer(
-            bootstrap_servers=kafka_config['servers'],
-            value_serializer=lambda v: json.dumps(v).encode('utf-8')
-        )
-        self.topic = kafka_config['topic']
-    
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            objects = [
-                {
-                    "class_id": obj.class_id,
-                    "confidence": obj.confidence,
-                    "bbox": {
-                        "left": obj.rect_params.left,
-                        "top": obj.rect_params.top,
-                        "width": obj.rect_params.width,
-                        "height": obj.rect_params.height
-                    },
-                    "object_id": obj.object_id  # Tracking ID assigned by tracker
-                }
-                for obj in frame_meta.object_items
-            ]
-            
-            message = {
-                "frame_number": frame_meta.frame_number,
-                "source_id": frame_meta.source_id,
-                "buffer_pts": frame_meta.buffer_pts,  # PTS timestamp in nanoseconds
-                "objects": objects
-            }
-            
-            self.producer.send(topic=self.topic, value=message)
-    
-    def __del__(self):
-        if hasattr(self, 'producer'):
-            self.producer.flush()
-            self.producer.close()
-
-# Usage
-kafka_config = {
-    "servers": "localhost:9092",
-    "topic": "analytics"
-}
-pipeline.attach("infer", Probe("kafka-sender", KafkaSender(kafka_config)))
-```
-
----
-
-## Best Practices
-
-1. **Use Pipeline API for fine-grained control**, Flow API for rapid prototyping
-2. **Always use hardware-accelerated decoders** (nvv4l2decoder)
-3. **Configure appropriate batch sizes** for your use case
-4. **Use probes for custom processing** instead of modifying plugins
-5. **Handle KeyboardInterrupt** properly (use multiprocessing.Process)
-6. **Flush and close Kafka producers** in cleanup methods
-7. **Use tensor metadata** for custom postprocessing when needed
-8. **Match tracker dimensions** to inference input dimensions
-9. **Use YAML configs** for complex pipelines to improve maintainability
-10. **Monitor GPU memory** when processing multiple streams
-11. **Use correct Queue types for inter-process/thread communication**:
-    - `queue.Queue` → Use with `threading.Thread` (same process)
-    - `multiprocessing.Queue` → Use with `multiprocessing.Process` (cross-process)
-    - Using `queue.Queue` with `multiprocessing.Process` will silently lose data!
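-
-The same-process half of practice 11 can be sketched with the standard library alone. The example below pairs `queue.Queue` with `threading.Thread`; the `producer` function and the `None` end-of-stream sentinel are illustrative choices, not part of pyservicemaker. For a `multiprocessing.Process` worker, the queue would need to be a `multiprocessing.Queue` instead.
-
-```python
-import queue
-import threading
-
-def producer(out_queue, n_items):
-    """Runs in a worker thread, so it shares memory with the consumer."""
-    for i in range(n_items):
-        out_queue.put({"frame_number": i})
-    out_queue.put(None)  # sentinel: no more items
-
-results_queue = queue.Queue()
-worker = threading.Thread(target=producer, args=(results_queue, 3))
-worker.start()
-
-received = []
-while True:
-    item = results_queue.get()  # blocks until an item is available
-    if item is None:
-        break
-    received.append(item["frame_number"])
-worker.join()
-print(received)
-```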
-
----
-
-## Error Handling
-
-```python
-from multiprocessing import Process
-import sys
-
-def run_pipeline():
-    try:
-        pipeline.start().wait()
-    except Exception as e:
-        print(f"Pipeline error: {e}")
-        sys.exit(1)
-
-if __name__ == "__main__":
-    process = Process(target=run_pipeline)
-    try:
-        process.start()
-        process.join()
-    except KeyboardInterrupt:
-        print("\nInterrupted. Terminating...")
-        process.terminate()
-        process.join()
-```
-
----
-
-## Pipeline State and Message Handling API
-
-### Pipeline States
-
-DeepStream pipelines follow GStreamer state transitions:
-
-| State | Description |
-|-------|-------------|
-| `PipelineState.NULL` | Initial state, no resources allocated |
-| `PipelineState.READY` | Resources allocated, not processing |
-| `PipelineState.PAUSED` | Paused, ready to play |
-| `PipelineState.PLAYING` | Processing data |
-
-### Pipeline Methods for State Management
-
-#### `prepare(message_handler)`
-Prepare the pipeline for activation with a message handler.
-
-**Parameters**:
-- `message_handler` (callable): Function to receive pipeline messages
-
-**Returns**: Pipeline instance (for method chaining)
-
-**Example**:
-```python
-from pyservicemaker import StateTransitionMessage, DynamicSourceMessage
-
-def on_message(message):
-    if isinstance(message, StateTransitionMessage):
-        print(f"State changed to: {message.new_state}")
-    elif isinstance(message, DynamicSourceMessage):
-        print(f"Source event: {message.source_id}")
-
-pipeline.prepare(on_message)
-```
-
-#### `activate()`
-Activate the pipeline (set to PLAYING state).
-
-**Returns**: Pipeline instance (for method chaining)
-
-#### `deactivate()`
-Deactivate the pipeline (set to NULL state).
-
-**Returns**: Pipeline instance (for method chaining)
-
-#### `wait()`
-Wait for the pipeline to complete (blocking).
-
-**Returns**: None
-
-### Message Types
-
-#### StateTransitionMessage
-Indicates a pipeline state change.
-
-**Properties**:
-- `origin` (str): Element name that changed state
-- `old_state` (PipelineState): Previous state
-- `new_state` (PipelineState): New state
-
-**Example**:
-```python
-from pyservicemaker import StateTransitionMessage, PipelineState
-
-def on_message(message):
-    if isinstance(message, StateTransitionMessage):
-        if message.new_state == PipelineState.PLAYING:
-            print(f"Element {message.origin} is now playing")
-        elif message.new_state == PipelineState.NULL:
-            print(f"Element {message.origin} stopped")
-```
-
-#### DynamicSourceMessage
-Indicates a dynamic source change (add/remove).
-
-**Properties**:
-- `source_id` (int): Unique source identifier
-- `source_added` (bool): True if added, False if removed
-- `sensor_id` (str): Sensor identifier
-- `sensor_name` (str): Human-readable sensor name
-- `uri` (str): Source URI (for added sources)
-
-**Example**:
-```python
-from pyservicemaker import DynamicSourceMessage
-
-sensor_map = {}
-
-def on_message(message):
-    if isinstance(message, DynamicSourceMessage):
-        if message.source_added:
-            sensor_map[message.source_id] = {
-                "sensor_id": message.sensor_id,
-                "sensor_name": message.sensor_name,
-                "uri": message.uri
-            }
-            print(f"Added source: {message.sensor_name}")
-        else:
-            if message.source_id in sensor_map:
-                del sensor_map[message.source_id]
-            print(f"Removed source: {message.source_id}")
-```
-
-### Complete Message Handling Example
-
-```python
-from pyservicemaker import (
-    Pipeline, PipelineState, StateTransitionMessage,
-    DynamicSourceMessage, SensorInfo, utils
-)
-
-def run_pipeline_with_messages(config_file):
-    """Pipeline with comprehensive message handling"""
-    pipeline = Pipeline("message-aware-pipeline", config_file=config_file)
-    
-    # Track sources
-    active_sources = {}
-    
-    # Performance monitor
-    perf_monitor = utils.PerfMonitor(
-        batch_size=4,
-        interval=5,
-        source_type="nvmultiurisrcbin"
-    )
-    perf_monitor.apply(pipeline["tiler"], "sink")
-    
-    def handle_message(message):
-        """Handle pipeline messages"""
-        if isinstance(message, StateTransitionMessage):
-            # Handle state transitions
-            if message.new_state == PipelineState.PLAYING:
-                if message.origin == "sink":
-                    print("Pipeline fully started")
-            elif message.new_state == PipelineState.NULL:
-                print(f"Element {message.origin} stopped")
-        
-        elif isinstance(message, DynamicSourceMessage):
-            # Handle dynamic source changes
-            source_id = message.source_id
-            
-            if message.source_added:
-                # Track new source
-                active_sources[source_id] = SensorInfo(
-                    sensor_id=message.sensor_id,
-                    sensor_name=message.sensor_name,
-                    uri=message.uri
-                )
-                
-                # Add to performance monitor
-                perf_monitor.add_stream(
-                    source_id=source_id,
-                    uri=message.uri,
-                    sensor_id=message.sensor_id,
-                    sensor_name=message.sensor_name
-                )
-                
-                print(f"Source added: {message.sensor_name} ({message.uri})")
-            else:
-                # Remove source
-                if source_id in active_sources:
-                    del active_sources[source_id]
-                perf_monitor.remove_stream(source_id)
-                print(f"Source removed: {source_id}")
-    
-    # Prepare with message handler
-    pipeline.prepare(handle_message)
-    
-    # Activate and wait
-    pipeline.activate()
-    pipeline.wait()
-
-# Run
-run_pipeline_with_messages("pipeline_config.yaml")
-```
-
----
-
-## Signal Handling API
-
-### Signal Module
-
-The `signal` module provides classes for custom signal handling.
-
-#### Emitter Class
-Base class for signal emitters.
-
-**Methods**:
-- `attach(signal_name, element)`: Attach signal to element
-- `set(properties)`: Set properties on the emitter
-
-#### Handler Class
-Base class for signal handlers.
-
-### Smart Recording Signals
-
-Smart recording uses signals for start/stop events.
-
-**Signal Names**:
-- `"start-sr"`: Start smart recording
-- `"stop-sr"`: Stop smart recording
-- `"sr-done"`: Recording complete
-
-**Example**:
-```python
-from pyservicemaker import Pipeline, CommonFactory
-
-pipeline = Pipeline("smart-recording")
-# ... build pipeline ...
-
-# Create smart recording controller
-sr_controller = CommonFactory.create("smart_recording_action", "sr_controller")
-
-if sr_controller:
-    sr_controller.set({
-        "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-        "conn-str": "localhost;9092",
-        "topic-list": "sr-events"
-    })
-    
-    # Attach signals to source element
-    sr_controller.attach("start-sr", pipeline["src"])
-    sr_controller.attach("stop-sr", pipeline["src"])
-    
-    # Attach signal handler for completion
-    pipeline.attach("src", "smart_recording_signal", "sr", "sr-done")
-```
-
----
-
-## Dynamic Source Management
-
-### nvmultiurisrcbin Properties
-
-For dynamic source management, use `nvmultiurisrcbin`:
-
-| Property | Type | Description |
-|----------|------|-------------|
-| `uri-list` | string | Comma-separated initial URIs |
-| `sensor-id-list` | string | Comma-separated sensor IDs |
-| `sensor-name-list` | string | Comma-separated sensor names |
-| `max-batch-size` | int | Maximum number of sources |
-
-### Adding/Removing Sources Dynamically
-
-Sources are added/removed via REST API or programmatically through source management APIs.
-
-```python
-from pyservicemaker import Pipeline, SourceConfig, DynamicSourceMessage
-
-# Load initial sources from config
-source_config = SourceConfig()
-source_config.load("sources.yaml")
-
-# Create pipeline
-pipeline = Pipeline("dynamic-sources", config_file="pipeline.yaml")
-
-# Initial sensors
-for i, sensor in enumerate(source_config.sensor_list):
-    print(f"Initial source {i}: {sensor.sensor_name}")
-
-# Handle dynamic changes via message handler
-def on_message(message):
-    if isinstance(message, DynamicSourceMessage):
-        if message.source_added:
-            print(f"New source: {message.sensor_name}")
-        else:
-            print(f"Source removed: {message.source_id}")
-
-pipeline.prepare(on_message)
-pipeline.activate()
-pipeline.wait()
-```
-
----
-
-## SourceManager API (nvdsdynamicsrcbin)
-
-`SourceManager` is a signal emitter (a `signal.Emitter` subclass) that dynamically adds and removes sources on `nvdsdynamicsrcbin` at runtime. Unlike `nvmultiurisrcbin` (which uses REST API / config-based management), `SourceManager` gives direct programmatic control over individual file/URI sources through signal actions.
-
-### Import
-
-```python
-from pyservicemaker._pydeepstream.signal import SourceManager
-```
-
-### Class: SourceManager
-
-Inherits from `signal.Emitter` → `Object`.
-
-**Constructor**:
-```python
-source_mgr = SourceManager("source_manager")
-```
-
-**Parameters**:
-- `name` (str): Name of the SourceManager instance
-
-### Methods
-
-#### `attach(action_name, element)`
-Attach the SourceManager to a pipeline element for a given action. Must be called for each action before using it.
-
-**Supported actions**:
-- `"add-source"` — enables `add_source()`
-- `"remove-source"` — enables `remove_source()`
-- `"terminate"` — enables `terminate()`
-
-**Parameters**:
-- `action_name` (str): One of `"add-source"`, `"remove-source"`, `"terminate"`
-- `element`: The pipeline element (Node) to attach to — must be an `nvdsdynamicsrcbin`
-
-**Example**:
-```python
-dsb_node = pipeline["dynamicsrcbin"]
-source_mgr.attach("add-source", dsb_node)
-source_mgr.attach("remove-source", dsb_node)
-source_mgr.attach("terminate", dsb_node)
-```
-
-#### `add_source(source_name)`
-Add a source (file path or URI) to the `nvdsdynamicsrcbin`.
-
-**Parameters**:
-- `source_name` (str): File path or URI of the source to add
-
-**Returns**: `int` — a unique source ID (>= 0), or `-1` if the add failed
-
-**Example**:
-```python
-sid = source_mgr.add_source("/path/to/video.h264")
-if sid < 0:
-    print("Failed to add source")
-```
-
-#### `remove_source(source_id)`
-Remove a previously added source by its ID.
-
-**Parameters**:
-- `source_id` (int): The unique ID returned by `add_source()`
-
-**Example**:
-```python
-source_mgr.remove_source(sid)
-```
-
-#### `terminate()`
-Signal that no more sources will be added. After all currently queued sources finish processing, an EOS (End of Stream) is sent downstream.
-
-**Example**:
-```python
-source_mgr.terminate()
-```
-
----
-
-This comprehensive API reference should help you build DeepStream applications using the Python Service Maker API effectively.
-
diff --git a/skills/deepstream/deepstream-dev/references/tracker_config.md b/skills/deepstream/deepstream-dev/references/tracker_config.md
deleted file mode 100644
index 2bf6cec0..00000000
--- a/skills/deepstream/deepstream-dev/references/tracker_config.md
+++ /dev/null
@@ -1,1296 +0,0 @@
-# nvtracker Configuration Reference
-
-## Overview
-
-The `nvtracker` GStreamer plugin provides multi-object tracking capabilities in DeepStream pipelines. It tracks objects detected by inference engines across video frames, assigning unique tracking IDs and maintaining object trajectories. The plugin works with a reference low-level tracker library (`NvMultiObjectTracker`) that implements multiple tracking algorithms in a unified, composable architecture.
-
-## Prerequisites
-
-### Required System Dependencies
-
-The tracker library (`libnvds_nvmultiobjecttracker.so`) requires the **libmosquitto** library for MQTT-based communication features (used by multi-view tracking). This must be installed before using the tracker.
-
-**Install on Ubuntu/Debian:**
-```bash
-sudo apt-get update
-sudo apt-get install -y libmosquitto1
-```
-
-**Install on RHEL/CentOS:**
-```bash
-sudo yum install mosquitto
-```
-
-**Common Error if Missing:**
-```
-gstnvtracker: Failed to open low-level lib at /opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so
-dlopen error: libmosquitto.so.1: cannot open shared object file: No such file or directory
-gstnvtracker: Failed to initialize low level lib.
-```
-
-If you see this error, install the libmosquitto package for your distribution as shown above.
-
----
-
-## Unified Tracker Architecture
-
-The NvMultiObjectTracker library employs a **modular, composable architecture**. Different tracker algorithms share common modules (data association, target management, state estimation) while differing in core functionalities (visual tracking, deep association metric, segmentation).
-
-### Module Composition by Tracker Type
-
-| Module | IOU | NvSORT | NvDCF | NvDeepSORT | MaskTracker |
-|--------|-----|--------|-------|------------|-------------|
-| **State Estimator** | - | Kalman (Regular) | Kalman (Simple) | Kalman (Regular) | Kalman (Simple) |
-| **Data Association** | Yes | Yes (Cascaded) | Yes (Cascaded) | Yes (Cascaded) | Yes (Cascaded) |
-| **Visual Tracker (DCF)** | - | - | Yes | - | - |
-| **Re-ID Network** | - | - | Optional | Yes | - |
-| **Segmenter (SAM2)** | - | - | - | - | Yes |
-| **Object Model Projection** | - | - | Optional (SV3DT) | - | - |
-| **Pose Estimator** | - | - | Optional (SV3DT) | - | - |
-| **Target Management** | Yes | Yes | Yes | Yes | Yes |
-| **Target Re-Association** | - | - | Optional | - | - |
-
-### Tracker Algorithm Summary
-
-| Algorithm | Library | Use Case | GPU Usage | Accuracy |
-|-----------|---------|----------|-----------|----------|
-| **IOU** | `libnvds_nvmultiobjecttracker.so` | Bare-minimum baseline, simple scenes | Very Low | Low |
-| **NvSORT** | `libnvds_nvmultiobjecttracker.so` | Balanced performance with medium/high accuracy detectors | Very Low | Medium |
-| **NvDCF** | `libnvds_nvmultiobjecttracker.so` | High accuracy, robust against occlusion, supports PGIE interval > 0 | Medium | High |
-| **NvDeepSORT** | `libnvds_nvmultiobjecttracker.so` | Re-identification, objects with similar appearance | Low | High |
-| **MaskTracker** | `libnvds_nvmultiobjecttracker.so` | Precise segmentation + tracking using SAM2 (Developer Preview) | High | Very High |
-
-**Library Location**: `/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so`
-
----
-
-## GObject Properties
-
-### Required Properties
-
-| Property | Type | Description |
-|----------|------|-------------|
-| `ll-lib-file` | string | Path to low-level tracker library |
-| `ll-config-file` | string | Path to tracker configuration file. When sub-batches are used, specify multiple configs delimited by semicolon |
-
-### Optional Properties
-
-| Property | Type | Default | Description |
-|----------|------|---------|-------------|
-| `tracker-width` | int | 0 | Tracker input width in pixels (0=auto) |
-| `tracker-height` | int | 0 | Tracker input height in pixels (0=auto) |
-| `gpu-id` | int | 0 | GPU device ID |
-| `display-tracking-id` | int | 1 | Show tracking ID in OSD (0/1) |
-| `tracking-id-reset-mode` | int | 0 | ID reset behavior: 0=no reset, 1=reset on stream reset, 2=reset on EOS, 3=both |
-| `tracking-surface-type` | int | 0 | Surface type for tracking |
-| `compute-hw` | int | 0 | Compute engine for scaling: 0=Default, 1=GPU, 2=VIC (Jetson only) |
-| `input-tensor-meta` | int | 0 | Use tensor metadata from upstream (nvdspreprocess) |
-| `tensor-meta-gie-id` | int | -1 | GIE ID for tensor metadata (valid only if input-tensor-meta=1) |
-| `user-meta-pool-size` | int | 16 | Tracker user metadata buffer pool size. Increase if you see "Unable to acquire a user meta buffer" warning |
-| `sub-batches` | string | - | Sub-batch configuration (see Sub-batching section) |
-| `sub-batch-err-recovery-trial-cnt` | int | 3 | Max reinit trials on sub-batch error. -1=infinite |
-
-### Usage Example
-
-```python
-pipeline.add("nvtracker", "tracker", {
-    "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-    "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml",
-    "tracker-width": 640,
-    "tracker-height": 384,
-    "gpu-id": 0,
-    "display-tracking-id": 1
-})
-```
-
----
-
-## Sub-batching
-
-The sub-batching feature allows splitting the input frame batch into multiple sub-batches, each processed by a **separate instance** of the low-level tracker library on dedicated threads. This enables:
-
-- **Parallel processing** to minimize GPU idling due to CPU compute blocks
-- **Different configs per sub-batch** (different algorithms, backends, parameters)
-- **Scaling beyond 128 streams** (VPI backend limit per instance)
-
-### Configuration Options
-
-**Option 1: Static source-to-sub-batch mapping**
-```
-# Semicolon-delimited arrays of source IDs
-sub-batches=0,1;2,3
-# Sources 0,1 -> sub-batch 0; Sources 2,3 -> sub-batch 1
-```
-
-**Option 2: Dynamic sub-batch sizing**
-```
-# Colon-delimited sub-batch sizes
-sub-batches=2:2
-# Two sub-batches, each accommodating up to 2 streams
-```
-
-### Multiple Config Files with Sub-batches
-
-When sub-batches are configured, specify one config file per sub-batch using semicolons:
-```
-ll-config-file=config_tracker_NvDCF_accuracy.yml;config_tracker_NvSORT.yml;config_tracker_IOU.yml
-sub-batches=0,1;2;3
-```
-
-### Use Case: Mixed Algorithms
-```ini
-[tracker]
-enable=1
-tracker-width=960
-tracker-height=544
-ll-lib-file=/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so
-ll-config-file=config_tracker_NvDCF_accuracy.yml;config_tracker_NvSORT.yml
-sub-batches=0,1;2,3
-```
-
-### Use Case: PVA Backend on Jetson
-```ini
-[tracker]
-ll-config-file=config_tracker_NvDCF_accuracy.yml;config_tracker_NvDCF_accuracy_PVA.yml
-sub-batches=0,1;2,3
-```
-
-> **Note**: The optimal sub-batches configuration depends on pipeline elements, hardware config, etc. Start with a single batch and keep splitting until an optimal performance point is reached.
-
----
-
-## Tracker Configuration File (YAML)
-
-The low-level tracker configuration is a YAML file with the following sections.
-
-### Configuration File Structure
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0.0
-
-TargetManagement:
-  maxTargetsPerStream: 150
-  probationAge: 4
-  maxShadowTrackingAge: 38
-  earlyTerminationAge: 1
-
-TrajectoryManagement:
-  useUniqueID: 0
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 0  # GREEDY=0, CASCADED=1
-
-StateEstimator:
-  stateEstimatorType: 0  # DUMMY=0, SIMPLE=1, REGULAR=2, SIMPLE_LOC=3
-
-# Algorithm-specific sections (include only those the chosen algorithm uses):
-VisualTracker:    # For NvDCF
-ReID:             # For NvDeepSORT or NvDCF with Re-Assoc
-Segmenter:        # For MaskTracker
-
-# SV3DT-specific sections (NvDCF with stateEstimatorType=3):
-ObjectModelProjection:  # Camera model + 3D projection output
-PoseEstimator:          # Body pose estimation for 3D height
-```
-
----
-
-## Configuration Sections Reference
-
-### BaseConfig
-
-| Parameter | Type | Default | Description | Dynamic |
-|-----------|------|---------|-------------|---------|
-| `minDetectorConfidence` | float | 0.0 | Detections below this confidence are discarded | Yes |
-
-### TargetManagement
-
-Controls the lifecycle of tracked targets through three states: **Tentative** -> **Active** -> **Inactive** (shadow tracking).
-
-| Parameter | Type | Description | Dynamic |
-|-----------|------|-------------|---------|
-| `maxTargetsPerStream` | int | Max targets per stream (includes shadow-tracked). Pre-allocates GPU memory | No |
-| `preserveStreamUpdateOrder` | bool | Deterministic ID order across runs (single-threaded update) | No |
-| `enableBboxUnClipping` | bool | Restore bboxes clipped by image border | Yes |
-| `minIouDiff4NewTarget` | float | New detection is discarded if IOU with any existing target exceeds this | Yes |
-| `minTrackerConfidence` | float | Below this confidence, target enters shadow mode [0.0, 1.0] | Yes |
-| `probationAge` | int | Frames in Tentative mode before target becomes Active (Late Activation) | Yes |
-| `maxShadowTrackingAge` | int | Max frames of shadow tracking before termination | Yes |
-| `earlyTerminationAge` | int | If shadowTrackingAge reaches this during Tentative period, target is terminated early | Yes |
-| `searchRegionPaddingScale` | float | Search region size as multiple of bbox diagonal (NvDCF) | Yes |
-| `outputTerminatedTracks` | bool | Export terminated track history to metadata | No |
-| `outputShadowTracks` | bool | Export shadow track data to metadata | No |
-| `terminatedTrackFilename` | string | File prefix for saving terminated tracks | No |
-
-#### Target State Transitions
-
-```
-  New Detection -> [Tentative] ---- (survives probationAge) ---> [Active]
-                      |                                            |
-                      | (earlyTerminationAge)                      | (no detection match for a while,
-                      v                                            |  or confidence < minTrackerConfidence)
-                  [Terminated]                                     v
-                                                              [Inactive / Shadow]
-                                                                   |
-                                                                   | (maxShadowTrackingAge exceeded)
-                                                                   v
-                                                              [Terminated]
-```
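-
-As a rough illustration of the lifecycle above, the fragment below reuses the `TargetManagement` values from the configuration skeleton earlier in this document and annotates what each age parameter would mean for a target. The 30 fps frame rate in the comments is an assumption, made only to convert frames into seconds.
-
-```yaml
-TargetManagement:
-  maxTargetsPerStream: 150   # upper bound per stream, shadow-tracked targets included
-  probationAge: 4            # a new target stays Tentative for 4 frames before becoming Active
-  earlyTerminationAge: 1     # a Tentative target whose shadowTrackingAge reaches 1 is terminated early
-  maxShadowTrackingAge: 38   # an Inactive target is terminated once it exceeds 38 shadow-tracked frames
-                             # (roughly 1.27 s at an assumed 30 fps: 38 / 30)
-```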
-
-### TrajectoryManagement
-
-Controls unique ID generation and target re-association.
-
-| Parameter | Type | Description |
-|-----------|------|-------------|
-| `useUniqueID` | bool | Use 64-bit unique ID (random upper 32-bit per stream + sequential lower 32-bit) |
-| `enableReAssoc` | bool | Enable motion-based target re-association |
-| `minMatchingScore4Overall` | float | Min total score for re-association |
-| `minTrackletMatchingScore` | float | Min tracklet IOU similarity for re-association |
-| `minMatchingScore4ReidSimilarity` | float | Min ReID score for re-association |
-| `matchingScoreWeight4TrackletSimilarity` | float | Weight for tracklet similarity in re-association |
-| `matchingScoreWeight4ReidSimilarity` | float | Weight for ReID similarity in re-association |
-| `minTrajectoryLength4Projection` | int | Min tracklet length to create projected trajectory |
-| `prepLength4TrajectoryProjection` | int | Trajectory length used for projection state estimation |
-| `trajectoryProjectionLength` | int | Length of projected trajectory |
-| `maxAngle4TrackletMatching` | float | Max angle difference for tracklet matching [degrees] |
-| `minSpeedSimilarity4TrackletMatching` | float | Min speed similarity for tracklet matching |
-| `minBboxSizeSimilarity4TrackletMatching` | float | Min bbox size similarity for tracklet matching |
-| `maxTrackletMatchingTimeSearchRange` | int | Time search range for tracklet matching |
-| `trajectoryProjectionProcessNoiseScale` | float | Process noise scale for trajectory projection |
-| `trajectoryProjectionMeasurementNoiseScale` | float | Measurement noise scale for trajectory projection |
-| `trackletSpacialSearchRegionScale` | float | Spatial search region for peer tracklet |
-| `reidExtractionInterval` | int | Frame interval for ReID feature extraction per target. -1=first frame only |
-
-### DataAssociator
-
-| Parameter | Type | Default | Description | Dynamic |
-|-----------|------|---------|-------------|---------|
-| `dataAssociatorType` | int | 0 | Data associator type {DEFAULT=0} | No |
-| `associationMatcherType` | int | 0 | Matching algorithm {GREEDY=0, CASCADED=1} | No |
-| `checkClassMatch` | bool | true | Only associate same-class objects | No |
-| `usePrediction4Assoc` | bool | false | Use predicted state for association instead of last known state | Yes |
-| **Similarity Thresholds** |||||
-| `minMatchingScore4Overall` | float | 0.0 | Min total matching score | Yes |
-| `minMatchingScore4SizeSimilarity` | float | 0.0 | Min bbox size similarity | Yes |
-| `minMatchingScore4Iou` | float | 0.0 | Min IOU score | Yes |
-| `minMatchingScore4VisualSimilarity` | float | 0.0 | Min visual similarity (NvDCF only) | Yes |
-| `minMatchingScore4ReidSimilarity` | float | 0.0 | Min ReID similarity (NvDeepSORT only) | Yes |
-| **Similarity Weights** |||||
-| `matchingScoreWeight4Iou` | float | 1.0 | Weight for IOU | Yes |
-| `matchingScoreWeight4SizeSimilarity` | float | 0.0 | Weight for size similarity | Yes |
-| `matchingScoreWeight4VisualSimilarity` | float | 0.0 | Weight for visual similarity (NvDCF) | Yes |
-| `matchingScoreWeight4ReidSimilarity` | float | 0.0 | Weight for ReID similarity (NvDeepSORT) | Yes |
-| **Tentative Detection** |||||
-| `tentativeDetectorConfidence` | float | 0.5 | Below this but above minDetectorConfidence = tentative detection | Yes |
-| `minMatchingScore4TentativeIou` | float | 0.0 | Min IOU for tentative detection matching | Yes |
-| **Mahalanobis Distance (NvDeepSORT)** |||||
-| `thresholdMahalanobis` | float | -1.0 | Max Mahalanobis distance. Negative = disabled | Yes |
-
-#### Cascaded Data Association (associationMatcherType: 1)
-
-The cascaded matcher performs multi-stage matching with different priorities:
-
-1. **Stage 1**: Confirmed detections <-> validated targets (joint similarity metrics)
-2. **Stage 2**: Tentative detections <-> remaining active targets (IOU only)
-3. **Stage 3**: Remaining confirmed detections <-> tentative targets (IOU only)
-
-Total matching score formula:
-
-`totalScore = w_iou * IOU + w_size * sizeSimilarity + w_reid * reidSimilarity + w_visual * visualSimilarity`
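-
-A worked instance of the formula may help when tuning weights. The weights, threshold, and similarity values below are hypothetical, chosen only to make the arithmetic easy to follow; they are not taken from any shipped config.
-
-```yaml
-DataAssociator:
-  associationMatcherType: 1                    # CASCADED
-  minMatchingScore4Overall: 0.5                # hypothetical threshold
-  matchingScoreWeight4Iou: 0.5                 # w_iou
-  matchingScoreWeight4SizeSimilarity: 0.2      # w_size
-  matchingScoreWeight4VisualSimilarity: 0.3    # w_visual
-  matchingScoreWeight4ReidSimilarity: 0.0      # w_reid
-
-# Hypothetical detection/target pair: IOU = 0.6, sizeSimilarity = 0.9, visualSimilarity = 0.7
-# totalScore = 0.5 * 0.6 + 0.2 * 0.9 + 0.0 * reidSimilarity + 0.3 * 0.7
-#            = 0.30 + 0.18 + 0.00 + 0.21
-#            = 0.69
-```
-
-Since 0.69 is above the `minMatchingScore4Overall` of 0.5, this pair would remain a candidate match, provided the per-metric minimums (`minMatchingScore4Iou` and so on) are also met.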
-
-### StateEstimator
-
-| Parameter | Type | Description |
-|-----------|------|-------------|
-| `stateEstimatorType` | int | Estimator type: **DUMMY=0**, **SIMPLE_BBOX_KF=1**, **REGULAR_BBOX_KF=2**, **SIMPLE_LOCATION_KF=3** |
-
-**SIMPLE_BBOX_KF (type=1)**: 6-state Kalman filter `{x, y, w, h, dx, dy}` with absolute noise values:
-
-| Parameter | Description |
-|-----------|-------------|
-| `processNoiseVar4Loc` | Process noise for bbox center |
-| `processNoiseVar4Size` | Process noise for bbox size |
-| `processNoiseVar4Vel` | Process noise for velocity |
-| `measurementNoiseVar4Detector` | Measurement noise from detector |
-| `measurementNoiseVar4Tracker` | Measurement noise from visual tracker (NvDCF) |
-
-**REGULAR_BBOX_KF (type=2)**: 8-state Kalman filter `{x, y, w, h, dx, dy, dw, dh}` with height-proportional noise:
-
-| Parameter | Description |
-|-----------|-------------|
-| `noiseWeightVar4Loc` | Noise weight proportional to bbox height (location) |
-| `noiseWeightVar4Vel` | Noise weight proportional to bbox height (velocity) |
-| `useAspectRatio` | Use aspect ratio `a` instead of width `w` in state vector (used by NvDeepSORT) |
-
-**SIMPLE_LOCATION_KF (type=3)**: 4-state Kalman filter `{x, y, dx, dy}` for 3D world coordinate tracking (SV3DT). Tracks the projected foot location in image space rather than bounding box. The bounding box is reconstructed by projecting a 3D cylinder model (from `ObjectModelProjection`) back onto the image. **Does NOT use `processNoiseVar4Size`** since bbox size is derived from the 3D model projection rather than estimated directly.
-
-| Parameter | Description |
-|-----------|-------------|
-| `processNoiseVar4Loc` | Process noise for foot location in image space |
-| `processNoiseVar4Vel` | Process noise for velocity |
-| `measurementNoiseVar4Detector` | Measurement noise from detector |
-| `measurementNoiseVar4Tracker` | Measurement noise from visual tracker (NvDCF) |
-
-> **Note**: When using `stateEstimatorType: 3`, the `ObjectModelProjection` section is required. The `PoseEstimator` section is optional but recommended for more accurate height estimation.
-
-### VisualTracker (NvDCF)
-
-| Parameter | Type | Description | Dynamic |
-|-----------|------|-------------|---------|
-| `visualTrackerType` | int | **DUMMY=0**, **NvDCF_legacy=1**, **NvDCF_VPI=2** | No |
-| `useColorNames` | bool | Use ColorNames feature (10 channels) | No |
-| `useHog` | bool | Use HOG feature (18 channels) | No |
-| `useHighPrecisionFeature` | bool | 16-bit precision (vs 8-bit) | No |
-| `featureImgSizeLevel` | int | Feature image size {1=12x12, 2=18x18, 3=24x24, 4=30x30, 5=36x36} per channel | No |
-| `featureFocusOffsetFactor_y` | float | Hanning window center Y offset [-0.5, 0.5]. Negative moves up (good for surveillance) | Yes |
-| `filterLr` | float | DCF filter learning rate [0.0, 1.0] | Yes |
-| `filterChannelWeightsLr` | float | Channel weights learning rate [0.0, 1.0] | Yes |
-| `gaussianSigma` | float | Gaussian sigma for desired response [pixels] | Yes |
-| `vpiBackend4DcfTracker` | int | VPI backend: **CUDA=1**, **PVA=2** (Jetson only). Valid when visualTrackerType=2 | No |
-
-#### PVA Backend Limitations (VPI)
-- Max 512 objects per tracker instance
-- Max 33 streams per instance (use sub-batching for more)
-- Only supports: `useColorNames: 1`, `useHog: 1`, `featureImgSizeLevel: 3`
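-
-A `VisualTracker` fragment that stays within these limits could look like the following sketch. It uses only the parameters and values listed in this section; the rest of the tracker config is omitted.
-
-```yaml
-VisualTracker:
-  visualTrackerType: 2        # NvDCF_VPI, needed for vpiBackend4DcfTracker to apply
-  vpiBackend4DcfTracker: 2    # PVA (Jetson only)
-  useColorNames: 1            # required by the PVA backend
-  useHog: 1                   # required by the PVA backend
-  featureImgSizeLevel: 3      # 24x24 per channel, the only size the PVA backend supports
-```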
-
-### ReID (Re-Identification)
-
-| Parameter | Type | Description |
-|-----------|------|-------------|
-| `reidType` | int | **DUMMY=0**, **NvDEEPSORT=1**, **REASSOC=2** (re-association only), **BOTH=3** |
-| `batchSize` | int | ReID network batch size |
-| `workspaceSize` | int | TensorRT workspace (MB) |
-| `reidFeatureSize` | int | Output feature dimension |
-| `reidHistorySize` | int | Max features kept per target (gallery size) |
-| `inferDims` | [int] | Network input dims [C, H, W] |
-| `networkMode` | int | Precision: FP32=0, FP16=1, INT8=2 |
-| `inputOrder` | int | NCHW=0, NHWC=1 |
-| `colorFormat` | int | RGB=0, BGR=1 |
-| `offsets` | [float] | Per-channel subtraction values |
-| `netScaleFactor` | float | Scale factor after offset: `y = netScaleFactor * (x - offsets)` |
-| `keepAspc` | bool | Preserve aspect ratio when resizing |
-| `useVPICropScaler` | bool | Use VPI for crop and scale |
-| `addFeatureNormalization` | bool | L2 normalize output features |
-| `minVisibility4GalleryUpdate` | float | Min visibility to add ReID embedding to gallery (SV3DT only, e.g. 0.6) |
-| `outputReidTensor` | bool | Export ReID features to user meta |
-| `tltEncodedModel` | string | TAO model path |
-| `tltModelKey` | string | TAO model key |
-| `onnxFile` | string | ONNX model path |
-| `modelEngineFile` | string | Pre-built TensorRT engine path |
-| `calibrationTableFile` | string | INT8 calibration table path |
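-
-To make the relationship between `offsets` and `netScaleFactor` concrete, the fragment below uses hypothetical preprocessing values (not taken from a shipped ReID config) that map 8-bit pixel values into approximately [-1, 1].
-
-```yaml
-ReID:
-  offsets: [127.5, 127.5, 127.5]   # hypothetical per-channel offsets
-  netScaleFactor: 0.0078431        # hypothetical, approximately 1 / 127.5
-
-# y = netScaleFactor * (x - offsets)
-# x = 255 -> 0.0078431 * (255 - 127.5), approximately  1.0
-# x = 0   -> 0.0078431 * (0   - 127.5), approximately -1.0
-```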
-
-### Segmenter (MaskTracker)
-
-| Parameter | Type | Description |
-|-----------|------|-------------|
-| `segmenterType` | int | **DUMMY=0**, **SAM2=1** |
-| `segmenterConfigPath` | string | Path to segmenter config (e.g., `config_tracker_module_Segmenter.yml`) |
-
-The segmenter config file defines four TensorRT-accelerated sub-networks (ImageEncoder, MaskDecoder, MemoryAttention, MemoryEncoder) and memory management parameters. See MaskTracker section for details.
-
-### ObjectModelProjection (SV3DT)
-
-Used for Single-View 3D Tracking (SV3DT). Projects a 3D cylinder model onto the image plane using camera calibration to estimate per-object visibility, foot location, and convex hull. This enables the tracker to recover complete bounding boxes and foot positions even under partial occlusion.
-
-| Parameter | Type | Description |
-|-----------|------|-------------|
-| `cameraModelFilepath` | list[string] | Camera calibration file path per stream (one entry per stream, ordered by stream index) |
-| `outputVisibility` | bool | Output per-object visibility [0.0, 1.0] estimated from occlusion via 3D model |
-| `outputFootLocation` | bool | Output foot location in image and world coordinates, estimated from 3D model projection |
-| `outputConvexHull` | bool | Output convex hull vertices for each object estimated from 3D cylinder model |
-| `minPoseConfidence` | float | Minimum pose keypoint confidence for adaptive height estimation [0.0, 1.0] |
-
-**Camera Model File (`camInfo.yml`):**
-
-The camera model file provides the 3x4 camera projection matrix and a cylinder model representing the tracked object (human). The projection matrix maps 3D world coordinates to 2D image coordinates.
-
-```yaml
-%YAML:1.0
-
-# 3x4 camera projection matrix (row-major)
-# Maps 3D world coordinates (X, Y, Z) to 2D image coordinates (u, v)
-projectionMatrix_3x4:
-  - 2582.5691623002185
-  - -485.10283397043617
-  - 650.27745033162591
-  - -89466.605755471101
-  - -423.46809686390498
-  - 1044.6870098337931
-  - 2461.1283636622838
-  - -214284.36100320917
-  - -0.25563255317172684
-  - -0.90495941862094287
-  - 0.34014768617197644
-  - -1181.960782357068
-
-# Cylinder model dimensions for human (cm)
-modelInfo:
-  height: 205    # Height of the cylinder model
-  radius: 33     # Radius of the cylinder model
-```
-
-> **Note**: The camera must be **static** (fixed position and orientation). The projection matrix can be obtained through standard camera calibration procedures. For multi-stream setups, provide one `camInfo.yml` per camera in the `cameraModelFilepath` list.
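-
-For a two-camera SV3DT setup, the relevant sections might be combined as in the sketch below. The file names `camInfo-01.yml` and `camInfo-02.yml` are placeholders, and the output flags are set purely for illustration.
-
-```yaml
-StateEstimator:
-  stateEstimatorType: 3        # SIMPLE_LOCATION_KF, which requires ObjectModelProjection
-
-ObjectModelProjection:
-  cameraModelFilepath:         # one entry per stream, ordered by stream index
-    - 'camInfo-01.yml'         # placeholder: calibration for stream 0
-    - 'camInfo-02.yml'         # placeholder: calibration for stream 1
-  outputVisibility: 1
-  outputFootLocation: 1
-  outputConvexHull: 0
-```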
-
-### PoseEstimator (SV3DT)
-
-Estimates 2D body pose to determine precise target height for the 3D cylinder model. Used in conjunction with `ObjectModelProjection` for SV3DT. When enabled, the BodyPose3DNet model infers key body joints to compute the actual individual height rather than using a fixed default height.
-
-| Parameter | Type | Description |
-|-----------|------|-------------|
-| `poseEstimatorType` | int | **0**=Disabled (use fixed-height model, match head to bbox top edge), **1**=Enabled (use BodyPose3DNet for precise height estimation) |
-| `useVPICropScaler` | bool | Use VPI backend for cropping and scaling |
-| `batchSize` | int | Batch size for pose estimation inference |
-| `workspaceSize` | int | TensorRT workspace size (MB) |
-| `inferDims` | [int] | Network input dims [C, H, W], e.g. `[3, 256, 192]` |
-| `networkMode` | int | Precision: FP32=0, FP16=1, INT8=2 |
-| `inputOrder` | int | NCHW=0, NHWC=1 |
-| `colorFormat` | int | RGB=0, BGR=1 |
-| `offsets` | [float] | Per-channel subtraction values |
-| `netScaleFactor` | float | Scale factor after offset subtraction |
-| `onnxFile` | string | Path to BodyPose3DNet ONNX model |
-| `modelEngineFile` | string | Pre-built TensorRT engine path |
-| `poseInferenceInterval` | int | Frame interval for pose inference. **-1**=first frame only (determine height once per target, most efficient) |
-
-> **Note**: When `poseEstimatorType: 0`, no pose model is needed. The tracker uses a fixed-height human model matching the head to the bbox top edge. This is less accurate but has zero additional compute cost. When `poseEstimatorType: 1`, the BodyPose3DNet model (`bodypose3dnet_accuracy.onnx`) is required.
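-
-A minimal sketch of the fixed-height variant described in the note. It assumes the model-related keys (`onnxFile`, `modelEngineFile`, and the inference settings) can simply be left out when pose estimation is disabled; verify this against your DeepStream version.
-
-```yaml
-PoseEstimator:
-  poseEstimatorType: 0    # disabled: fixed-height cylinder model, head matched to bbox top edge
-```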
-
----
-
-## Tracker Algorithm Configurations
-
-### IOU Tracker
-
-**Best for**: Bare-minimum baseline, sparse objects, detector runs every frame.
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0
-
-TargetManagement:
-  preserveStreamUpdateOrder: 0
-  maxTargetsPerStream: 150
-  minIouDiff4NewTarget: 0.5
-  probationAge: 4
-  maxShadowTrackingAge: 38
-  earlyTerminationAge: 1
-
-TrajectoryManagement:
-  useUniqueID: 0
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 0    # GREEDY
-  checkClassMatch: 1
-  minMatchingScore4Overall: 0.0
-  minMatchingScore4SizeSimilarity: 0.0
-  minMatchingScore4Iou: 0.0
-  matchingScoreWeight4SizeSimilarity: 0.4
-  matchingScoreWeight4Iou: 0.6
-```
-
-### NvSORT Tracker
-
-**Best for**: Balanced performance with medium/high accuracy detectors. Uses Kalman filter + cascaded data association.
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0.1345
-
-TargetManagement:
-  enableBboxUnClipping: 0
-  maxTargetsPerStream: 300
-  minIouDiff4NewTarget: 0.5780
-  minTrackerConfidence: 0.8216
-  probationAge: 5
-  maxShadowTrackingAge: 26
-  earlyTerminationAge: 1
-
-TrajectoryManagement:
-  useUniqueID: 0
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 1    # CASCADED
-  checkClassMatch: 1
-  minMatchingScore4Overall: 0.2543
-  minMatchingScore4SizeSimilarity: 0.4019
-  minMatchingScore4Iou: 0.2159
-  matchingScoreWeight4SizeSimilarity: 0.1365
-  matchingScoreWeight4Iou: 0.3836
-  tentativeDetectorConfidence: 0.2331
-  minMatchingScore4TentativeIou: 0.2867
-  usePrediction4Assoc: 1
-
-StateEstimator:
-  stateEstimatorType: 2    # REGULAR_BBOX_KF
-  noiseWeightVar4Loc: 0.0301
-  noiseWeightVar4Vel: 0.0017
-  useAspectRatio: 1
-```
-
-### NvDCF Tracker (Performance)
-
-**Best for**: High accuracy, robust against occlusions, supports PGIE interval > 0.
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0.0430
-
-TargetManagement:
-  enableBboxUnClipping: 1
-  preserveStreamUpdateOrder: 0
-  maxTargetsPerStream: 150
-  minIouDiff4NewTarget: 0.7418
-  minTrackerConfidence: 0.4009
-  probationAge: 2
-  maxShadowTrackingAge: 51
-  earlyTerminationAge: 1
-
-TrajectoryManagement:
-  useUniqueID: 0
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 1    # CASCADED
-  checkClassMatch: 1
-  minMatchingScore4Overall: 0.4290
-  minMatchingScore4SizeSimilarity: 0.3627
-  minMatchingScore4Iou: 0.2575
-  minMatchingScore4VisualSimilarity: 0.5356
-  matchingScoreWeight4VisualSimilarity: 0.3370
-  matchingScoreWeight4SizeSimilarity: 0.4354
-  matchingScoreWeight4Iou: 0.3656
-  tentativeDetectorConfidence: 0.2008
-  minMatchingScore4TentativeIou: 0.5296
-
-StateEstimator:
-  stateEstimatorType: 1    # SIMPLE_BBOX_KF
-  processNoiseVar4Loc: 1.5110
-  processNoiseVar4Size: 1.3159
-  processNoiseVar4Vel: 0.0300
-  measurementNoiseVar4Detector: 3.0283
-  measurementNoiseVar4Tracker: 8.1505
-
-VisualTracker:
-  visualTrackerType: 2    # NvDCF_VPI
-  useColorNames: 1
-  useHog: 0
-  featureImgSizeLevel: 2
-  featureFocusOffsetFactor_y: -0.2000
-  filterLr: 0.0750
-  filterChannelWeightsLr: 0.1000
-  gaussianSigma: 0.7500
-```
-
-### NvDCF Tracker (Accuracy with Re-Association)
-
-Enables Re-Association for long-term tracking with ReID.
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0.1894
-
-TargetManagement:
-  enableBboxUnClipping: 1
-  maxTargetsPerStream: 150
-  minIouDiff4NewTarget: 0.3686
-  minTrackerConfidence: 0.1513
-  probationAge: 2
-  maxShadowTrackingAge: 42
-  earlyTerminationAge: 1
-
-TrajectoryManagement:
-  useUniqueID: 0
-  enableReAssoc: 1
-  minMatchingScore4Overall: 0.6622
-  minTrackletMatchingScore: 0.2940
-  minMatchingScore4ReidSimilarity: 0.0771
-  matchingScoreWeight4TrackletSimilarity: 0.7981
-  matchingScoreWeight4ReidSimilarity: 0.3848
-  minTrajectoryLength4Projection: 34
-  prepLength4TrajectoryProjection: 58
-  trajectoryProjectionLength: 33
-  maxAngle4TrackletMatching: 67
-  minSpeedSimilarity4TrackletMatching: 0.0574
-  minBboxSizeSimilarity4TrackletMatching: 0.1013
-  maxTrackletMatchingTimeSearchRange: 27
-  trajectoryProjectionProcessNoiseScale: 0.0100
-  trajectoryProjectionMeasurementNoiseScale: 100
-  trackletSpacialSearchRegionScale: 0.0100
-  reidExtractionInterval: 8
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 1    # CASCADED
-  checkClassMatch: 1
-  minMatchingScore4Overall: 0.0222
-  minMatchingScore4SizeSimilarity: 0.3552
-  minMatchingScore4Iou: 0.0548
-  minMatchingScore4VisualSimilarity: 0.5043
-  matchingScoreWeight4VisualSimilarity: 0.3951
-  matchingScoreWeight4SizeSimilarity: 0.6003
-  matchingScoreWeight4Iou: 0.4033
-  tentativeDetectorConfidence: 0.1024
-  minMatchingScore4TentativeIou: 0.2852
-
-StateEstimator:
-  stateEstimatorType: 1    # SIMPLE_BBOX_KF
-  processNoiseVar4Loc: 6810.8668
-  processNoiseVar4Size: 1541.8647
-  processNoiseVar4Vel: 1348.4874
-  measurementNoiseVar4Detector: 100.0000
-  measurementNoiseVar4Tracker: 293.3238
-
-VisualTracker:
-  visualTrackerType: 2    # NvDCF_VPI
-  useColorNames: 1
-  useHog: 1
-  featureImgSizeLevel: 3
-  featureFocusOffsetFactor_y: -0.1054
-  filterLr: 0.0767
-  filterChannelWeightsLr: 0.0339
-  gaussianSigma: 0.5687
-
-ReID:
-  reidType: 2    # REASSOC only
-  batchSize: 100
-  workspaceSize: 1000
-  reidFeatureSize: 256
-  reidHistorySize: 100
-  inferDims: [3, 256, 128]
-  networkMode: 1    # FP16
-  inputOrder: 0
-  colorFormat: 0
-  offsets: [123.6750, 116.2800, 103.5300]
-  netScaleFactor: 0.01735207
-  keepAspc: 1
-  useVPICropScaler: 1
-  addFeatureNormalization: 1
-  tltEncodedModel: "/opt/nvidia/deepstream/deepstream/samples/models/Tracker/resnet50_market1501.etlt"
-  tltModelKey: "nvidia_tao"
-```
-
-### NvDeepSORT Tracker
-
-**Best for**: Re-identification across views, objects with similar appearance. Requires a Re-ID model.
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0.0762
-
-TargetManagement:
-  preserveStreamUpdateOrder: 0
-  maxTargetsPerStream: 150
-  minIouDiff4NewTarget: 0.9847
-  minTrackerConfidence: 0.4314
-  probationAge: 2
-  maxShadowTrackingAge: 68
-  earlyTerminationAge: 1
-
-TrajectoryManagement:
-  useUniqueID: 0
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 1    # CASCADED
-  checkClassMatch: 1
-  thresholdMahalanobis: 12.1875
-  minMatchingScore4Overall: 0.1794
-  minMatchingScore4SizeSimilarity: 0.3291
-  minMatchingScore4Iou: 0.2364
-  minMatchingScore4ReidSimilarity: 0.7505
-  matchingScoreWeight4SizeSimilarity: 0.7178
-  matchingScoreWeight4Iou: 0.4551
-  matchingScoreWeight4ReidSimilarity: 0.3197
-  tentativeDetectorConfidence: 0.2479
-  minMatchingScore4TentativeIou: 0.2376
-
-StateEstimator:
-  stateEstimatorType: 2    # REGULAR_BBOX_KF
-  noiseWeightVar4Loc: 0.0503
-  noiseWeightVar4Vel: 0.0037
-  useAspectRatio: 1
-
-ReID:
-  reidType: 1    # NvDEEPSORT
-  batchSize: 100
-  workspaceSize: 1000
-  reidFeatureSize: 256
-  reidHistorySize: 100
-  inferDims: [3, 256, 128]
-  networkMode: 1    # FP16
-  inputOrder: 0
-  colorFormat: 0
-  offsets: [123.6750, 116.2800, 103.5300]
-  netScaleFactor: 0.01735207
-  keepAspc: 1
-  useVPICropScaler: 1
-  addFeatureNormalization: 1
-  tltEncodedModel: "/opt/nvidia/deepstream/deepstream/samples/models/Tracker/resnet50_market1501.etlt"
-  tltModelKey: "nvidia_tao"
-  modelEngineFile: "/opt/nvidia/deepstream/deepstream/samples/models/Tracker/resnet50_market1501.etlt_b100_gpu0_fp16.engine"
-```
-
-**Setup ReID model:**
-```bash
-mkdir -p /opt/nvidia/deepstream/deepstream/samples/models/Tracker/
-wget 'https://api.ngc.nvidia.com/v2/models/nvidia/tao/reidentificationnet/versions/deployable_v1.0/files/resnet50_market1501.etlt' \
-  -P /opt/nvidia/deepstream/deepstream/samples/models/Tracker/
-```
-
-### MaskTracker (Developer Preview)
-
-**Best for**: Precise object segmentation + tracking using SAM2. Works with diverse object classes.
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0.3529
-
-TargetManagement:
-  enableBboxUnClipping: 1
-  preserveStreamUpdateOrder: 0
-  maxTargetsPerStream: 150
-  minIouDiff4NewTarget: 0.7608
-  minTrackerConfidence: 0.6223
-  probationAge: 4
-  maxShadowTrackingAge: 84
-  earlyTerminationAge: 1
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 1    # CASCADED
-  checkClassMatch: 1
-  minMatchingScore4Overall: 0.0293
-  minMatchingScore4SizeSimilarity: 0.1047
-  minMatchingScore4Iou: 0.0437
-  matchingScoreWeight4SizeSimilarity: 0.2410
-  matchingScoreWeight4Iou: 0.8590
-  tentativeDetectorConfidence: 0.1866
-  minMatchingScore4TentativeIou: 0.3660
-
-TrajectoryManagement:
-  useUniqueID: 0
-
-StateEstimator:
-  stateEstimatorType: 1    # SIMPLE_BBOX_KF
-  processNoiseVar4Loc: 2856.7104
-  processNoiseVar4Size: 8157.1946
-  processNoiseVar4Vel: 2602.8703
-  measurementNoiseVar4Detector: 0.1000
-  measurementNoiseVar4Tracker: 8.6695
-
-Segmenter:
-  segmenterType: 1    # SAM2
-  segmenterConfigPath: "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_module_Segmenter.yml"
-```
-
-**Setup SAM2 model:**
-```bash
-git clone https://github.com/NVIDIA-AI-IOT/deepstream_tools.git
-cd deepstream_tools/sam2-onnx-tensorrt
-bash run.sh
-```
-
-The segmentation mask is stored in the `mask_params` field of `NvDsObjectMeta`. Set `display-mask=1` in the OSD config to visualize it.
-
-### NvDCF 3D Tracker (SV3DT)
-
-**Best for**: Tracking people in 3D physical world coordinates from a static camera. Estimates foot location, body visibility, and convex hull using camera calibration and a 3D cylinder human model. Recovers complete bounding boxes even under partial occlusion.
-
-**Overview**: Single-View 3D Tracking (SV3DT) extends NvDCF with 3D state estimation. Instead of tracking bounding box coordinates directly, it tracks object positions in 3D world coordinates by projecting a cylinder model using the camera projection matrix. Key capabilities:
-
-- **3D world coordinate tracking**: Estimates object foot position in real-world coordinates
-- **Occlusion-aware bounding box recovery**: Reconstructs complete bounding boxes from partially occluded objects
-- **Visibility estimation**: Computes per-object visibility ratio (0.0\~1.0) based on mutual occlusion
-- **Convex hull output**: Provides projected 3D model convex hull vertices for each tracked object
-- **Pose-based height estimation**: Optionally uses BodyPose3DNet to determine individual person height
-
-**Prerequisites**:
-- Static camera with known camera projection matrix (`camInfo.yml`)
-- PeopleNet or similar person detector as PGIE
-- ReID model (e.g., `resnet50_market1501.etlt`) for re-association
-- BodyPose3DNet ONNX model (optional, for `poseEstimatorType: 1`)
-
-**Setup models:**
-```bash
-# PeopleNet model
-mkdir -p PeopleNet && cd PeopleNet
-wget --no-check-certificate --content-disposition \
-  https://api.ngc.nvidia.com/v2/models/nvidia/tao/peoplenet/versions/deployable_quantized_onnx_v2.6.3/zip \
-  -O peoplenet_deployable_quantized_onnx_v2.6.3.zip
-unzip peoplenet_deployable_quantized_onnx_v2.6.3.zip
-```
-
-The model files are now stored in the `PeopleNet` directory:
-
-```
-PeopleNet
-  ├── labels.txt
-  ├── resnet34_peoplenet.onnx
-  └── ...
-```
-
-```bash
-mkdir -p /opt/nvidia/deepstream/deepstream/samples/models/Tracker/
-
-# ReID model
-wget 'https://api.ngc.nvidia.com/v2/models/nvidia/tao/reidentificationnet/versions/deployable_v1.0/files/resnet50_market1501.etlt' \
-  -P /opt/nvidia/deepstream/deepstream/samples/models/Tracker/
-
-# BodyPose3DNet model (for poseEstimatorType: 1)
-wget 'https://api.ngc.nvidia.com/v2/models/nvidia/tao/bodypose3dnet/versions/deployable_accuracy_onnx_1.0/files/bodypose3dnet_accuracy.onnx' \
-  -P /opt/nvidia/deepstream/deepstream/samples/models/Tracker/
-```
-
-**Full Configuration (`config_tracker_NvDCF_accuracy_3D.yml`):**
-
-```yaml
-%YAML:1.0
-
-BaseConfig:
-  minDetectorConfidence: 0.1894
-
-TargetManagement:
-  enableBboxUnClipping: 1
-  preserveStreamUpdateOrder: 0
-  maxTargetsPerStream: 150
-  minIouDiff4NewTarget: 0.3686
-  minTrackerConfidence: 0.1513
-  probationAge: 2
-  maxShadowTrackingAge: 42
-  earlyTerminationAge: 1
-  # Export terminated tracklets
-  outputTerminatedTracks: 1
-  terminatedTrackFilename: track_dump_
-
-TrajectoryManagement:
-  useUniqueID: 0
-  enableReAssoc: 1
-  minMatchingScore4Overall: 0.6622
-  minTrackletMatchingScore: 0.2940
-  minMatchingScore4ReidSimilarity: 0.0771
-  matchingScoreWeight4TrackletSimilarity: 0.7981
-  matchingScoreWeight4ReidSimilarity: 0.3848
-  minTrajectoryLength4Projection: 34
-  prepLength4TrajectoryProjection: 58
-  trajectoryProjectionLength: 33
-  maxAngle4TrackletMatching: 67
-  minSpeedSimilarity4TrackletMatching: 0.0574
-  minBboxSizeSimilarity4TrackletMatching: 0.1013
-  maxTrackletMatchingTimeSearchRange: 27
-  trajectoryProjectionProcessNoiseScale: 0.0100
-  trajectoryProjectionMeasurementNoiseScale: 100
-  trackletSpacialSearchRegionScale: 0.0100
-  reidExtractionInterval: 8
-
-DataAssociator:
-  dataAssociatorType: 0
-  associationMatcherType: 1    # CASCADED
-  checkClassMatch: 1
-  minMatchingScore4Overall: 0.0222
-  minMatchingScore4SizeSimilarity: 0.3552
-  minMatchingScore4Iou: 0.0548
-  minMatchingScore4VisualSimilarity: 0.5043
-  matchingScoreWeight4VisualSimilarity: 0.3951
-  matchingScoreWeight4SizeSimilarity: 0.6003
-  matchingScoreWeight4Iou: 0.4033
-  tentativeDetectorConfidence: 0.1024
-  minMatchingScore4TentativeIou: 0.2852
-
-StateEstimator:
-  stateEstimatorType: 3    # SIMPLE_LOCATION_KF (3D)
-  # Note: NO processNoiseVar4Size (bbox size derived from 3D model projection)
-  processNoiseVar4Loc: 6810.8668
-  processNoiseVar4Vel: 1348.4874
-  measurementNoiseVar4Detector: 100.0000
-  measurementNoiseVar4Tracker: 293.3238
-
-ObjectModelProjection:
-  cameraModelFilepath:    # one camInfo.yml per stream
-    - configs/camInfo.yml
-  outputVisibility: 1
-  outputFootLocation: 1
-  outputConvexHull: 1
-  minPoseConfidence: 0.5
-
-VisualTracker:
-  visualTrackerType: 2    # NvDCF_VPI
-  vpiBackend4DcfTracker: 1    # CUDA
-  useColorNames: 1
-  useHog: 1
-  featureImgSizeLevel: 3
-  featureFocusOffsetFactor_y: -0.1054
-  filterLr: 0.0767
-  filterChannelWeightsLr: 0.0339
-  gaussianSigma: 0.5687
-
-ReID:
-  reidType: 2    # REASSOC only
-  batchSize: 100
-  workspaceSize: 1000
-  reidFeatureSize: 256
-  reidHistorySize: 100
-  inferDims: [3, 256, 128]
-  networkMode: 1    # FP16
-  inputOrder: 0
-  colorFormat: 0
-  offsets: [123.6750, 116.2800, 103.5300]
-  netScaleFactor: 0.01735207
-  keepAspc: 1
-  useVPICropScaler: 1
-  addFeatureNormalization: 1
-  minVisibility4GalleryUpdate: 0.6    # Only update ReID gallery when visibility >= 0.6
-  tltEncodedModel: "/opt/nvidia/deepstream/deepstream/samples/models/Tracker/resnet50_market1501.etlt"
-  tltModelKey: "nvidia_tao"
-  modelEngineFile: "/opt/nvidia/deepstream/deepstream/samples/models/Tracker/resnet50_market1501.etlt_b100_gpu0_fp16.engine"
-
-PoseEstimator:
-  poseEstimatorType: 1    # 1=BodyPose3DNet, 0=disabled (fixed height)
-  useVPICropScaler: 1
-  batchSize: 1
-  workspaceSize: 1000
-  inferDims: [3, 256, 192]
-  networkMode: 1    # FP16
-  inputOrder: 0
-  colorFormat: 0
-  offsets: [123.6750, 116.2800, 103.5300]
-  netScaleFactor: 0.00392156
-  onnxFile: "/opt/nvidia/deepstream/deepstream/samples/models/Tracker/bodypose3dnet_accuracy.onnx"
-  modelEngineFile: "/opt/nvidia/deepstream/deepstream/samples/models/Tracker/bodypose3dnet_accuracy.onnx_b1_gpu0_fp16.engine"
-  poseInferenceInterval: -1    # -1 = first frame only (determine height once per target)
-```
-
-> **Key Differences from Standard NvDCF Accuracy Config:**
-> - `stateEstimatorType: 3` instead of `1` — uses 3D location KF instead of bbox KF
-> - `StateEstimator` has NO `processNoiseVar4Size` — bbox size is derived from the 3D model projection, not estimated
-> - `ObjectModelProjection` section — camera calibration and 3D output controls
-> - `PoseEstimator` section — optional body pose for height estimation
-> - `minVisibility4GalleryUpdate: 0.6` in `ReID` — prevents occluded appearances from corrupting the gallery
-> - `outputTerminatedTracks: 1` + `terminatedTrackFilename` — exports track history for evaluation
-
-#### Multi-Stream Camera Configuration
-
-For multi-stream setups, provide one camera calibration file per stream in the `cameraModelFilepath` list:
-
-```yaml
-ObjectModelProjection:
-  cameraModelFilepath:
-    - configs/camInfo_stream0.yml    # stream 0
-    - configs/camInfo_stream1.yml    # stream 1
-    - configs/camInfo_stream2.yml    # stream 2
-  outputVisibility: 1
-  outputFootLocation: 1
-  outputConvexHull: 1
-  minPoseConfidence: 0.5
-```
-
-Each camera must have its own calibrated projection matrix since cameras have different positions and orientations.
-
-#### SV3DT Output Formats
-
-**MOT Format** (`track_dump_<stream_id>.txt`):
-
-When `outputTerminatedTracks: 1` and `terminatedTrackFilename` are set, terminated tracklets are saved in extended MOT format:
-
-```
-<frame>, <id>, <bb_left>, <bb_top>, <bb_width>, <bb_height>, <conf>, <foot_world_x>, <foot_world_y>, <class_id>, -1, <visibility>, <foot_image_x>, <foot_image_y>, <convex_hull_points...>
-```
-
-| Field | Description |
-|-------|-------------|
-| `frame` | Frame number |
-| `id` | Target tracking ID |
-| `bb_left, bb_top, bb_width, bb_height` | Recovered bounding box (complete, not clipped by occlusion) |
-| `conf` | Detection confidence |
-| `foot_world_x, foot_world_y` | Foot location in 3D world coordinates |
-| `class_id` | Object class ID |
-| `visibility` | Visibility ratio (0.0\~1.0), where 1.0 = fully visible |
-| `foot_image_x, foot_image_y` | Foot location in image coordinates |
-| `convex_hull_points` | Convex hull vertex coordinates from 3D cylinder projection |
-
-**KITTI Format** (`track_results/` directory):
-
-Track results can also be exported in KITTI tracking format for evaluation with standard benchmarks.
-
----
-
-## Tracker Comparisons and Tradeoffs
-
-| Tracker | GPU Usage | Accuracy | Visual Features | Key Advantage | Best Use Case |
-|---------|-----------|----------|-----------------|---------------|---------------|
-| **IOU** | Very Low | Low | No | Lightest weight | Sparse objects, detector every frame |
-| **NvSORT** | Very Low | Medium | No | Kalman + cascaded matching | Medium/high accuracy detectors |
-| **NvDCF** | Medium | High | DCF correlation filter | Robust to occlusion, supports PGIE interval > 0, tracker confidence output | Complex scenes, partial occlusion |
-| **NvDeepSORT** | Low | High | Re-ID network | Discriminative appearance matching | Similar-looking objects, multi-camera |
-| **MaskTracker** | High | Very High | SAM2 segmentation | Precise segmentation masks, works across object classes | Segmentation + tracking, diverse objects |
-| **NvDCF 3D (SV3DT)** | Medium-High | High | DCF + 3D model + optional pose | 3D world tracking, occlusion-aware bbox, foot location | Static camera surveillance, people tracking in physical space |
-
-> **Note**: IOU and NvSORT do not require video frame data (only bounding boxes). NvDCF and NvDeepSORT require NV12 or RGBA frames. MaskTracker requires frames for SAM2 inference.
-
-> **tracker_confidence**: Only NvDCF generates per-object tracker confidence values. For IOU, NvSORT, NvDeepSORT, and MaskTracker, `tracker_confidence` is set to `1.0` by default.
-
----
-
-## Dynamic Runtime Configuration
-
-The tracker supports parameter updates at runtime without restarting the pipeline. Only parameters marked as **Dynamic=Yes** in the tables above can be updated this way.
-
-### REST API
-
-```bash
-curl -XPOST 'http://localhost:9000/api/v1/nvtracker/config-path' -d '{
-  "stream": {
-    "stream_id": "0",
-    "config_path": "trackerUpdate.yaml"
-  }
-}'
-```
-
-### GStreamer Event
-
-Use `gst_nvevent_nvtracker_config_update` to trigger a config update from within the application.
-
-### C++ API
-
-`NvMOT_UpdateParams(contextHandle, configStr)` accepts a YAML config string directly (no file on disk required).
-
-### Control Section (Dynamic Only)
-
-```yaml
-Control:
-  tracker-reset: 1  # Soft reset: removes all tracks and track history
-```
-
-> **Note**: Reconfiguring any stream in a batch re-configures all streams in that batch/sub-batch.
-
----
-
-## Pipeline Integration
-
-### Basic Usage
-
-```python
-from pyservicemaker import Pipeline
-import platform
-
-def tracking_pipeline(video_path, infer_config):
-    pipeline = Pipeline("tracking-pipeline")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference
-    pipeline.add("nvinfer", "pgie", {"config-file-path": infer_config})
-
-    # Tracker
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml",
-        "tracker-width": 640,
-        "tracker-height": 384
-    })
-
-    # Display
-    pipeline.add("nvosdbin", "osd")
-    sink_type = "nv3dsink" if platform.machine() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink")
-
-    # Link
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "pgie", "tracker", "osd", "sink")
-
-    pipeline.start().wait()
-```
-
-### SV3DT (Single-View 3D Tracking) with PeopleNet
-
-SV3DT reuses the `tracking_pipeline` structure above -- only the PGIE config and the `nvtracker` properties change. Splice these settings into that pipeline (do **not** run this snippet on its own; it assumes the `pipeline` object from `tracking_pipeline`, and `MUX_WIDTH` / `MUX_HEIGHT` are placeholders for the `nvstreammux` `width` and `height`, 1920 and 1080 in that example):
-
-```python
-# Call as: tracking_pipeline(video_path, "config_pgie_peoplenet.yml")
-# Then override the tracker block with the 3D config below.
-
-# --- nvtracker overrides for SV3DT ---
-# Replace the "tracker" element added in tracking_pipeline with:
-pipeline.add("nvtracker", "tracker", {
-    # 3D tracker library + config (from deepstream_reference_apps/deepstream-tracker-3d)
-    "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-    "ll-config-file": "config_tracker_NvDCF_accuracy_3D.yml",  # references camInfo.yml
-
-    # SV3DT requires tracker dimensions to match the muxer / camera calibration,
-    # not the inference input -- otherwise the 3D cylinder projection is wrong.
-    "tracker-width": MUX_WIDTH,    # e.g. 1920
-    "tracker-height": MUX_HEIGHT,  # e.g. 1080
-
-    "gpu-id": 0,
-    "display-tracking-id": 1,
-})
-```
-
-**Key deltas vs. the basic `tracking_pipeline`:**
-
-| Property | Basic NvDCF | SV3DT |
-|----------|-------------|-------|
-| `ll-config-file` | `config_tracker_NvDCF_perf.yml` | `config_tracker_NvDCF_accuracy_3D.yml` (+ `camInfo.yml`) |
-| `tracker-width` / `tracker-height` | Match inference (e.g. 640x384) | **Must match muxer/calibration** (e.g. 1920x1080) |
-| PGIE | Any detector | PeopleNet (SV3DT models humans) |
-
-### Accessing Tracking Data
-
-```python
-from pyservicemaker import BatchMetadataOperator
-
-class TrackingAnalyzer(BatchMetadataOperator):
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            print(f"Frame {frame_meta.frame_number}:")
-
-            for obj_meta in frame_meta.object_items:
-                print(f"  Object: class={obj_meta.class_id}, "
-                      f"object_id={obj_meta.object_id}, "
-                      f"confidence={obj_meta.confidence:.2f}, "
-                      f"tracker_confidence={obj_meta.tracker_confidence:.2f}")
-```
-
----
-
-## Performance Tuning
-
-### Tracker Dimensions
-
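-Match tracker dimensions to the inference input for best performance (SV3DT is the exception: there the tracker dimensions must match the muxer/calibration resolution):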
-
-```python
-# If inference uses 960x544, match tracker
-pipeline.add("nvtracker", "tracker", {
-    "tracker-width": 960,
-    "tracker-height": 544,
-    # ...
-})
-```
-
-### Track Lifecycle Parameters
-
-| Scene Type | maxShadowTrackingAge | probationAge | earlyTerminationAge |
-|------------|---------------------|--------------|---------------------|
-| Simple | 15 | 2 | 1 |
-| Moderate | 30 | 3 | 1 |
-| Complex/Occlusion | 60 | 5 | 2 |
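-
-A minimal sketch of the "Complex/Occlusion" row as it would appear in a tracker config file, assuming the three parameters sit under `TargetManagement` as in the sample configs above. The values are the table's starting points, not tuned results.
-
-```yaml
-TargetManagement:
-  probationAge: 5             # frames a new target must survive before it is reported
-  maxShadowTrackingAge: 60    # frames an unmatched target is kept alive in shadow mode
-  earlyTerminationAge: 2      # unmatched frames tolerated while still in probation
-```
-
-Longer shadow tracking keeps IDs stable through occlusions but also keeps stale targets around longer, so raise it only as far as the scene requires.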
-
-### Memory Pre-allocation
-
-Total GPU memory is proportional to: `(number of streams) x maxTargetsPerStream`. The library pre-allocates all memory during init -- no growth during runtime.
-
-### Accuracy Tuning
-
-DeepStream 7.0+ includes **PipeTuner** for automatic accuracy tuning. It explores the parameter space and finds optimal parameters for metrics like HOTA, MOTA, and IDF1.
-
----
-
-## Miscellaneous Data Output
-
-The tracker can output additional data via `NvDsTargetMiscDataBatch`. The buffer pool that carries this data is sized by the `user-meta-pool-size` property:
-
-| Data Type | Enable Config | Description |
-|-----------|---------------|-------------|
-| **Past-frame data** | `enablePastFrame: 1` | Tracked data from Tentative period, reported after activation |
-| **Terminated tracks** | `outputTerminatedTracks: 1` | Full trajectory history for terminated targets |
-| **Shadow tracks** | `outputShadowTracks: 1` | Shadow tracking target data (not otherwise visible) |
-
----
-
-## Sample Configuration Files
-
-```
-/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/
-|-- config_tracker_IOU.yml                # Fast IOU tracker (GREEDY)
-|-- config_tracker_NvSORT.yml             # NvSORT (CASCADED + Regular KF)
-|-- config_tracker_NvDCF_max_perf.yml     # NvDCF maximum performance
-|-- config_tracker_NvDCF_perf.yml         # NvDCF balanced performance
-|-- config_tracker_NvDCF_accuracy.yml     # NvDCF highest accuracy (Re-Assoc + ReID)
-|-- config_tracker_NvDeepSORT.yml         # NvDeepSORT with ReID
-|-- config_tracker_MaskTracker.yml        # MaskTracker with SAM2
-|-- config_tracker_module_Segmenter.yml   # Segmenter module config for MaskTracker
-
-# SV3DT 3D Tracker config (from deepstream_reference_apps):
-# https://github.com/NVIDIA-AI-IOT/deepstream_reference_apps/tree/master/deepstream-tracker-3d
-|-- config_tracker_NvDCF_accuracy_3D.yml   # NvDCF 3D tracking (SV3DT)
-|-- camInfo.yml                            # Camera calibration for SV3DT
-```
-
----
-
-## Common Issues
-
-### Issue 1: Tracking IDs Not Appearing
-
-**Cause**: The `display-tracking-id` property on `nvtracker` is disabled, so the OSD does not draw tracking IDs.
-
-**Solution**:
-```python
-pipeline.add("nvtracker", "tracker", {
-    "display-tracking-id": 1,
-})
-```
-
-### Issue 2: Frequent ID Switches
-
-**Cause**: Low matching thresholds or short shadow tracking age.
-
-**Solutions**:
-- Increase `maxShadowTrackingAge` in tracker config
-- Increase `minMatchingScore4Iou` and similarity weights
-- Switch from GREEDY to CASCADED matching (`associationMatcherType: 1`)
-- Consider using NvDCF or NvDeepSORT for visual/ReID-based matching
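-
-A sketch of where these settings live in the tracker config, assuming the section layout of the sample configs above. The numbers are illustrative starting points only and should be tuned per scene.
-
-```yaml
-TargetManagement:
-  maxShadowTrackingAge: 60     # keep lost targets longer before terminating them
-
-DataAssociator:
-  associationMatcherType: 1    # CASCADED instead of GREEDY (0)
-  minMatchingScore4Iou: 0.25   # illustrative value; raise gradually and re-check ID switches
-```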
-
-### Issue 3: Too Many Simultaneous Tracks
-
-**Solution**: Reduce `maxTargetsPerStream` and/or increase `minDetectorConfidence` in BaseConfig.
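-
-For instance, with illustrative values and assuming the `BaseConfig` / `TargetManagement` layout used by the sample configs above:
-
-```yaml
-BaseConfig:
-  minDetectorConfidence: 0.3    # ignore detections below this confidence
-
-TargetManagement:
-  maxTargetsPerStream: 50       # cap on concurrently tracked targets per stream
-```
-
-Lowering `maxTargetsPerStream` also reduces the memory pre-allocated at init, as noted under Memory Pre-allocation.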
-
-### Issue 4: "Unable to acquire a user meta buffer"
-
-**Cause**: Buffer pool exhausted when downstream is slow to release.
-
-**Solution**: Increase `user-meta-pool-size` from default 16 to 64 or higher.
-
-### Issue 5: Failed to Open Low-Level Lib
-
-**Cause**: Missing `libmosquitto1` dependency.
-
-**Solution**: `sudo apt-get install -y libmosquitto1`
-
-### Issue 6: NvDCF Performance Bottleneck on Jetson
-
-**Solution**: Use PVA backend to offload DCF operations from GPU:
-```yaml
-VisualTracker:
-  visualTrackerType: 2
-  vpiBackend4DcfTracker: 2  # PVA backend
-```
-
----
-
-## Related Documentation
-
-- **GStreamer Plugins Overview**: `gstreamer_plugins.md`
-- **Service Maker Python API**: `service_maker_api.md`
-- **nvinfer Configuration**: `nvinfer_config.md`
-- **Use Cases & Pipelines**: `use_cases_pipelines.md`
-- **Official Docs**: https://docs.nvidia.com/metropolis/deepstream/dev-guide/text/DS_plugin_gst-nvtracker.html
diff --git a/skills/deepstream/deepstream-dev/references/troubleshooting.md b/skills/deepstream/deepstream-dev/references/troubleshooting.md
deleted file mode 100644
index 4728c3e2..00000000
--- a/skills/deepstream/deepstream-dev/references/troubleshooting.md
+++ /dev/null
@@ -1,966 +0,0 @@
-# DeepStream Common Errors and Troubleshooting Guide
-
-## Overview
-
-This document provides a quick reference for common errors encountered when developing DeepStream applications, along with their causes and solutions.
-
----
-
-## Python API Errors
-
-### Error: `RuntimeError: Probe failure` when attaching `measure_fps_probe`
-
-**Symptom**: Pipeline crashes with `RuntimeError: Probe failure` and message `unable to add probe fps-probe`.
-
-**Cause**: The built-in `measure_fps_probe` cannot be attached to sink elements (`nveglglessink`, `nv3dsink`, `filesink`). It can only be attached to processing elements that have both sink and src pads.
-
-**Wrong Code**:
-```python
-pipeline.attach("sink", "measure_fps_probe", "fps-probe")  # ❌ CRASH - sink has no src pad
-```
-
-**Solution**:
-```python
-# Attach to a processing element instead
-pipeline.attach("pgie", "measure_fps_probe", "fps-probe")   # ✅ Works
-pipeline.attach("osd", "measure_fps_probe", "fps-probe")     # ✅ Works
-```
-
----
-
-### Error: `TypeError: object of type 'iterator' has no len()`
-
-**Symptom**: Crash when trying to get length of metadata items.
-
-**Cause**: `frame_meta.object_items`, `frame_meta.tensor_items`, and `frame_meta.user_items` return **iterators**, not lists.
-
-**Wrong Code**:
-```python
-count = len(frame_meta.object_items)  # ❌ CRASH
-```
-
-**Solution**:
-```python
-# Count by iterating
-obj_count = 0
-for obj in frame_meta.object_items:
-    obj_count += 1
-    process(obj)
-
-# Or convert to list first (if needed)
-objects = list(frame_meta.object_items)
-count = len(objects)
-```
-
----
-
-### Error: `pad template "sink_X" not found`
-
-**Symptom**: Pipeline fails to link elements with error about missing pad.
-
-**Cause**: Using literal pad names like `"sink_0"` instead of pad template `"sink_%u"`.
-
-**Wrong Code**:
-```python
-pipeline.link((f"decoder{i}", "mux"), ("", f"sink_{i}"))  # ❌ FAILS
-pipeline.link((f"decoder{i}", "mux"), ("", "sink_0"))     # ❌ FAILS
-```
-
-**Solution**:
-```python
-# Use pad template - GStreamer auto-assigns sink_0, sink_1, etc.
-pipeline.link((f"decoder{i}", "mux"), ("", "sink_%u"))  # ✅ CORRECT
-```
-
----
-
-### Error: Data not reaching downstream (Queue appears empty)
-
-**Symptom**: 
-- Pipeline runs without errors
-- No data reaches Kafka, VLM, or other downstream processing
-- Statistics show 0 batches/messages processed
-
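-**Cause**: Using `queue.Queue` with `multiprocessing.Process`. `queue.Queue` is an in-process structure and is not shared between processes, so items put in one process never appear in the other.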
-
-**Wrong Code**:
-```python
-from multiprocessing import Process
-from queue import Queue  # ❌ Wrong queue type
-
-class Processor:
-    def __init__(self):
-        self.batch_queue = Queue()  # Won't work across processes!
-    
-    def start(self):
-        process = Process(target=self._run, args=(self.batch_queue,))
-        process.start()  # Data put in child process never reaches parent
-```
-
-**Solution**:
-```python
-# Option 1: Use multiprocessing.Queue for processes
-from multiprocessing import Process, Queue as MPQueue
-
-class Processor:
-    def __init__(self):
-        self.batch_queue = MPQueue()  # ✅ Works across processes
-
-# Option 2: Use threading instead
-import threading
-from queue import Queue
-
-class Processor:
-    def __init__(self):
-        self.batch_queue = Queue()  # ✅ OK for threads
-    
-    def start(self):
-        thread = threading.Thread(target=self._run, args=(self.batch_queue,))
-        thread.start()  # Works because threads share memory
-```
-
----
-
-### Error: `ModuleNotFoundError: No module named 'pyservicemaker'` inside virtual environment
-
-**Symptom**: Application crashes on import when run inside a Python virtual environment:
-```
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator
-ModuleNotFoundError: No module named 'pyservicemaker'
-```
-
-**Cause**: `pyservicemaker` is installed system-wide but a standard `python3 -m venv` does **not** inherit system packages. Any DeepStream app run inside such a venv cannot find `pyservicemaker`.
-
-**Solution**: Install `pyservicemaker` (and its `pyyaml` dependency) inside the virtual environment:
-```bash
-source venv/bin/activate
-pip install /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl pyyaml
-```
-
-> **Note for generated READMEs**: When generating setup instructions that create a virtual environment, always include the `pyservicemaker` install step in the venv setup so users don't hit this error.
-
----
-
-## Configuration Errors
-
-### Error: `Configuration file parsing failed`
-
-**Symptom**: nvinfer fails to load configuration file.
-
-**Common Causes**:
-
-1. **Wrong section name in YAML**:
-```yaml
-# ❌ WRONG
-model:
-  onnx-file: /path/to/model.onnx
-
-# ✅ CORRECT
-property:
-  onnx-file: /path/to/model.onnx
-```
-
-2. **Mixing YAML/INI syntax**:
-```yaml
-# ❌ WRONG (INI syntax in .yml file)
-[property]
-onnx-file=/path/to/model.onnx
-
-# ✅ CORRECT (YAML syntax)
-property:
-  onnx-file: /path/to/model.onnx
-```
-
-3. **Missing indentation in YAML**:
-```yaml
-# ❌ WRONG
-property:
-gpu-id: 0
-
-# ✅ CORRECT
-property:
-  gpu-id: 0
-```
-
----
-
-### Error: `Model file not found`
-
-**Symptom**: nvinfer cannot find model file.
-
-**Solution**: Verify paths exist and use absolute paths:
-```python
-import os
-
-# Verify path exists
-model_path = "/opt/nvidia/deepstream/deepstream/samples/models/Primary_Detector/resnet18_trafficcamnet_pruned.onnx"
-if not os.path.exists(model_path):
-    print(f"Model not found: {model_path}")
-```
-
-**DeepStream 9.0 Model Locations**:
-```
-/opt/nvidia/deepstream/deepstream/samples/models/
-├── Primary_Detector/
-│   └── resnet18_trafficcamnet_pruned.onnx
-├── Secondary_VehicleMake/
-│   └── resnet18_vehiclemakenet_pruned.onnx
-└── Secondary_VehicleTypes/
-    └── resnet18_vehicletypenet_pruned.onnx
-```
-
----
-
-### Error: `num-detected-classes mismatch`
-
-**Symptom**: Incorrect detection results or crashes.
-
-**Cause**: `num-detected-classes` doesn't match model output.
-
-**Solution**: Check your model's output and set correctly:
-```yaml
-property:
-  num-detected-classes: 4  # Must match model
-  labelfile-path: /path/to/labels.txt  # Should have 4 lines
-```
-
----
-
-## Pipeline Errors
-
-### Error: `Element could not be created`
-
-**Symptom**: Pipeline fails to create GStreamer element.
-
-**Common Causes**:
-
-1. **Missing plugin**: Element not installed
-```bash
-# Check if element exists
-gst-inspect-1.0 nvinfer
-```
-
-2. **Wrong element name**:
-```python
-# ❌ Wrong
-pipeline.add("nvv4ldecoder", "decoder")  # Typo
-
-# ✅ Correct
-pipeline.add("nvv4l2decoder", "decoder")
-```
-
-3. **Missing DeepStream libraries**:
-```bash
-# Set library path
-export LD_LIBRARY_PATH=/opt/nvidia/deepstream/deepstream/lib:$LD_LIBRARY_PATH
-```
-
----
-
-### Error: `Failed to open low-level lib` (Tracker)
-
-**Symptom**: Tracker fails to initialize with error:
-```
-gstnvtracker: Failed to open low-level lib at /opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so
-dlopen error: libmosquitto.so.1: cannot open shared object file: No such file or directory
-gstnvtracker: Failed to initialize low level lib.
-```
-
-**Cause**: The tracker library requires `libmosquitto` (MQTT client library) as a dependency.
-
-**Solution**: Install the mosquitto library:
-```bash
-# Ubuntu/Debian
-sudo apt-get update
-sudo apt-get install -y libmosquitto1
-
-# RHEL/CentOS
-sudo yum install mosquitto
-```
-
-> **Important**: `libmosquitto1` is the client *library* only. If you also need to run an MQTT broker locally (e.g., `mosquitto &`) or use CLI tools like `mosquitto_sub` / `mosquitto_pub` for testing, you must install **separate** packages:
-> ```bash
-> sudo apt-get install -y mosquitto           # broker daemon
-> sudo apt-get install -y mosquitto-clients   # CLI tools (mosquitto_pub, mosquitto_sub)
-> ```
-
----
-
-### Error: `Command 'mosquitto' not found`
-
-**Symptom**: Running `mosquitto &` to start a local MQTT broker fails:
-```
-Command 'mosquitto' not found, but can be installed with:
-apt install mosquitto
-```
-
-**Cause**: The `mosquitto` broker package is separate from `libmosquitto1` (client library). Installing `libmosquitto1` does NOT install the broker.
-
-**Solution**:
-```bash
-sudo apt-get install -y mosquitto mosquitto-clients
-```
-
----
-
-### Error: `Linking failed between elements`
-
-**Symptom**: Elements cannot be linked.
-
-**Common Causes**:
-
-1. **Incompatible caps**: Format mismatch between elements
-```python
-# Add nvvideoconvert if formats don't match
-pipeline.add("nvvideoconvert", "convert")
-pipeline.link("element1", "convert", "element2")
-```
-
-2. **Wrong pad names**:
-```python
-# ❌ Wrong
-pipeline.link(("src", "mux"), ("video", "sink"))
-
-# ✅ Correct - check actual pad names
-pipeline.link(("src", "mux"), ("", "sink_%u"))
-```
-
----
-
-### Error: `Pipeline stalled` or `No frames received`
-
-**Symptom**: Pipeline starts but no output appears.
-
-**Common Causes**:
-
-1. **Missing queue elements**:
-```python
-# Add queues after tee
-pipeline.add("tee", "tee")
-pipeline.add("queue", "queue1")
-pipeline.add("queue", "queue2")
-pipeline.link(("tee", "queue1"), ("src_%u", ""))
-pipeline.link(("tee", "queue2"), ("src_%u", ""))
-```
-
-2. **Sync issues with live sources**:
-```python
-# Disable sync for live streams
-pipeline.add("nveglglessink", "sink", {"sync": 0})
-
-# Set live-source on muxer
-pipeline.add("nvstreammux", "mux", {"live-source": 1})
-```
-
-3. **appsink not emitting signals**:
-```python
-# Enable signal emission
-pipeline.add("appsink", "sink", {"emit-signals": True, "sync": False})
-```
-
----
-
-### Error: `Resource busy` or `Device not found`
-
-**Symptom**: GPU or video device unavailable.
-
-**Solutions**:
-
-1. **Check GPU availability**:
-```bash
-nvidia-smi
-```
-
-2. **Verify correct GPU ID**:
-```yaml
-property:
-  gpu-id: 0  # Use correct GPU ID
-```
-
-3. **Check decoder device**:
-```bash
-ls /dev/nvidia*
-```
-
----
-
-## Memory Errors
-
-### Error: `CUDA out of memory`
-
-**Symptom**: Application crashes with memory error.
-
-**Solutions**:
-
-1. **Reduce batch size**:
-```python
-pipeline.add("nvstreammux", "mux", {"batch-size": 2})  # Reduce from 8
-```
-
-2. **Reduce resolution**:
-```python
-pipeline.add("nvstreammux", "mux", {
-    "batch-size": 4,
-    "width": 1280,   # Reduce from 1920
-    "height": 720    # Reduce from 1080
-})
-```
-
-3. **Use FP16 instead of FP32**:
-```yaml
-property:
-  network-mode: 2  # FP16
-```
-
-4. **Monitor GPU memory**:
-```bash
-watch -n 1 nvidia-smi
-```
-
----
-
-### Error: `Buffer corruption` or `Segmentation fault`
-
-**Symptom**: Random crashes when processing buffers.
-
-**Cause**: Not cloning buffer tensors before async processing.
-
-**Wrong Code**:
-```python
-def consume(self, buffer):
-    tensor = buffer.extract(0)  # ❌ Direct use
-    # Tensor may be reused/freed by pipeline
-```
-
-**Solution**:
-```python
-def consume(self, buffer):
-    tensor = buffer.extract(0).clone()  # ✅ Clone first
-    # Now safe for async processing
-```
-
----
-
-## Inference Errors
-
-### Error: `setDimensions` fails with dynamic ONNX model (negative dimensions)
-
-**Symptom**: TensorRT engine build fails immediately with repeated `setDimensions` errors:
-```
-ERROR: [TRT]: IOptimizationProfile::setDimensions: Error Code 3: API Usage Error
-  (Parameter check failed, condition: std::all_of(dims.d, dims.d + dims.nbDims,
-  [](int32_t x) noexcept { return x >= 0; }))
-ERROR: ../nvdsinfer/nvdsinfer_model_builder.cpp:1263 Explicit config dims is invalid
-ERROR: ../nvdsinfer/nvdsinfer_model_builder.cpp:906 Failed to configure builder options
-ERROR: ../nvdsinfer/nvdsinfer_model_builder.cpp:595 failed to build trt engine.
-```
-
-**Cause**: The ONNX model has **dynamic input shapes** (e.g., exported with `dynamic=True` in Ultralytics, or with dynamic batch/height/width axes). Dynamic dimensions are stored as symbolic names in the ONNX file, which TensorRT reads as `-1`. Without `infer-dims`, nvinfer passes these `-1` values to TensorRT's `setDimensions`, which requires all dimensions to be >= 0.
-
-This is extremely common with models from Ultralytics (YOLO), HuggingFace, and other frameworks that default to dynamic exports.
-
-**Diagnosis** — check if your ONNX model has dynamic dimensions:
-```bash
-python -c "
-import onnx
-m = onnx.load('model.onnx')
-for inp in m.graph.input:
-    dims = []
-    for d in inp.type.tensor_type.shape.dim:
-        dims.append(d.dim_param if d.dim_param else d.dim_value)
-    print(f'{inp.name}: {dims}')
-"
-# If output shows symbolic names like 'batch', 'height', 'width' → dynamic model
-# If output shows integers like [1, 3, 640, 640] → static model (infer-dims not needed)
-```
-
-**Solution**: Add `infer-dims` to the nvinfer config with the concrete C;H;W dimensions:
-
-```yaml
-# YAML format
-property:
-  onnx-file: model.onnx
-  infer-dims: 3;640;640  # C;H;W — concrete dimensions for the dynamic input
-```
-
-```ini
-# INI format
-[property]
-onnx-file=model.onnx
-infer-dims=3;640;640
-```
-
-> **Note**: The batch dimension is handled by `batch-size` — `infer-dims` only specifies C;H;W. Delete any stale `.engine` files after adding `infer-dims` so TensorRT rebuilds the engine with the correct optimization profile.
-
----
-
-### Error: `TensorRT engine build failed` (general)
-
-**Symptom**: First-time model loading takes a long time, then fails.
-
-**Solutions**:
-
-1. **Check for dynamic ONNX dimensions first** (see `setDimensions` error above)
-
-2. **Check ONNX model compatibility**:
-```bash
-# Verify ONNX model
-python -c "import onnx; onnx.checker.check_model('model.onnx')"
-```
-
-3. **Provide pre-built engine file**:
-```yaml
-property:
-  model-engine-file: /path/to/model.engine
-```
-
-4. **Check CUDA/TensorRT versions**:
-```bash
-# Engine must match installed TensorRT version
-nvcc --version
-dpkg -l | grep tensorrt
-```
-
----
-
-### Error: `Output layer not found`
-
-**Symptom**: Custom postprocessing can't find expected output layers.
-
-**Solution**: List actual output layers:
-```python
-def handle_metadata(self, batch_meta):
-    for frame_meta in batch_meta.frame_items:
-        for tensor_meta in frame_meta.tensor_items:
-            layers = tensor_meta.as_tensor_output().get_layers()
-            print(f"Available layers: {list(layers.keys())}")
-            # Use actual layer names
-```
-
----
-
-### Error: `Secondary GIE not processing`
-
-**Symptom**: Secondary inference not running on detected objects.
-
-**Causes and Solutions**:
-
-1. **Wrong process-mode**:
-```yaml
-property:
-  process-mode: 2  # Must be 2 for secondary
-```
-
-2. **Wrong operate-on-gie-id**:
-```yaml
-property:
-  process-mode: 2
-  operate-on-gie-id: 1  # Must match primary GIE unique-id
-```
-
-3. **Wrong operate-on-class-ids**:
-```yaml
-property:
-  process-mode: 2
-  operate-on-gie-id: 1
-  operate-on-class-ids: 0  # Must match class IDs from primary
-```
-
----
-
-## Display Errors
-
-### Error: `Could not open display`
-
-**Symptom**: Rendering fails on headless systems.
-
-**Solution**: Use fakesink for headless operation:
-```python
-# Check if display is available
-import os
-if "DISPLAY" not in os.environ:
-    pipeline.add("fakesink", "sink")
-else:
-    pipeline.add("nveglglessink", "sink")
-```
-
-Or use file output:
-```python
-pipeline.add("nvvideoconvert", "convert")
-pipeline.add("nvv4l2h264enc", "encoder")
-pipeline.add("h264parse", "parser")
-pipeline.add("mp4mux", "mux")
-pipeline.add("filesink", "sink", {"location": "output.mp4"})
-```
-
----
-
-### Error: `Platform not supported`
-
-**Symptom**: Sink element fails on Jetson or x86.
-
-**Solution**: Use platform-specific sink:
-```python
-import platform
-
-if platform.machine() == "aarch64":
-    # Jetson
-    pipeline.add("nv3dsink", "sink")
-else:
-    # x86
-    pipeline.add("nveglglessink", "sink")
-```
-
----
-
-## Kafka/Message Broker Errors
-
-### Error: `unable to open shared library` / `Failed to start` (missing librdkafka)
-
-**Symptom**: Any pipeline using `nvmsgbroker` with the Kafka protocol adapter fails at startup:
-```
-WARN nvmsgbroker gstnvmsgbroker.cpp:404:legacy_gst_nvmsgbroker_start:<msgbroker> error: unable to open shared library
-WARN basesink gstbasesink.c:5906:gst_base_sink_change_state:<msgbroker> error: Failed to start
-Unable to set the pipeline to the playing state.
-```
-
-**Cause**: DeepStream's Kafka protocol adapter (`libnvds_kafka_proto.so`) dynamically links against `librdkafka.so.1`, which is **NOT** bundled with the DeepStream SDK and not installed by default.
-
-**Diagnosis**:
-```bash
-ldd /opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so | grep "not found"
-# Output: librdkafka.so.1 => not found
-```
-
-**Solution**:
-```bash
-sudo apt-get install -y librdkafka-dev
-```
-
-> **Note**: This is different from the "unable to connect to broker library" error below, which is caused by wrong connection string format. This error is about a missing system library.
-
----
-
-### Error: `unable to connect to broker library` / `Failed to start`
-
-**Symptom**: Pipeline fails with error:
-```
-WARN nvmsgbroker: error: unable to connect to broker library
-WARN basesink: error: Failed to start
-Unable to set the pipeline to the playing state.
-```
-
-**Cause**: Wrong connection string format. DeepStream uses **semicolon (`;`)** separator, NOT colon (`:`).
-
-**Wrong Code**:
-```python
-# ❌ WRONG - colon separator
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "conn-str": "localhost:9092",  # Wrong!
-    # ...
-})
-```
-
-**Solution**:
-```python
-# ✅ CORRECT - semicolon separator
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "conn-str": "localhost;9092",  # Correct: use semicolon
-    # ...
-})
-```
-
----
-
-### Error: No messages reaching Kafka (pipeline runs but no output)
-
-**Symptom**: 
-- Pipeline runs without errors
-- Kafka consumer receives no messages
-- No error in logs
-
-**Cause**: `nvmsgconv` requires `NvDsEventMsgMeta` by default (`msg2p-newapi=0`), which is **NOT automatically generated** by inference or tracker plugins. Without either (a) setting `msg2p-newapi: True` or (b) attaching a probe that generates `EventMessageUserMetadata`, nvmsgconv silently produces zero messages.
-
-**Wrong Code**:
-```python
-# ❌ Without msg2p-newapi AND without EventMessageUserMetadata probe,
-# nvmsgconv has no input and produces no messages!
-pipeline.add("nvmsgconv", "msgconv", {
-    "config": msgconv_config,
-    "payload-type": 0
-})
-```
-
-**Solution A** (simple): Set `msg2p-newapi: True` to use the new API that reads directly from `NvDsObjectMeta`:
-```python
-# ✅ CORRECT - msg2p-newapi reads from NvDsObjectMeta directly
-pipeline.add("nvmsgconv", "msgconv", {
-    "config": msgconv_config,
-    "payload-type": 0,
-    "msg2p-newapi": True,  # CRITICAL: Enables direct object metadata reading
-    "frame-interval": 30   # Send message every 30 frames
-})
-```
-
-**Solution B** (legacy): Keep `msg2p-newapi: 0` and attach a probe to generate `EventMessageUserMetadata`:
-```python
-# Option B1: Use built-in probe (simplest)
-pipeline.attach("osd", "add_message_meta_probe", "metadata generator")
-
-# Option B2: Custom EventMessageGenerator (for multi-camera / custom sensor mappings)
-from pyservicemaker import Probe, BatchMetadataOperator
-
-class EventMessageGenerator(BatchMetadataOperator):
-    def __init__(self, sensor_map, labels):
-        super().__init__()
-        self._sensor_map = sensor_map
-        self._labels = labels
-
-    def handle_metadata(self, batch_meta, frame_interval=1):
-        for frame_meta in batch_meta.frame_items:
-            for object_meta in frame_meta.object_items:
-                event_msg = batch_meta.acquire_event_message_meta()
-                if event_msg:
-                    source_id = frame_meta.source_id
-                    sensor_info = self._sensor_map.get(source_id)
-                    sensor_id = sensor_info.sensor_id if sensor_info else "N/A"
-                    uri = sensor_info.uri if sensor_info else "N/A"
-                    event_msg.generate(object_meta, frame_meta, sensor_id, uri, self._labels)
-                    frame_meta.append(event_msg)
-
-# Attach UPSTREAM of nvmsgconv
-pipeline.attach("tracker", Probe("event_msg_gen", EventMessageGenerator(sensor_map, labels)))
-```
-
-**Reference samples**:
-- Built-in probe: `/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test4_app/deepstream_test4.py`
-- Custom generator: `/opt/nvidia/deepstream/deepstream/service-maker/sources/apps/python/pipeline_api/deepstream_test5_app/deepstream_test5.py`
-
----
-
-### Error: `nvmsgbroker: Failed to send message`
-
-**Symptom**: Messages not reaching Kafka.
-
-**Solutions**:
-
-1. **Check connection string format** (semicolon, not colon):
-```python
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "conn-str": "localhost;9092",  # Use semicolon separator!
-    # ...
-})
-```
-
-2. **Verify Kafka is running**:
-```bash
-# Check Kafka
-kafka-topics.sh --list --bootstrap-server localhost:9092
-```
-
-3. **Check protocol library path**:
-```python
-pipeline.add("nvmsgbroker", "msgbroker", {
-    "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    # ...
-})
-```
-
----
-
-### Error: `nvmsgbroker cannot have downstream elements`
-
-**Symptom**: Pipeline fails when linking elements after nvmsgbroker.
-
-**Cause**: nvmsgbroker is a **sink** element.
-
-**Wrong Code**:
-```python
-# ❌ Wrong - msgbroker is a sink
-pipeline.link("tracker", "msgconv", "msgbroker", "osd", "sink")
-```
-
-**Solution**: Use tee to split pipeline:
-```python
-# ✅ Correct - use tee to split
-pipeline.add("tee", "tee")
-pipeline.add("queue", "queue_msg")
-pipeline.add("queue", "queue_video")
-
-pipeline.link("tracker", "tee")
-pipeline.link(("tee", "queue_msg"), ("src_%u", ""))
-pipeline.link("queue_msg", "msgconv", "msgbroker")
-pipeline.link(("tee", "queue_video"), ("src_%u", ""))
-pipeline.link("queue_video", "osd", "sink")
-```
-
----
-
-## Debugging Tips
-
-### Enable GStreamer Debug Output
-
-```bash
-# Basic debugging
-export GST_DEBUG=3
-
-# Plugin-specific debugging
-export GST_DEBUG=nvinfer:5,nvstreammux:4
-
-# Write to file
-export GST_DEBUG_FILE=debug.log
-```
-
-### Debug Levels
-
-| Level | Name | Description |
-|-------|------|-------------|
-| 0 | NONE | No output |
-| 1 | ERROR | Errors only |
-| 2 | WARNING | Warnings and errors |
-| 3 | FIXME | Adds FIXME messages (unhandled code paths) |
-| 4 | INFO | Adds informational messages |
-| 5 | DEBUG | Adds debug messages; levels 6 (LOG) and 7 (TRACE) are more verbose still |
-
-### Check Plugin Availability
-
-```bash
-# List all DeepStream plugins
-gst-inspect-1.0 | grep nv
-
-# Check specific plugin
-gst-inspect-1.0 nvinfer
-gst-inspect-1.0 nvstreammux
-gst-inspect-1.0 nvtracker
-```
-
-### Pipeline Visualization
-
-```bash
-# Generate pipeline graphs (the directory must exist before the pipeline starts)
-export GST_DEBUG_DUMP_DOT_DIR=/tmp/dots
-# Run pipeline, then render each .dot file to its own <name>.dot.png:
-dot -Tpng -O /tmp/dots/*.dot
-```
-
----
-
-## Quick Reference: Error → Solution
-
-| Error | Quick Fix |
-|-------|-----------|
-| `iterator has no len()` | Iterate to count, don't use `len()` |
-| `pad template not found` | Use `"sink_%u"` not `"sink_0"` |
-| Queue data loss | Use `multiprocessing.Queue` with `Process` |
-| Config parse failed | Use `property:` not `model:` in YAML |
-| `is-classifier` deprecation warning | Use `network-type: 1` instead of `is-classifier: 1`; omit both for detectors |
-| `min-boxes` unknown key warning | Use `minBoxes` (camelCase), not `min-boxes` |
-| `setDimensions` negative dims / engine build failed | Add `infer-dims=C;H;W` for dynamic ONNX models (e.g., `infer-dims=3;640;640`) |
-| Model not found | Use absolute paths, verify file exists |
-| Element not created | Check plugin name, set `LD_LIBRARY_PATH` |
-| Link failed | Add `nvvideoconvert` for format conversion |
-| Pipeline stalled | Add queues, check sync settings |
-| CUDA OOM | Reduce batch size, use FP16 |
-| Buffer corruption | Clone tensors before async use |
-| Secondary GIE inactive | Set `process-mode: 2`, check `operate-on-gie-id` |
-| No display | Use `fakesink` for headless |
-| Kafka connection failed | Use `localhost;9092` (semicolon, not colon) |
-| Kafka no messages | Set `msg2p-newapi: True`, OR attach `EventMessageUserMetadata` probe (see Kafka section) |
-| msgbroker downstream | Use `tee` to split pipeline |
-| Dynamic source stuck in PAUSED | Set `async: 0` on sink element |
-| No data from RTSP | Test URL with ffplay, check credentials |
-| `No module named 'pyservicemaker'` in venv | `pip install /opt/nvidia/deepstream/deepstream/service-maker/python/pyservicemaker*.whl pyyaml` inside the venv |
-
----
-
-## Dynamic Source Management Errors
-
-### Error: Stream added but stuck in PAUSED state
-
-**Symptom**: REST API returns success, `DynamicSourceMessage` received, but video doesn't display. Elements stay in PAUSED state.
-
-```
-[Pipeline] src -> READY
-[Pipeline] src -> PAUSED
-# Never transitions to PLAYING
-```
-
-**Cause**: Missing `async=0` on sink element. The sink waits for preroll (first buffer) before allowing state transitions, creating a deadlock.
-
-**Solution**:
-```python
-# ✅ CORRECT - async=0 is CRITICAL for dynamic sources
-pipeline.add("nveglglessink", "sink", {
-    "sync": 0,
-    "qos": 0,
-    "async": 0  # This is the fix
-})
-
-# ❌ WRONG - Will cause state transition deadlock
-pipeline.add("nveglglessink", "sink", {"sync": 0})
-```
-
----
-
-### Error: No data from source, reconnection attempts
-
-**Symptom**:
-```
-WARNING from dsnvurisrcbin0: No data from source since last 10 sec. Trying reconnection
-Could not send message. (Received end-of-file)
-```
-
-**Cause**: RTSP connection issue - invalid URL, authentication required, or network problem.
-
-**Solutions**:
-1. Test RTSP URL directly:
-```bash
-ffplay "rtsp://camera-ip/stream"
-```
-
-2. Include credentials in URL:
-```
-rtsp://username:password@camera-ip/stream
-```
-
-3. Try TCP-only mode:
-```python
-"select-rtp-protocol": 4  # TCP only instead of auto
-```
-
----
-
-### Anti-Pattern: Custom REST Server for Stream Management
-
-**❌ WRONG**: Implementing a separate Flask/FastAPI server for stream management.
-
-```python
-# Don't do this - adds complexity and potential bugs
-from flask import Flask
-app = Flask(__name__)
-
-@app.route('/add-camera')
-def add_camera():
-    pass  # Custom implementation
-```
-
-**✅ CORRECT**: Use nvmultiurisrcbin's built-in REST server.
-
-```python
-pipeline.add("nvmultiurisrcbin", "src", {
-    "port": 9000,  # Built-in REST API at http://localhost:9000/api/v1/
-    # ...
-})
-```
-
-See `rest_api_dynamic.md` for complete REST API documentation.
-
----
-
-## Related Documentation
-
-- **GStreamer Plugins Overview**: `gstreamer_plugins.md`
-- **Service Maker Python API**: `service_maker_api.md`
-- **Best Practices**: `best_practices.md`
-- **nvinfer Configuration**: `nvinfer_config.md`
-- **Tracker Configuration**: `tracker_config.md`
diff --git a/skills/deepstream/deepstream-dev/references/use_cases_pipelines.md b/skills/deepstream/deepstream-dev/references/use_cases_pipelines.md
deleted file mode 100644
index ce41d228..00000000
--- a/skills/deepstream/deepstream-dev/references/use_cases_pipelines.md
+++ /dev/null
@@ -1,1079 +0,0 @@
-# Use Cases: Pipeline Construction Patterns
-
-## Overview
-
-This document covers two fundamental DeepStream pipeline construction patterns. **Part 1** explains how to build a simple video player -- reading video from a file or stream, decoding it with hardware acceleration, and displaying it on screen without any AI inference. **Part 2** builds on that foundation to construct multi-inference pipelines that chain primary and secondary inference engines for object detection, classification, and attribute extraction across one or more video streams.
-
----
-
-## Part 1: Simple Video Player
-
-### Use Case Requirements
-
-- Read video from file (H.264/H.265) or network stream (RTSP)
-- Hardware-accelerated video decoding
-- Display video on screen
-- Handle multiple video formats
-- Support for different platforms (x86_64 and ARM64/Jetson)
-
-### Pipeline Architecture
-
-#### Minimal Pipeline
-```
-Source -> Parser -> Decoder -> Converter -> Renderer
-```
-
-#### Detailed Pipeline Elements
-
-1. **Source**: `filesrc` (for files) or `nvurisrcbin` (for URIs)
-2. **Parser**: `h264parse` or `h265parse`
-3. **Decoder**: `nvv4l2decoder` (hardware-accelerated)
-4. **Converter**: `nvvideoconvert` (format conversion if needed)
-5. **Renderer**: `nveglglessink` (x86_64) or `nv3dsink` (Jetson)
-
-### Implementation Approaches
-
-#### Approach 1: Pipeline API (Python)
-
-**Language: Python**
-**Target Audience: Python developers**
-**Recommended for: Python applications**
-
-```python
-from pyservicemaker import Pipeline
-import platform
-import sys
-
-def simple_video_player(video_path):
-    """
-    Simple video player using DeepStream Pipeline API
-
-    Args:
-        video_path: Path to video file or URI (rtsp://, file://, etc.)
-    """
-    pipeline = Pipeline("simple-player")
-
-    # Track the source kind locally; the linking step below branches on it
-    is_uri = video_path.startswith(("rtsp://", "http://", "file://"))
-    use_demux = False
-    if is_uri:
-        # nvurisrcbin demuxes, parses, and decodes internally
-        pipeline.add("nvurisrcbin", "src", {"uri": video_path})
-    else:
-        # Use filesrc for local files
-        pipeline.add("filesrc", "src", {"location": video_path})
-        # Add parser based on file extension or use qtdemux
-        if video_path.endswith(('.h264', '.264')):
-            pipeline.add("h264parse", "parser")
-        elif video_path.endswith(('.h265', '.265', '.hevc')):
-            pipeline.add("h265parse", "parser")
-        else:
-            # For MP4/MOV files, use qtdemux (assumes an H.264 video track)
-            use_demux = True
-            pipeline.add("qtdemux", "demux")
-            pipeline.add("h264parse", "parser")
-        # Hardware-accelerated decoder (only needed for file sources)
-        pipeline.add("nvv4l2decoder", "decoder")
-
-    # Video converter (may be needed for format conversion)
-    pipeline.add("nvvideoconvert", "converter", {"gpu-id": 0})
-
-    # Renderer (platform-specific)
-    sink_type = "nv3dsink" if platform.machine() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {"sync": 1})
-
-    # Link elements
-    if is_uri:
-        # nvurisrcbin already outputs decoded video, so there is no decoder to link
-        pipeline.link("src", "converter", "sink")
-    elif use_demux:
-        pipeline.link("src", "demux")
-        pipeline.link(("demux", "parser"), ("video_%u", ""))
-        pipeline.link("parser", "decoder", "converter", "sink")
-    else:
-        pipeline.link("src", "parser", "decoder", "converter", "sink")
-
-    # Start and wait
-    try:
-        pipeline.start().wait()
-    except KeyboardInterrupt:
-        print("\nPlayback interrupted")
-    except Exception as e:
-        print(f"Error: {e}")
-
-if __name__ == "__main__":
-    if len(sys.argv) != 2:
-        print("Usage: python simple_player.py <video_file_or_uri>")
-        sys.exit(1)
-
-    simple_video_player(sys.argv[1])
-```
-
-#### Approach 2: Flow API (Python)
-
-**Language: Python**
-**Target Audience: Python developers**
-**Recommended for: Python applications**
-
-```python
-from pyservicemaker import Pipeline, Flow
-import platform
-import sys
-
-def simple_video_player_flow(video_path):
-    """
-    Simple video player using DeepStream Flow API
-    """
-    pipeline = Pipeline("simple-player-flow")
-    flow = Flow(pipeline)
-
-    # Flow API doesn't directly support simple playback
-    # This is a simplified example - Flow API is better for inference pipelines
-    # For simple playback, use Pipeline API instead
-
-    # However, we can still use Flow API with custom pipeline construction
-    # This requires manual pipeline building
-    pass
-
-if __name__ == "__main__":
-    simple_video_player_flow(sys.argv[1])
-```
-
-#### Approach 3: GStreamer Command Line
-
-```bash
-# For H.264 file
-gst-launch-1.0 filesrc location=/path/to/video.h264 ! \
-    h264parse ! \
-    nvv4l2decoder ! \
-    nvvideoconvert ! \
-    nveglglessink sync=1
-
-# For MP4 file
-gst-launch-1.0 filesrc location=/path/to/video.mp4 ! \
-    qtdemux ! \
-    h264parse ! \
-    nvv4l2decoder ! \
-    nvvideoconvert ! \
-    nveglglessink sync=1
-
-# For RTSP stream
-# (nvurisrcbin demuxes and decodes internally, so no separate decoder is needed)
-gst-launch-1.0 nvurisrcbin uri=rtsp://camera-ip/stream ! \
-    nvvideoconvert ! \
-    nveglglessink sync=1
-
-# For Jetson platform
-gst-launch-1.0 filesrc location=/path/to/video.h264 ! \
-    h264parse ! \
-    nvv4l2decoder ! \
-    nvvideoconvert ! \
-    nv3dsink sync=1
-```
-
-#### Approach 4: C/C++ Application
-
-**Note: This section is specifically for C/C++ applications only. For Python applications, use Approach 1 (Pipeline API) or Approach 2 (Flow API) instead.**
-
-This approach demonstrates how to build a simple video player using the GStreamer C API directly. This is a native C/C++ implementation that provides low-level control over the GStreamer pipeline.
-
-**Language: C/C++**
-**Target Audience: C/C++ developers**
-**Not applicable for: Python applications**
-
-```c
-#include <gst/gst.h>
-#include <glib.h>
-
-typedef struct {
-    GstElement *pipeline;
-    GstElement *source;
-    GstElement *parser;
-    GstElement *decoder;
-    GstElement *converter;
-    GstElement *sink;
-} AppData;
-
-int main(int argc, char *argv[]) {
-    GstBus *bus;
-    GstMessage *msg;
-    AppData data;
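-    // Initialize GStreamer, then require the input file argument
-    gst_init(&argc, &argv);
-    if (argc != 2) { g_printerr("Usage: %s <h264-file>\n", argv[0]); return -1; }
-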
-    // Create elements
-    data.pipeline = gst_pipeline_new("simple-player");
-    data.source = gst_element_factory_make("filesrc", "source");
-    data.parser = gst_element_factory_make("h264parse", "parser");
-    data.decoder = gst_element_factory_make("nvv4l2decoder", "decoder");
-    data.converter = gst_element_factory_make("nvvideoconvert", "converter");
-
-    // Platform-specific sink
-    #ifdef __aarch64__
-        data.sink = gst_element_factory_make("nv3dsink", "sink");
-    #else
-        data.sink = gst_element_factory_make("nveglglessink", "sink");
-    #endif
-
-    if (!data.pipeline || !data.source || !data.parser ||
-        !data.decoder || !data.converter || !data.sink) {
-        g_printerr("Not all elements could be created.\n");
-        return -1;
-    }
-
-    // Set source location
-    g_object_set(data.source, "location", argv[1], NULL);
-
-    // Set sink sync
-    g_object_set(data.sink, "sync", 1, NULL);
-
-    // Add elements to pipeline
-    gst_bin_add_many(GST_BIN(data.pipeline),
-                      data.source, data.parser, data.decoder,
-                      data.converter, data.sink, NULL);
-
-    // Link elements
-    if (gst_element_link_many(data.source, data.parser, data.decoder,
-                              data.converter, data.sink, NULL) != TRUE) {
-        g_printerr("Elements could not be linked.\n");
-        gst_object_unref(data.pipeline);
-        return -1;
-    }
-
-    // Set pipeline to PLAYING state
-    gst_element_set_state(data.pipeline, GST_STATE_PLAYING);
-
-    // Wait for EOS or error
-    bus = gst_element_get_bus(data.pipeline);
-    msg = gst_bus_timed_pop_filtered(bus, GST_CLOCK_TIME_NONE,
-                                      GST_MESSAGE_ERROR | GST_MESSAGE_EOS);
-
-    // Cleanup
-    if (msg != NULL)
-        gst_message_unref(msg);
-    gst_object_unref(bus);
-    gst_element_set_state(data.pipeline, GST_STATE_NULL);
-    gst_object_unref(data.pipeline);
-
-    return 0;
-}
-```
-
-**End of C/C++ Implementation** - This section contains C/C++ code only. For Python implementations, refer to Approach 1 (Pipeline API) or Approach 2 (Flow API) above.
-
-### Enhanced Video Player Features
-
-#### Feature 1: Multi-Format Support
-
-```python
-from pyservicemaker import Pipeline
-import platform
-import os
-
-def detect_video_format(video_path):
-    """Detect video format from file extension"""
-    ext = os.path.splitext(video_path)[1].lower()
-    formats = {
-        '.h264': 'h264',
-        '.264': 'h264',
-        '.h265': 'h265',
-        '.265': 'h265',
-        '.hevc': 'h265',
-        '.mp4': 'mp4',
-        '.mov': 'mp4',
-        '.mkv': 'mkv'
-    }
-    return formats.get(ext, 'unknown')
-
-def multi_format_player(video_path):
-    """Video player supporting multiple formats"""
-    pipeline = Pipeline("multi-format-player")
-    format_type = detect_video_format(video_path)
-
-    # Source
-    if video_path.startswith(("rtsp://", "http://", "file://")):
-        pipeline.add("nvurisrcbin", "src", {"uri": video_path})
-        # nvurisrcbin handles format detection automatically
-        pipeline.add("nvv4l2decoder", "decoder")
-    else:
-        pipeline.add("filesrc", "src", {"location": video_path})
-
-        if format_type == 'h264':
-            pipeline.add("h264parse", "parser")
-            pipeline.add("nvv4l2decoder", "decoder")
-        elif format_type == 'h265':
-            pipeline.add("h265parse", "parser")
-            pipeline.add("nvv4l2decoder", "decoder")
-        elif format_type in ['mp4', 'mkv']:
-            demux_type = "qtdemux" if format_type == 'mp4' else "matroskademux"
-            pipeline.add(demux_type, "demux")
-            pipeline.add("h264parse", "parser")
-            pipeline.add("nvv4l2decoder", "decoder")
-        else:
-            print(f"Unsupported format: {format_type}")
-            return
-
-    # Converter and sink
-    pipeline.add("nvvideoconvert", "converter")
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {"sync": 1})
-
-    # Link based on how the pipeline was built above.
-    # Element names here are "src", "demux", "parser", ...; the factory name
-    # "nvurisrcbin" is never an element name, so branch on the input instead.
-    if video_path.startswith(("rtsp://", "http://", "file://")):
-        pipeline.link("src", "decoder", "converter", "sink")
-    elif format_type in ('mp4', 'mkv'):
-        pipeline.link("src", "demux")
-        pipeline.link(("demux", "parser"), ("video_%u", ""))
-        pipeline.link("parser", "decoder", "converter", "sink")
-    else:
-        pipeline.link("src", "parser", "decoder", "converter", "sink")
-
-    pipeline.start().wait()
-```
-
-#### Feature 2: Window Controls
-
-```python
-def video_player_with_controls(video_path):
-    """Video player with window positioning and sizing"""
-    pipeline = Pipeline("controlled-player")
-
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvvideoconvert", "converter")
-
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {
-        "sync": 1,
-        "window-x": 100,      # Window X position
-        "window-y": 100,      # Window Y position
-        "window-width": 1280, # Window width
-        "window-height": 720  # Window height
-    })
-
-    pipeline.link("src", "parser", "decoder", "converter", "sink")
-    pipeline.start().wait()
-```
-
-#### Feature 3: Frame Rate Control
-
-```python
-def video_player_with_framerate(video_path, fps=None):
-    """Video player with frame rate control"""
-    pipeline = Pipeline("framerate-player")
-
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-
-    # Add videorate for frame rate control
-    if fps:
-        pipeline.add("videorate", "rate")
-        pipeline.add("capsfilter", "caps", {
-            "caps": f"video/x-raw(memory:NVMM),framerate={fps}/1"
-        })
-
-    pipeline.add("nvvideoconvert", "converter")
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {"sync": 1})
-
-    if fps:
-        pipeline.link("src", "parser", "decoder", "rate", "caps", "converter", "sink")
-    else:
-        pipeline.link("src", "parser", "decoder", "converter", "sink")
-
-    pipeline.start().wait()
-```
-
-### Platform-Specific Considerations
-
-#### x86_64 (Desktop/Server)
-- Use `nveglglessink` for rendering
-- Supports multiple displays
-- Higher GPU memory bandwidth
-- Better for high-resolution playback
-
-#### ARM64 (Jetson)
-- Use `nv3dsink` for rendering
-- Optimized for power efficiency
-- Integrated GPU with shared memory
-- Better for embedded applications
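-
-The sink choice above can be isolated in a small helper. This is a minimal sketch, not part of the DeepStream API: `select_sink` is a hypothetical name, and it uses `platform.machine()` (an assumption; the examples in this document call `platform.processor()`, which can return an empty or `unknown` string on some Linux systems).
-
-```python
-import platform
-
-
-def select_sink(machine=None):
-    """Return the renderer element name for a machine string (hypothetical helper)."""
-    # platform.machine() reports the hardware name, e.g. "aarch64" or "x86_64"
-    machine = (machine or platform.machine()).lower()
-    # Jetson boards report an ARM64 machine name; everything else gets the EGL sink
-    return "nv3dsink" if machine in ("aarch64", "arm64") else "nveglglessink"
-
-
-print(select_sink("aarch64"))  # prints nv3dsink
-print(select_sink("x86_64"))   # prints nveglglessink
-```
-
-Passing the machine string explicitly makes the selection testable on a development host that is not a Jetson.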
-
-### Performance Optimization Tips
-
-1. **Always use hardware decoders**: `nvv4l2decoder` instead of software decoders
-2. **Provide headroom**: Raise `num-extra-surfaces` on `nvv4l2decoder` to prevent surface starvation
-3. **Use NVMM memory**: Keeps frames on GPU for nvvideoconvert/sinks
-4. **Sync to display**: Set `sync=1` on sink for smooth playback
-5. **Match resolutions**: Avoid unnecessary scaling
-
-### Error Handling
-
-```python
-from multiprocessing import Process
-import sys
-
-from pyservicemaker import Pipeline
-
-def safe_video_player(video_path):
-    """Video player with error handling"""
-    try:
-        pipeline = Pipeline("safe-player")
-        # ... pipeline construction ...
-        pipeline.start().wait()
-    except KeyboardInterrupt:
-        print("\nPlayback interrupted by user")
-    except Exception as e:
-        print(f"Error during playback: {e}")
-        sys.exit(1)
-
-if __name__ == "__main__":
-    process = Process(target=safe_video_player, args=(sys.argv[1],))
-    try:
-        process.start()
-        process.join()
-    except KeyboardInterrupt:
-        print("\nTerminating...")
-        process.terminate()
-        process.join()
-```
-
-### Common Issues and Solutions
-
-#### Issue 1: Black Screen
-**Solution**: Check if decoder is working, verify video format support
-
-#### Issue 2: Stuttering Playback
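-**Solution**: Check GPU utilization (for example with `nvidia-smi` on x86_64 or `tegrastats` on Jetson) to see whether decoding or rendering is saturated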
-
-#### Issue 3: Format Not Supported
-**Solution**: Use `nvurisrcbin` for automatic format detection, or add appropriate parser
-
-#### Issue 4: High CPU Usage
-**Solution**: Ensure hardware decoder is used, not software decoder
-
----
-
-## Part 2: Multi-Inference Pipelines
-
-### Use Case Requirements
-
-- Detect objects using primary inference engine
-- Classify detected objects using secondary inference engines
-- Extract multiple attributes (e.g., vehicle make, vehicle type, color)
-- Process multiple video streams simultaneously
-- Track objects across frames
-- Visualize all inference results
-
-### Pipeline Architecture
-
-#### Cascaded Inference Pipeline
-```
-Source -> Decoder -> Muxer -> PGIE -> SGIE1 -> SGIE2 -> Tracker -> OSD -> Renderer
-```
-
-#### Parallel Inference Pipeline (Advanced)
-```
-Source -> Decoder -> Muxer -> PGIE -> [SGIE1, SGIE2] -> Merger -> Tracker -> OSD -> Renderer
-```
-
-### Implementation Approaches
-
-#### Approach 1: Cascaded Detection + Classification
-
-This is the most common pattern: detect objects first, then classify each detected object.
-
-##### Pipeline API Implementation
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator, osd
-import platform
-import sys
-
-def cascaded_inference_pipeline(video_path, pgie_config, sgie1_config, sgie2_config=None):
-    """
-    Cascaded inference: Detection -> Classification -> Attribute Detection
-
-    Args:
-        video_path: Path to video file
-        pgie_config: Primary GIE config (object detection)
-        sgie1_config: Secondary GIE config (first classification)
-        sgie2_config: Optional second secondary GIE config
-    """
-    pipeline = Pipeline("cascaded-inference")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-
-    # Stream muxer (batch multiple streams if needed)
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": 1,
-        "width": 1920,
-        "height": 1080
-    })
-
-    # Primary Inference Engine (Object Detection)
-    pipeline.add("nvinfer", "pgie", {
-        "config-file-path": pgie_config,
-        "unique-id": 1
-    })
-
-    # Secondary Inference Engine 1 (Classification)
-    pipeline.add("nvinfer", "sgie1", {
-        "config-file-path": sgie1_config,
-        "unique-id": 2
-    })
-
-    # Secondary Inference Engine 2 (Optional - Additional Classification)
-    if sgie2_config:
-        pipeline.add("nvinfer", "sgie2", {
-            "config-file-path": sgie2_config,
-            "unique-id": 3
-        })
-
-    # Tracker
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml",
-        "tracker-width": 640,
-        "tracker-height": 384
-    })
-
-    # On-Screen Display
-    pipeline.add("nvosdbin", "osd", {
-        "gpu-id": 0
-    })
-
-    # Converter and Sink
-    pipeline.add("nvvideoconvert", "nvvideoconvert", {"gpu-id": 0})
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink", {"sync": 1})
-
-    # Link pipeline
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-
-    # Link inference chain (the OSD element was added under the name "osd")
-    if sgie2_config:
-        pipeline.link("mux", "pgie", "sgie1", "sgie2", "tracker", "osd", "nvvideoconvert", "sink")
-    else:
-        pipeline.link("mux", "pgie", "sgie1", "tracker", "osd", "nvvideoconvert", "sink")
-
-    # Start pipeline
-    pipeline.start().wait()
-
-if __name__ == "__main__":
-    if len(sys.argv) < 4:
-        print("Usage: python cascaded_inference.py <video> <pgie_config> <sgie1_config> [sgie2_config]")
-        sys.exit(1)
-
-    sgie2 = sys.argv[4] if len(sys.argv) > 4 else None
-    cascaded_inference_pipeline(sys.argv[1], sys.argv[2], sys.argv[3], sgie2)
-```
-
-##### Configuration Files
-
-**Primary GIE Config (pgie_config.yml)**:
-```yaml
-property:
-  model-engine-file: /path/to/detector.engine
-  labelfile-path: /path/to/detector_labels.txt
-  batch-size: 1
-  net-scale-factor: 0.0039215697906911373
-  model-color-format: 0
-  num-detected-classes: 4
-  process-mode: 1
-  gie-unique-id: 1
-  network-mode: 0
-  cluster-mode: 2
-
-class-attrs-all:
-  topk: 20
-  nms-iou-threshold: 0.5
-  pre-cluster-threshold: 0.2
-```
-
-**Secondary GIE Config (sgie1_config.yml)**:
-```yaml
-property:
-  model-engine-file: /path/to/classifier.engine
-  labelfile-path: /path/to/classifier_labels.txt
-  batch-size: 16
-  net-scale-factor: 0.0039215697906911373
-  model-color-format: 0
-  process-mode: 2
-  network-mode: 0
-  network-type: 1
-  gie-unique-id: 2
-  operate-on-gie-id: 1
-  operate-on-class-ids: 0
-  classifier-async-mode: 1
-  classifier-threshold: 0.51
-```
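-
-The link between the two configs above is the pair `gie-unique-id` / `operate-on-gie-id`: the secondary engine only processes objects produced by the engine whose ID it names. A minimal sketch of a pre-launch consistency check follows. `check_cascade` is a hypothetical helper, not part of pyservicemaker, and it assumes each config's `property` section has already been loaded into a plain dict.
-
-```python
-def check_cascade(configs):
-    """Return a list of problems found in nvinfer 'property' dicts (hypothetical helper)."""
-    problems = []
-    ids = [c.get("gie-unique-id") for c in configs]
-    # Every engine in the cascade needs its own ID
-    if len(set(ids)) != len(ids):
-        problems.append("gie-unique-id values must be unique")
-    for c in configs:
-        if c.get("process-mode") == 2:  # 2 = secondary (operates on objects)
-            parent = c.get("operate-on-gie-id")
-            # The parent must be another engine that is present in the cascade
-            if parent not in ids or parent == c.get("gie-unique-id"):
-                problems.append(
-                    f"GIE {c.get('gie-unique-id')}: operate-on-gie-id "
-                    f"{parent} does not match another GIE"
-                )
-    return problems
-
-
-pgie = {"gie-unique-id": 1, "process-mode": 1}
-sgie1 = {"gie-unique-id": 2, "process-mode": 2, "operate-on-gie-id": 1}
-print(check_cascade([pgie, sgie1]))  # prints []
-```
-
-The check only covers the ID wiring; it does not validate model paths or class IDs.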
-
-#### Approach 2: Multi-Stream with Cascaded Inference
-
-Process multiple video streams with cascaded inference on each stream.
-
-```python
-def multi_stream_cascaded_inference(video_paths, pgie_config, sgie_configs):
-    """
-    Multi-stream cascaded inference
-
-    Args:
-        video_paths: List of video file paths
-        pgie_config: Primary GIE config
-        sgie_configs: List of secondary GIE configs
-    """
-    pipeline = Pipeline("multi-stream-cascaded")
-    num_streams = len(video_paths)
-
-    # Add sources
-    for i, video_path in enumerate(video_paths):
-        pipeline.add("filesrc", f"src{i}", {"location": video_path})
-        pipeline.add("h264parse", f"parser{i}")
-        pipeline.add("nvv4l2decoder", f"decoder{i}")
-
-    # Stream muxer
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": num_streams,
-        "width": 1920,
-        "height": 1080
-    })
-
-    # Primary Inference
-    pipeline.add("nvinfer", "pgie", {
-        "config-file-path": pgie_config,
-        "unique-id": 1
-    })
-
-    # Secondary Inferences
-    for idx, sgie_config in enumerate(sgie_configs):
-        pipeline.add("nvinfer", f"sgie{idx+1}", {
-            "config-file-path": sgie_config,
-            "unique-id": idx + 2
-        })
-
-    # Tracker
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-
-    # Stream demuxer
-    pipeline.add("nvstreamdemux", "demux")
-
-    # OSD and sinks for each stream
-    for i in range(num_streams):
-        pipeline.add("nvosdbin", f"osd{i}")
-        pipeline.add("nvvideoconvert", f"converter{i}")
-        pipeline.add("nveglglessink", f"sink{i}", {"sync": 1})
-
-    # Link sources to muxer
-    # CRITICAL: Always use "sink_%u" pad template for nvstreammux, NOT f"sink_{i}" or "sink_0"
-    for i in range(num_streams):
-        pipeline.link(f"src{i}", f"parser{i}", f"decoder{i}")
-        pipeline.link((f"decoder{i}", "mux"), ("", "sink_%u"))  # Pad template auto-assigns sink_0, sink_1, etc.
-
-    # Link inference chain
-    link_chain = ["mux", "pgie"]
-    for idx in range(len(sgie_configs)):
-        link_chain.append(f"sgie{idx+1}")
-    link_chain.extend(["tracker", "demux"])
-    pipeline.link(*link_chain)
-
-    # Link demuxer outputs to sinks
-    for i in range(num_streams):
-        pipeline.link(("demux", f"osd{i}"), (f"src_{i}", ""))
-        pipeline.link(f"osd{i}", f"converter{i}", f"sink{i}")
-
-    pipeline.start().wait()
-```
-
-#### Approach 2b: Multi-Stream RTSP with nvurisrcbin and Cascaded Inference
-
-Process multiple RTSP streams using nvurisrcbin with cascaded inference.
-
-```python
-def multi_rtsp_cascaded_inference(rtsp_urls, pgie_config, sgie_configs):
-    """
-    Multi-stream RTSP cascaded inference using nvurisrcbin
-
-    Args:
-        rtsp_urls: List of RTSP stream URLs
-        pgie_config: Primary GIE config
-        sgie_configs: List of secondary GIE configs
-    """
-    pipeline = Pipeline("multi-rtsp-cascaded")
-    num_streams = len(rtsp_urls)
-
-    # Add RTSP sources with nvurisrcbin (handles codec detection and decoding automatically)
-    for i, url in enumerate(rtsp_urls):
-        pipeline.add("nvurisrcbin", f"src{i}", {"uri": url})
-
-    # Stream muxer
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": num_streams,
-        "width": 1920,
-        "height": 1080,
-        "batched-push-timeout": 40000,
-        "live-source": 1  # Important for RTSP streams
-    })
-
-    # Primary Inference
-    pipeline.add("nvinfer", "pgie", {
-        "config-file-path": pgie_config,
-        "unique-id": 1,
-        "batch-size": num_streams
-    })
-
-    # Secondary Inferences
-    for idx, sgie_config in enumerate(sgie_configs):
-        pipeline.add("nvinfer", f"sgie{idx+1}", {
-            "config-file-path": sgie_config,
-            "unique-id": idx + 2,
-            "batch-size": num_streams
-        })
-
-    # Tracker
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-
-    # Tiler for multi-stream display
-    pipeline.add("nvmultistreamtiler", "tiler", {
-        "rows": 2,
-        "columns": 2,
-        "width": 1920,
-        "height": 1080
-    })
-
-    # OSD and sink
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink", {"sync": 0})
-
-    # Link sources to muxer - CRITICAL: Use "sink_%u" pad template
-    # nvurisrcbin creates dynamic src pads, so link directly to mux sink pad template
-    for i in range(num_streams):
-        pipeline.link((f"src{i}", "mux"), ("", "sink_%u"))  # CORRECT - pad template auto-assigns
-        # WRONG: pipeline.link((f"src{i}", "mux"), ("", f"sink_{i}"))  # This will FAIL!
-
-    # Link inference chain
-    link_chain = ["mux", "pgie"]
-    for idx in range(len(sgie_configs)):
-        link_chain.append(f"sgie{idx+1}")
-    link_chain.extend(["tracker", "tiler", "osd", "sink"])
-    pipeline.link(*link_chain)
-
-    pipeline.start().wait()
-```
-
-#### Approach 3: Custom Postprocessing with Tensor Metadata
-
-Use custom postprocessing when built-in parsers don't support your model format.
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator, postprocessing, osd
-import torch  # pip install torch torchvision (not in base DS container)
-import torchvision.ops as ops
-
-class CustomDetectorConverter(postprocessing.ObjectDetectorOutputConverter):
-    """Custom converter for detection model outputs"""
-    NETWORK_WIDTH = 960
-    NETWORK_HEIGHT = 544
-
-    def __init__(self, threshold=0.5):
-        super().__init__()  # initialize the base converter before adding state
-        self.threshold = threshold
-
-    def __call__(self, output_layers):
-        """Convert tensor outputs to detection format"""
-        outputs = []
-
-        # Extract output layers (adjust names based on your model)
-        bbox_layer = output_layers.get('output_bbox/BiasAdd:0')
-        conf_layer = output_layers.get('output_cov/Sigmoid:0')
-
-        if bbox_layer is None or conf_layer is None:
-            return outputs
-
-        # Convert DLPack tensors to PyTorch
-        bbox_tensor = torch.utils.dlpack.from_dlpack(bbox_layer).to('cpu')
-        conf_tensor = torch.utils.dlpack.from_dlpack(conf_layer).to('cpu')
-
-        # Process detections
-        # ... custom processing logic ...
-
-        return outputs
-
-class CustomPostprocessor(BatchMetadataOperator):
-    """Custom postprocessor for tensor outputs"""
-    def __init__(self, converter):
-        super().__init__()
-        self.converter = converter
-        self.stream_width = 1920
-        self.stream_height = 1080
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Process tensor metadata
-            for tensor_meta in frame_meta.tensor_items:
-                output_layers = tensor_meta.as_tensor_output().get_layers()
-                detections = self.converter(output_layers)
-
-                # Scale coordinates
-                scale_x = self.stream_width / self.converter.NETWORK_WIDTH
-                scale_y = self.stream_height / self.converter.NETWORK_HEIGHT
-
-                # Create object metadata
-                for det in detections:
-                    class_id, conf, x1, y1, x2, y2 = det
-
-                    obj_meta = batch_meta.acquire_object_meta()
-                    obj_meta.class_id = int(class_id)
-                    obj_meta.confidence = float(conf)
-                    obj_meta.rect_params.left = x1 * scale_x
-                    obj_meta.rect_params.top = y1 * scale_y
-                    obj_meta.rect_params.width = (x2 - x1) * scale_x
-                    obj_meta.rect_params.height = (y2 - y1) * scale_y
-                    obj_meta.rect_params.border_width = 2
-                    obj_meta.rect_params.border_color = osd.Color(1.0, 0.0, 0.0, 1.0)
-
-                    frame_meta.append(obj_meta)
-
-def custom_postprocessing_pipeline(video_path, infer_config):
-    """Pipeline with custom postprocessing"""
-    pipeline = Pipeline("custom-postprocess")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Inference with tensor output
-    pipeline.add("nvinfer", "infer", {
-        "config-file-path": infer_config,
-        "output-tensor-meta": 1  # Enable tensor metadata output
-    })
-
-    # Disable built-in object metadata generation
-    pipeline["infer"].set({"filter-out-class-ids": "0;1;2;3"})
-
-    # Custom postprocessing
-    converter = CustomDetectorConverter(threshold=0.5)
-    postprocessor = CustomPostprocessor(converter)
-
-    # Tracker, OSD, Sink
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("nveglglessink", "sink", {"sync": 1})
-
-    # Link and attach probe
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "tracker", "osd", "converter", "sink")
-    pipeline.attach("infer", Probe("postprocess", postprocessor))
-
-    pipeline.start().wait()
-```
-
-#### Approach 4: Preprocessing + Inference Pipeline
-
-Use custom preprocessing before inference for ROI-based processing.
-
-```python
-def preprocessing_inference_pipeline(video_path, preprocess_config, infer_config):
-    """Pipeline with custom preprocessing"""
-    pipeline = Pipeline("preprocess-inference")
-
-    # Source and decoding
-    pipeline.add("filesrc", "src", {"location": video_path})
-    pipeline.add("h264parse", "parser")
-    pipeline.add("nvv4l2decoder", "decoder")
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Custom preprocessing
-    pipeline.add("nvdspreprocess", "preprocess", {
-        "config-file": preprocess_config,
-        "gpu-id": 0
-    })
-
-    # Inference with tensor input
-    pipeline.add("nvinfer", "infer", {
-        "config-file-path": infer_config,
-        "input-tensor-meta": 1,  # Use tensor metadata from preprocessing
-        "batch-size": 1
-    })
-
-    # Postprocessing (if needed)
-    pipeline.add("nvdspostprocess", "postprocess", {
-        "postprocesslib-name": "/path/to/libpostprocess.so",
-        "postprocesslib-config-file": "/path/to/postprocess_config.yml"
-    })
-
-    # Tracker, OSD, Sink
-    pipeline.add("nvtracker", "tracker", {
-        "ll-lib-file": "/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so",
-        "ll-config-file": "/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml"
-    })
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nvvideoconvert", "converter")
-    pipeline.add("nveglglessink", "sink", {"sync": 1})
-
-    # Link
-    pipeline.link("src", "parser", "decoder")
-    pipeline.link(("decoder", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "preprocess", "infer", "postprocess", "tracker", "osd", "converter", "sink")
-
-    pipeline.start().wait()
-```
-
-### Metadata Processing Examples
-
-#### Example 1: Extract All Inference Results
-
-```python
-class InferenceResultExtractor(BatchMetadataOperator):
-    """Extract and print all inference results"""
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            print(f"\nFrame {frame_meta.frame_number}:")
-
-            for obj_meta in frame_meta.object_items:
-                print(f"  Object:")
-                print(f"    Class ID: {obj_meta.class_id}")
-                print(f"    Confidence: {obj_meta.confidence:.2f}")
-                print(f"    BBox: ({obj_meta.rect_params.left:.1f}, "
-                      f"{obj_meta.rect_params.top:.1f}, "
-                      f"{obj_meta.rect_params.width:.1f}, "
-                      f"{obj_meta.rect_params.height:.1f})")
-                print(f"    Object ID (Tracking): {obj_meta.object_id}")
-
-                # Check for secondary inference results
-                # Secondary results are stored in object metadata
-                # Access via obj_meta.obj_user_meta_list
-```
-
-#### Example 2: Filter Objects by Confidence
-
-```python
-class ConfidenceFilter(BatchMetadataOperator):
-    """Filter objects by confidence threshold"""
-    def __init__(self, threshold=0.5):
-        super().__init__()
-        self.threshold = threshold
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Remove low-confidence objects
-            objects_to_remove = []
-            for obj_meta in frame_meta.object_items:
-                if obj_meta.confidence < self.threshold:
-                    objects_to_remove.append(obj_meta)
-
-            # Note: Direct removal may not be supported
-            # Instead, mark them or filter in downstream processing
-```
-
-#### Example 3: Aggregate Statistics
-
-```python
-class StatisticsAggregator(BatchMetadataOperator):
-    """Aggregate statistics across frames"""
-    def __init__(self):
-        super().__init__()
-        self.class_counts = {}
-        self.total_frames = 0
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Count frames while iterating; frame_items may not support len()
-            self.total_frames += 1
-            for obj_meta in frame_meta.object_items:
-                class_id = obj_meta.class_id
-                self.class_counts[class_id] = self.class_counts.get(class_id, 0) + 1
-
-    def print_statistics(self):
-        print(f"\nStatistics:")
-        print(f"Total frames processed: {self.total_frames}")
-        print(f"Class distribution:")
-        for class_id, count in self.class_counts.items():
-            print(f"  Class {class_id}: {count} objects")
-```
-
-### Performance Optimization
-
-#### Batch Size Optimization
-
-```python
-def optimize_batch_size(num_streams, gpu_memory_gb):
-    """Estimate a batch size from stream count and GPU memory (rule of thumb)"""
-    # Rule of thumb: 1GB GPU memory per stream for 1080p
-    max_batch = min(num_streams, gpu_memory_gb)
-    # Use power of 2 for better GPU utilization
-    batch_size = 1
-    while batch_size * 2 <= max_batch:
-        batch_size *= 2
-    return batch_size
-```
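-
-Worked through for a few inputs, the rule of thumb above behaves as follows. The function is repeated so the snippet runs on its own; the stream counts and memory sizes are illustrative values, not measurements.
-
-```python
-def optimize_batch_size(num_streams, gpu_memory_gb):
-    """Same rule of thumb as above: cap by streams and GB, round down to a power of 2."""
-    max_batch = min(num_streams, gpu_memory_gb)
-    batch_size = 1
-    while batch_size * 2 <= max_batch:
-        batch_size *= 2
-    return batch_size
-
-
-# (streams, GPU memory in GB) -> batch size
-for streams, mem in [(6, 8), (4, 16), (3, 2)]:
-    print(streams, mem, optimize_batch_size(streams, mem))
-# prints:
-# 6 8 4
-# 4 16 4
-# 3 2 2
-```
-
-Six streams round down to a batch of 4, which is smaller than the stream count, so the power-of-2 rounding trades some batching for alignment.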
-
-#### Inference Precision Selection
-
-```python
-# In inference config file:
-# network-mode: 0 = FP32 (highest accuracy, slowest)
-# network-mode: 1 = FP16 (good balance)
-# network-mode: 2 = INT8 (fastest, may need calibration)
-
-# For production, typically use FP16:
-infer_config = {
-    "network-mode": 1  # FP16
-}
-```
-
-### Common Patterns
-
-#### Pattern 1: Vehicle Detection + Make/Type Classification
-
-```python
-# PGIE: Vehicle detection (cars, trucks, buses)
-# SGIE1: Vehicle make classification (Toyota, Honda, Ford, etc.)
-# SGIE2: Vehicle type classification (sedan, SUV, truck, etc.)
-
-pipeline.link("mux", "pgie", "sgie1", "sgie2", "tracker", "osd", "sink")
-```
-
-#### Pattern 2: Person Detection + Attribute Classification
-
-```python
-# PGIE: Person detection
-# SGIE1: Gender classification
-# SGIE2: Age estimation
-# SGIE3: Clothing classification
-
-pipeline.link("mux", "pgie", "sgie1", "sgie2", "sgie3", "tracker", "osd", "sink")
-```
-
-#### Pattern 3: Multi-Model Ensemble
-
-```python
-# Run multiple detection models and merge results
-# Requires custom postprocessing to combine outputs
-```
-
-### Best Practices
-
-1. **Use appropriate batch sizes**: Match number of streams
-2. **Cascade inferences properly**: Ensure operate-on-gie-id is correct
-3. **Filter classes appropriately**: Use operate-on-class-ids
-4. **Optimize inference precision**: Use FP16 for production
-5. **Monitor GPU memory**: Adjust batch sizes accordingly
-6. **Use tracker after all inferences**: Ensures consistent tracking
-7. **Test with representative data**: Use real-world video samples
diff --git a/skills/deepstream/deepstream-dev/references/utilities_config.md b/skills/deepstream/deepstream-dev/references/utilities_config.md
deleted file mode 100644
index ecbc1e38..00000000
--- a/skills/deepstream/deepstream-dev/references/utilities_config.md
+++ /dev/null
@@ -1,1504 +0,0 @@
-# Utilities and Configuration Classes
-
-## Overview
-
-The `pyservicemaker` module and its `utils` submodule provide a collection of utility classes for monitoring, configuration management, and helper patterns used in DeepStream application development. This document covers:
-
-- **Part 1 -- Performance Monitoring Utilities**: Real-time FPS measurement, stream-level performance tracking, dynamic source monitoring, and model engine file hot-swapping via `PerfMonitor` and `EngineFileMonitor`.
-- **Part 2 -- Configuration and Helper Classes**: Source configuration management (`SourceConfig`, `SensorInfo`, `CameraInfo`), smart recording configuration (`SmartRecordConfig`), custom postprocessing interfaces (`PostProcessing`, `ObjectDetectorOutputConverter`), and factory-based plugin creation (`CommonFactory`).
-
----
-
-# Part 1: Performance Monitoring Utilities
-
-The `pyservicemaker.utils` module provides utilities for monitoring pipeline performance and managing model engine files. These utilities are essential for:
-- Real-time FPS (Frames Per Second) measurement
-- Stream-level performance tracking
-- Dynamic source monitoring
-- Model engine file hot-swapping (On-The-Fly updates)
-- Production deployment monitoring
-
-## Core Classes
-
-### PerfMonitor
-
-A performance monitoring utility that tracks FPS and throughput for DeepStream pipelines.
-
-**Constructor**:
-```python
-from pyservicemaker import utils
-
-perf_monitor = utils.PerfMonitor(
-    batch_size=4,              # Number of streams in batch
-    interval=1,                # Measurement interval in seconds
-    source_type="nvurisrcbin", # Source element type name
-    show_name=True             # Show stream names in output
-)
-```
-
-**Parameters**:
-- `batch_size` (int): Number of streams in the pipeline batch
-- `interval` (int): Performance measurement interval in seconds
-- `source_type` (str): Type name of the source bin (e.g., "nvurisrcbin", "nvmultiurisrcbin")
-- `show_name` (bool): Whether to show stream names in performance logs (default: True)
-
-**Methods**:
-
-#### `apply(element, pad_name)`
-Attach the performance monitor to a pipeline element.
-
-**Parameters**:
-- `element`: Pipeline element to monitor (typically tiler or sink)
-- `pad_name` (str): Name of the pad to monitor (typically "sink")
-
-**Example**:
-```python
-perf_monitor.apply(pipeline["tiler"], "sink")
-```
-
-#### `add_stream(source_id, uri, sensor_id, sensor_name)`
-Add a new stream to monitor (for dynamic sources).
-
-**Parameters**:
-- `source_id` (int): Unique source ID
-- `uri` (str): Stream URI
-- `sensor_id` (str): Sensor identifier
-- `sensor_name` (str): Sensor name
-
-#### `remove_stream(source_id)`
-Remove a stream from monitoring.
-
-**Parameters**:
-- `source_id` (int): Source ID to remove
-
-#### `pause()`
-Pause performance monitoring.
-
-#### `resume()`
-Resume performance monitoring.
-
-### EngineFileMonitor
-
-Monitors TensorRT engine files and triggers On-The-Fly (OTF) model updates when files change.
-
-**Constructor**:
-```python
-from pyservicemaker import utils
-
-engine_monitor = utils.EngineFileMonitor(
-    infer_element,           # nvinfer element
-    engine_file_path         # Path to engine file to monitor
-)
-```
-
-**Parameters**:
-- `infer_element`: The `nvinfer` element to update when engine file changes
-- `engine_file_path` (str): Path to the TensorRT engine file to monitor
-
-**Properties**:
-- `started` (bool): Whether the monitor has been started
-
-**Methods**:
-
-#### `start()`
-Start monitoring the engine file for changes.
-
-**Returns**: bool (True if started successfully)
-
-#### `stop()`
-Stop monitoring the engine file.
-
-**Returns**: bool (True if stopped successfully)
-
-## Performance Monitoring Usage Patterns
-
-### Pattern 1: Basic FPS Monitoring
-
-Monitor FPS for a single-stream pipeline.
-
-```python
-from pyservicemaker import Pipeline, utils
-import platform
-
-def pipeline_with_fps_monitoring(video_uri, config_path):
-    """Pipeline with FPS monitoring"""
-    pipeline = Pipeline("fps-monitored-pipeline")
-
-    # Build pipeline
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path})
-    pipeline.add("nvmultistreamtiler", "tiler", {"rows": 1, "columns": 1})
-    pipeline.add("nvosdbin", "osd")
-
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink")
-
-    # Link elements
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "tiler", "osd", "sink")
-
-    # Create and apply performance monitor
-    perf_monitor = utils.PerfMonitor(
-        batch_size=1,
-        interval=1,  # Report every second
-        source_type="nvurisrcbin",
-        show_name=True
-    )
-
-    # Apply to tiler's sink pad
-    perf_monitor.apply(pipeline["tiler"], "sink")
-
-    # Start pipeline
-    pipeline.start().wait()
-
-# Run with FPS monitoring
-pipeline_with_fps_monitoring(
-    "file:///path/to/video.mp4",
-    "/path/to/config.yml"
-)
-```
-
-**Output Example**:
-```
-**PERF: FPS 0 (Avg) 29.87
-**PERF: FPS 0 (Avg) 30.02
-**PERF: FPS 0 (Avg) 29.95
-```
-
-### Pattern 2: Multi-Stream FPS Monitoring
-
-Monitor FPS for multiple streams with names.
-
-```python
-from pyservicemaker import Pipeline, utils
-import platform
-
-def multi_stream_fps_monitoring(stream_uris, config_path):
-    """Monitor FPS for multiple streams"""
-    pipeline = Pipeline("multi-stream-fps")
-
-    # Add sources
-    for i, uri in enumerate(stream_uris):
-        pipeline.add("nvurisrcbin", f"src{i}", {"uri": uri})
-
-    # Add muxer
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": len(stream_uris),
-        "width": 1920,
-        "height": 1080
-    })
-
-    # Add processing
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path})
-    pipeline.add("nvmultistreamtiler", "tiler", {
-        "rows": 2,
-        "columns": 2,
-        "width": 1920,
-        "height": 1080
-    })
-    pipeline.add("nvosdbin", "osd")
-
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink")
-
-    # Link sources
-    for i in range(len(stream_uris)):
-        pipeline.link((f"src{i}", "mux"), ("", "sink_%u"))
-
-    pipeline.link("mux", "infer", "tiler", "osd", "sink")
-
-    # Create performance monitor
-    perf_monitor = utils.PerfMonitor(
-        batch_size=len(stream_uris),
-        interval=2,  # Report every 2 seconds
-        source_type="nvurisrcbin",
-        show_name=True  # Show stream names
-    )
-
-    # Apply monitor
-    perf_monitor.apply(pipeline["tiler"], "sink")
-
-    # Start pipeline
-    pipeline.start().wait()
-
-# Monitor 4 streams
-streams = [
-    "file:///path/to/video1.mp4",
-    "file:///path/to/video2.mp4",
-    "rtsp://camera1/stream",
-    "rtsp://camera2/stream"
-]
-multi_stream_fps_monitoring(streams, "/path/to/config.yml")
-```
-
-**Output Example**:
-```
-**PERF: FPS 0 (Avg) 29.87
-**PERF: FPS 1 (Avg) 29.92
-**PERF: FPS 2 (Avg) 30.15
-**PERF: FPS 3 (Avg) 29.78
-```
-
-### Pattern 3: Dynamic Source Monitoring
-
-Monitor performance with dynamically added/removed sources.
-
-```python
-from pyservicemaker import (
-    Pipeline, DynamicSourceMessage,
-    utils, SensorInfo
-)
-
-def dynamic_source_fps_monitoring(initial_sources, config_path):
-    """Monitor FPS with dynamic source addition/removal"""
-    pipeline = Pipeline("dynamic-fps-monitoring", config_file=config_path)
-
-    # Sensor map to track sources
-    sensor_map = {}
-
-    # Initialize with static sources
-    for i, source in enumerate(initial_sources):
-        sensor_map[i] = SensorInfo(
-            sensor_id=f"sensor_{i}",
-            sensor_name=f"Camera {i}",
-            uri=source
-        )
-
-    # Create performance monitor
-    perf_monitor = utils.PerfMonitor(
-        batch_size=len(initial_sources),
-        interval=1,
-        source_type="nvmultiurisrcbin",
-        show_name=True
-    )
-
-    # Apply to tiler
-    perf_monitor.apply(pipeline["tiler"], "sink")
-
-    # Message handler for dynamic sources
-    def on_message(message):
-        if isinstance(message, DynamicSourceMessage):
-            source_id = message.source_id
-
-            if message.source_added:
-                # Add new stream to monitoring
-                sensor_map[source_id] = SensorInfo(
-                    sensor_id=message.sensor_id,
-                    sensor_name=message.sensor_name,
-                    uri=message.uri
-                )
-
-                perf_monitor.add_stream(
-                    source_id=source_id,
-                    sensor_id=message.sensor_id,
-                    sensor_name=message.sensor_name,
-                    uri=message.uri
-                )
-
-                print(f"Added stream {source_id}: {message.sensor_name}")
-            else:
-                # Remove stream from monitoring
-                if source_id in sensor_map:
-                    del sensor_map[source_id]
-
-                perf_monitor.remove_stream(source_id)
-                print(f"Removed stream {source_id}")
-
-    # Prepare pipeline with message handler
-    pipeline.prepare(on_message)
-
-    # Start pipeline
-    pipeline.activate()
-    pipeline.wait()
-
-# Start with 2 sources (more can be added dynamically via API)
-initial = [
-    "file:///path/to/video1.mp4",
-    "file:///path/to/video2.mp4"
-]
-dynamic_source_fps_monitoring(initial, "/path/to/config.yml")
-```
-
-### Pattern 4: Performance Monitoring with Pause/Resume
-
-Pause and resume monitoring at runtime, here from a timer-driven control thread.
-
-```python
-from pyservicemaker import Pipeline, utils
-import time
-import threading
-
-def controlled_fps_monitoring(video_uri, config_path):
-    """FPS monitoring with pause/resume control"""
-    pipeline = Pipeline("controlled-monitoring")
-
-    # Build pipeline
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path})
-    pipeline.add("nvmultistreamtiler", "tiler", {"rows": 1, "columns": 1})
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "tiler", "osd", "sink")
-
-    # Create performance monitor
-    perf_monitor = utils.PerfMonitor(
-        batch_size=1,
-        interval=1,
-        source_type="nvurisrcbin"
-    )
-    perf_monitor.apply(pipeline["tiler"], "sink")
-
-    # Control thread
-    def control_monitoring():
-        time.sleep(10)
-        print("Pausing monitoring...")
-        perf_monitor.pause()
-
-        time.sleep(5)
-        print("Resuming monitoring...")
-        perf_monitor.resume()
-
-    control_thread = threading.Thread(target=control_monitoring, daemon=True)
-    control_thread.start()
-
-    # Start pipeline
-    pipeline.start().wait()
-
-controlled_fps_monitoring("file:///path/to/video.mp4", "/path/to/config.yml")
-```
-
-### Pattern 5: Model Engine Hot-Swapping
-
-Monitor and automatically reload updated model engine files.
-
-```python
-from pyservicemaker import Pipeline, PipelineState, StateTransitionMessage, utils
-import platform
-
-def pipeline_with_otf_model_update(video_uri, config_path):
-    """Pipeline with On-The-Fly model engine updates"""
-    pipeline = Pipeline("otf-model-update")
-
-    # Build pipeline
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-    pipeline.add("nvinfer", "pgie", {"config-file-path": config_path})
-    pipeline.add("nvosdbin", "osd")
-
-    sink_type = "nv3dsink" if platform.processor() == "aarch64" else "nveglglessink"
-    pipeline.add(sink_type, "sink")
-
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "pgie", "osd", "sink")
-
-    # Get engine file path from nvinfer element
-    engine_file = pipeline["pgie"].get("model-engine-file")
-
-    # Create engine file monitor
-    model_engine_monitor = utils.EngineFileMonitor(
-        pipeline["pgie"],
-        engine_file
-    )
-
-    # Message handler to start monitor when pipeline is ready
-    def on_message(message):
-        if isinstance(message, StateTransitionMessage):
-            if message.new_state == PipelineState.PLAYING and message.origin == "sink":
-                if not model_engine_monitor.started:
-                    print("Starting model engine monitor...")
-                    model_engine_monitor.start()
-
-    pipeline.prepare(on_message)
-
-    # Start pipeline
-    pipeline.activate()
-    pipeline.wait()
-
-# Pipeline will automatically reload model when engine file changes
-pipeline_with_otf_model_update(
-    "file:///path/to/video.mp4",
-    "/path/to/pgie_config.yml"
-)
-```
-
-### Pattern 6: Combined Performance and Model Monitoring
-
-Use both utilities together for production deployment. This pattern also uses `SourceConfig` and `SensorInfo` (see Part 2 below for details on those classes).
-
-```python
-from pyservicemaker import (
-    Pipeline, PipelineState, StateTransitionMessage,
-    DynamicSourceMessage, utils, SensorInfo, SourceConfig
-)
-
-def production_pipeline_monitoring(source_config_file, pipeline_config_file):
-    """Production pipeline with full monitoring"""
-    # Load configuration
-    source_config = SourceConfig()
-    source_config.load(source_config_file)
-
-    # Create pipeline
-    pipeline = Pipeline("production-pipeline", config_file=pipeline_config_file)
-
-    # Sensor map
-    sensor_map = {}
-    for i, sensor in enumerate(source_config.sensor_list):
-        sensor_map[i] = sensor
-
-    # Create performance monitor
-    perf_monitor = utils.PerfMonitor(
-        batch_size=len(source_config.sensor_list),
-        interval=5,  # Report every 5 seconds
-        source_type=source_config.source_type,
-        show_name=True
-    )
-    perf_monitor.apply(pipeline["tiler"], "sink")
-
-    # Create model engine monitor
-    engine_file = pipeline["pgie"].get("model-engine-file")
-    model_engine_monitor = utils.EngineFileMonitor(
-        pipeline["pgie"],
-        engine_file
-    )
-
-    # Message handler
-    def on_message(message):
-        if isinstance(message, StateTransitionMessage):
-            if message.new_state == PipelineState.PLAYING and message.origin == "sink":
-                # Start monitors when pipeline is playing
-                if not model_engine_monitor.started:
-                    model_engine_monitor.start()
-                    print("Model engine monitoring started")
-
-        elif isinstance(message, DynamicSourceMessage):
-            source_id = message.source_id
-
-            if message.source_added:
-                sensor_map[source_id] = SensorInfo(
-                    sensor_id=message.sensor_id,
-                    sensor_name=message.sensor_name,
-                    uri=message.uri
-                )
-                perf_monitor.add_stream(
-                    source_id=source_id,
-                    sensor_id=message.sensor_id,
-                    sensor_name=message.sensor_name,
-                    uri=message.uri
-                )
-                print(f"Stream added: {message.sensor_name}")
-            else:
-                if source_id in sensor_map:
-                    del sensor_map[source_id]
-                perf_monitor.remove_stream(source_id)
-                print(f"Stream removed: {source_id}")
-
-    pipeline.prepare(on_message)
-
-    # Start pipeline
-    pipeline.activate()
-    pipeline.wait()
-
-# Run production pipeline
-production_pipeline_monitoring(
-    "source_config.yaml",
-    "pipeline_config.yaml"
-)
-```
-
-### Pattern 7: Custom FPS Logging
-
-Capture FPS data for custom analysis.
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator, utils
-import time
-import json
-
-class FPSLogger(BatchMetadataOperator):
-    """Custom FPS logger"""
-    def __init__(self, log_file="fps_log.json"):
-        super().__init__()
-        self.log_file = log_file
-        self.frame_count = 0
-        self.start_time = time.time()
-        self.last_log_time = self.start_time
-        self.fps_data = []
-
-    def handle_metadata(self, batch_meta):
-        self.frame_count += len(batch_meta.frame_items)
-
-        current_time = time.time()
-        elapsed = current_time - self.last_log_time
-
-        if elapsed >= 1.0:  # Log every second
-            fps = self.frame_count / elapsed
-
-            log_entry = {
-                "timestamp": current_time,
-                "fps": fps,
-                "frames_in_interval": self.frame_count,  # Counter is reset after each log entry
-                "elapsed_total": current_time - self.start_time
-            }
-
-            self.fps_data.append(log_entry)
-            print(f"FPS: {fps:.2f}")
-
-            # Save to file
-            with open(self.log_file, 'w') as f:
-                json.dump(self.fps_data, f, indent=2)
-
-            self.frame_count = 0
-            self.last_log_time = current_time
-
-def pipeline_with_custom_fps_logging(video_uri, config_path):
-    """Pipeline with custom FPS logging"""
-    pipeline = Pipeline("custom-fps-logging")
-
-    # Build pipeline
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-    pipeline.add("nvinfer", "infer", {"config-file-path": config_path})
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "osd", "sink")
-
-    # Attach the custom FPS logger as a probe on the inference element
-    # (Probe is already imported at the top of the module)
-    fps_logger = FPSLogger("custom_fps_log.json")
-    pipeline.attach("infer", Probe("fps_logger", fps_logger))
-
-    # Also use built-in performance monitor
-    perf_monitor = utils.PerfMonitor(
-        batch_size=1,
-        interval=1,
-        source_type="nvurisrcbin"
-    )
-    perf_monitor.apply(pipeline["osd"], "sink")
-
-    pipeline.start().wait()
-
-pipeline_with_custom_fps_logging("file:///path/to/video.mp4", "/path/to/config.yml")
-```
-
-## Performance Monitoring Best Practices
-
-### 1. Choose Appropriate Monitoring Interval
-```python
-# For real-time monitoring
-perf_monitor = utils.PerfMonitor(batch_size=4, interval=1)
-
-# For less frequent updates (production)
-perf_monitor = utils.PerfMonitor(batch_size=4, interval=5)
-
-# For detailed analysis (assumes sub-second intervals are accepted; `interval` is documented as an int)
-perf_monitor = utils.PerfMonitor(batch_size=4, interval=0.5)
-```
-
-### 2. Monitor at Appropriate Pipeline Point
-```python
-# Monitor at the tiler's input pad (recommended for multi-stream)
-perf_monitor.apply(pipeline["tiler"], "sink")
-
-# Monitor at final sink
-perf_monitor.apply(pipeline["sink"], "sink")
-
-# Monitor after inference
-perf_monitor.apply(pipeline["infer"], "src")
-```
-
-### 3. Start Engine Monitor After Pipeline is Ready
-```python
-def on_message(message):
-    if isinstance(message, StateTransitionMessage):
-        if message.new_state == PipelineState.PLAYING:
-            if not model_engine_monitor.started:
-                model_engine_monitor.start()
-```
-
-### 4. Keep References to Monitors
-```python
-# Store monitors to prevent garbage collection
-reference_holders = []
-reference_holders.append(perf_monitor)
-reference_holders.append(model_engine_monitor)
-```
-
-### 5. Handle Dynamic Sources Properly
-```python
-# Add stream
-perf_monitor.add_stream(
-    source_id=source_id,
-    sensor_id=sensor_id,
-    sensor_name=sensor_name,
-    uri=uri
-)
-
-# Remove stream
-perf_monitor.remove_stream(source_id)
-```
-
-## Performance Tips
-
-### 1. Monitoring Overhead
-- Performance monitoring has minimal overhead (~0.1% CPU)
-- Use longer intervals (5-10 seconds) for production
-- Disable `show_name` if not needed to reduce string operations
-
-### 2. Engine File Monitoring
-- Engine monitor uses inotify (Linux) for efficient file watching
-- Minimal overhead when file doesn't change
-- Automatic reload triggers brief inference pause
-
-### 3. Multi-Stream Monitoring
-- Per-stream FPS tracking has negligible overhead
-- Batch size should match actual number of streams
-- Update batch size when adding/removing dynamic sources
-
-## Performance Monitoring Common Use Cases
-
-### 1. Production Deployment Monitoring
-Monitor FPS and model updates in production systems.
-
-### 2. Performance Benchmarking
-Measure and log FPS for different configurations.
-
-### 3. Dynamic Stream Management
-Track performance as streams are added/removed.
-
-### 4. Model A/B Testing
-Monitor performance during model hot-swapping.
-
-### 5. Quality of Service (QoS) Monitoring
-Ensure FPS meets SLA requirements.
-
-### 6. Resource Utilization Analysis
-Correlate FPS with system resource usage.
-
-## Performance Monitoring Troubleshooting
-
-### Issue 1: No FPS Output
-**Solution**: Ensure the monitor is applied to the correct element and pad, and verify that the pipeline is running.
-
-### Issue 2: Incorrect FPS Values
-**Solution**: Check that `batch_size` matches the actual number of streams, and verify the monitoring point.
-
-### Issue 3: Engine Monitor Not Triggering
-**Solution**: Ensure the monitor is started after the pipeline reaches PLAYING, and verify that the engine file path is correct.
-
-### Issue 4: Memory Leak with Dynamic Sources
-**Solution**: Always call `remove_stream()` when removing sources, and keep references to the monitors for as long as the pipeline runs.
-
-## Performance Monitoring Summary
-
-The performance monitoring utilities provide essential capabilities for production DeepStream applications:
-
-1. **PerfMonitor**: Real-time FPS tracking and throughput measurement
-   - Per-stream FPS monitoring
-   - Dynamic source support
-   - Pause/resume capability
-   - Minimal overhead
-
-2. **EngineFileMonitor**: Automatic model engine hot-swapping
-   - File change detection
-   - Automatic inference engine reload
-   - Model updates without a pipeline restart (inference pauses briefly during reload)
-   - Production-ready OTF updates
-
-Key features:
-- Real-time performance metrics
-- Multi-stream support
-- Dynamic source tracking
-- Model hot-swapping
-- Production deployment ready
-- Minimal performance overhead
-
-These utilities are essential for monitoring, debugging, and maintaining DeepStream applications in production environments.
-
----
-
-# Part 2: Configuration and Helper Classes
-
-The `pyservicemaker` module provides several configuration and helper classes that simplify DeepStream application development. These classes handle:
-- Source configuration management (video streams, cameras)
-- Smart recording configuration
-- Custom postprocessing interfaces
-- Common factory patterns
-- Signal handling and events
-
-## Core Classes
-
-### SourceConfig
-
-A configuration manager for video sources and cameras.
-
-**Constructor**:
-```python
-from pyservicemaker import SourceConfig
-
-source_config = SourceConfig()
-```
-
-**Properties**:
-- `sensor_list`: List of `SensorInfo` objects (for URI-based sources)
-- `camera_list`: List of `CameraInfo` objects (for physical cameras)
-- `source_type`: Type of source bin (e.g., "nvurisrcbin", "nvmultiurisrcbin", "camerabin")
-- `source_properties`: Dictionary of source properties
-
-**Methods**:
-
-#### `load(config_file)`
-Load source configuration from a YAML file.
-
-**Parameters**:
-- `config_file` (str): Path to YAML configuration file
-
-**Example**:
-```python
-from pyservicemaker import SourceConfig
-
-config = SourceConfig()
-config.load("source_config.yaml")
-
-print(f"Source type: {config.source_type}")
-print(f"Number of sensors: {len(config.sensor_list)}")
-
-for sensor in config.sensor_list:
-    print(f"  Sensor ID: {sensor.sensor_id}")
-    print(f"  Name: {sensor.sensor_name}")
-    print(f"  URI: {sensor.uri}")
-```
-
-**YAML Configuration Format**:
-
-```yaml
-# For URI-based sources (files, RTSP streams)
-source-list:
-  - uri: "file:///path/to/video1.mp4"
-    sensor-id: "sensor-001"
-    sensor-name: "Camera 1"
-
-  - uri: "rtsp://192.168.1.100/stream"
-    sensor-id: "sensor-002"
-    sensor-name: "Camera 2"
-
-source-config:
-  source-bin: "nvurisrcbin"
-  properties:
-    gpu-id: 0
-    cudadec-memtype: 0
-
-# For physical cameras (CSI, V4L2)
-camera-list:
-  - camera-type: "CSI"
-    camera-video-format: "NV12"
-    camera-width: 1920
-    camera-height: 1080
-    camera-fps-n: 30
-    camera-fps-d: 1
-    camera-csi-sensor-id: 0
-    gpu-id: 0
-    nvbuf-mem-type: 0
-
-  - camera-type: "V4L2"
-    camera-video-format: "NV12"
-    camera-width: 1280
-    camera-height: 720
-    camera-fps-n: 30
-    camera-fps-d: 1
-    camera-v4l2-dev-node: 0
-    gpu-id: 0
-    nvbuf-mem-type: 0
-    nvvideoconvert-copy-hw: 0
-```
-
-### SensorInfo
-
-Named tuple containing sensor information for URI-based sources.
-
-**Fields**:
-- `sensor_id` (str): Unique sensor identifier
-- `sensor_name` (str): Human-readable sensor name
-- `uri` (str): Video source URI
-
-**Example**:
-```python
-from pyservicemaker import SensorInfo
-
-sensor = SensorInfo(
-    sensor_id="cam-001",
-    sensor_name="Front Door Camera",
-    uri="rtsp://192.168.1.100/stream"
-)
-
-print(f"ID: {sensor.sensor_id}")
-print(f"Name: {sensor.sensor_name}")
-print(f"URI: {sensor.uri}")
-```
-
-### CameraInfo
-
-Named tuple containing camera configuration for physical cameras.
-
-**Fields**:
-- `camera_type` (str): Camera type ("CSI" or "V4L2")
-- `camera_video_format` (str): Video format (e.g., "NV12", "RGB")
-- `camera_width` (int): Frame width in pixels
-- `camera_height` (int): Frame height in pixels
-- `camera_fps_n` (int): Frame rate numerator
-- `camera_fps_d` (int): Frame rate denominator
-- `camera_csi_sensor_id` (int): CSI sensor ID (for CSI cameras)
-- `camera_v4l2_dev_node` (int): V4L2 device node (for V4L2 cameras)
-- `gpu_id` (int): GPU ID to use
-- `nvbuf_mem_type` (int): Buffer memory type
-- `nvvideoconvert_copy_hw` (int): Hardware copy mode
-
-**Example**:
-```python
-from pyservicemaker import CameraInfo
-
-# CSI camera configuration
-csi_camera = CameraInfo(
-    camera_type="CSI",
-    camera_video_format="NV12",
-    camera_width=1920,
-    camera_height=1080,
-    camera_fps_n=30,
-    camera_fps_d=1,
-    camera_csi_sensor_id=0,
-    camera_v4l2_dev_node=None,  # Not used for CSI cameras
-    gpu_id=0,
-    nvbuf_mem_type=0,
-    nvvideoconvert_copy_hw=0
-)
-```
-
-### SmartRecordConfig
-
-Configuration dataclass for smart recording functionality.
-
-**Constructor**:
-```python
-from pyservicemaker import SmartRecordConfig
-
-config = SmartRecordConfig(
-    proto_lib="/path/to/libnvds_kafka_proto.so",
-    conn_str="localhost;9092",
-    msgconv_config_file="/path/to/msgconv_config.txt",
-    proto_config_file="/path/to/proto_config.txt",
-    topic_list="smart-recording-events",
-    smart_rec_cache=30,
-    smart_rec_container=0,
-    smart_rec_dir_path="./recordings",
-    smart_rec_mode=0
-)
-```
-
-**Required Parameters**:
-- `proto_lib` (str): Path to protocol library (e.g., Kafka protocol library)
-- `conn_str` (str): Connection string for message broker (e.g., "localhost;9092")
-- `msgconv_config_file` (str): Path to message converter configuration file
-- `proto_config_file` (str): Path to protocol configuration file
-- `topic_list` (str): Comma-separated list of topics for message publishing
-
-**Optional Parameters**:
-- `smart_rec_cache` (int): Cache size in seconds (default: 20, range: 0-4294967295)
-- `smart_rec_container` (int): Container format (0=MP4, 1=MKV, default: 0)
-- `smart_rec_dir_path` (str): Directory to save recordings (default: ".")
-- `smart_rec_mode` (int): Recording mode (0=audio+video, 1=video only, 2=audio only, default: 0)
-
-**Example**:
-```python
-from pyservicemaker import SmartRecordConfig
-
-# Create smart recording configuration
-sr_config = SmartRecordConfig(
-    proto_lib="/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-    conn_str="localhost;9092",
-    msgconv_config_file="/opt/nvidia/deepstream/deepstream/sources/libs/kafka_protocol_adaptor/cfg_kafka.txt",
-    proto_config_file="/opt/nvidia/deepstream/deepstream/sources/libs/kafka_protocol_adaptor/cfg_kafka.txt",
-    topic_list="sr-events",
-    smart_rec_cache=30,      # 30 seconds cache
-    smart_rec_container=0,   # MP4 format
-    smart_rec_dir_path="./recordings",
-    smart_rec_mode=0         # Record audio and video
-)
-```
-
-### PostProcessing (Abstract Base Class)
-
-Base class for custom tensor output postprocessing.
-
-**Abstract Method**:
-
-#### `__call__(output_layers)`
-Convert output tensors to real-world representation.
-
-**Parameters**:
-- `output_layers` (Dict): Dictionary of (layer_name, tensor) pairs
-
-**Returns**: Any (depends on implementation)
-
-**Example**:
-```python
-from pyservicemaker import postprocessing
-import torch
-
-class CustomPostProcessing(postprocessing.PostProcessing):
-    def __call__(self, output_layers):
-        # Extract tensors
-        output = output_layers.get('output_layer')
-
-        if output is not None:
-            # Convert to PyTorch
-            torch_tensor = torch.utils.dlpack.from_dlpack(output)
-
-            # Custom processing
-            result = self.process(torch_tensor)
-            return result
-
-        return None
-
-    def process(self, tensor):
-        # Your custom processing logic
-        return tensor.cpu().numpy()
-```
-
-### ObjectDetectorOutputConverter (Abstract Base Class)
-
-Specialized base class for object detection postprocessing.
-
-**Abstract Method**:
-
-#### `__call__(output_layers)`
-Convert output tensors to object detection results.
-
-**Parameters**:
-- `output_layers` (Dict): Dictionary of (layer_name, tensor) pairs
-
-**Returns**: List of bounding boxes in format `[class_id, confidence, x1, y1, x2, y2]`
-
-**Example**:
-```python
-from pyservicemaker import postprocessing
-import torch
-import torchvision.ops as ops
-
-class YOLOv5Converter(postprocessing.ObjectDetectorOutputConverter):
-    def __init__(self, conf_threshold=0.5, nms_threshold=0.4):
-        super().__init__()
-        self.conf_threshold = conf_threshold
-        self.nms_threshold = nms_threshold
-
-    def __call__(self, output_layers):
-        outputs = []
-
-        # Extract output tensor
-        predictions = output_layers.get('output')
-        if predictions is None:
-            return outputs
-
-        # Convert to PyTorch
-        pred_tensor = torch.utils.dlpack.from_dlpack(predictions).cpu()
-
-        # Process predictions
-        # pred_tensor shape: [batch, num_boxes, 85] (for COCO)
-        # Format: [x, y, w, h, obj_conf, class_conf...]
-
-        for detection in pred_tensor[0]:  # Assuming batch size 1
-            obj_conf = detection[4].item()  # Convert the 0-dim tensor to a Python float
-
-            if obj_conf < self.conf_threshold:
-                continue
-
-            # Get class with highest confidence
-            class_confs = detection[5:]
-            class_id = torch.argmax(class_confs).item()
-            class_conf = class_confs[class_id].item()
-
-            confidence = obj_conf * class_conf
-
-            if confidence < self.conf_threshold:
-                continue
-
-            # Convert center format to corner format
-            x_center, y_center, width, height = detection[:4]
-            x1 = (x_center - width / 2).item()
-            y1 = (y_center - height / 2).item()
-            x2 = (x_center + width / 2).item()
-            y2 = (y_center + height / 2).item()
-
-            outputs.append([class_id, confidence, x1, y1, x2, y2])
-
-        # Apply class-agnostic NMS (torchvision's ops.batched_nms suppresses per class instead)
-        if outputs:
-            boxes = torch.tensor([[o[2], o[3], o[4], o[5]] for o in outputs])
-            scores = torch.tensor([float(o[1]) for o in outputs])
-            keep = ops.nms(boxes, scores, self.nms_threshold)
-            outputs = [outputs[i] for i in keep.tolist()]
-
-        return outputs
-```
-
-### CommonFactory
-
-Factory class for creating custom objects and plugins.
-
-**Method**:
-
-#### `create(factory_name, instance_name)`
-Create an instance from a registered factory.
-
-**Parameters**:
-- `factory_name` (str): Name of the factory (e.g., "smart_recording_action")
-- `instance_name` (str): Name for the created instance
-
-**Returns**: Created object instance
-
-**Example**:
-```python
-from pyservicemaker import CommonFactory
-
-# Create smart recording controller
-sr_controller = CommonFactory.create("smart_recording_action", "sr_controller")
-
-# Configure the controller
-if sr_controller:
-    sr_controller.set({
-        "proto-lib": "/path/to/libnvds_kafka_proto.so",
-        "conn-str": "localhost;9092",
-        "msgconv-config-file": "/path/to/msgconv_config.txt",
-        "proto-config-file": "/path/to/proto_config.txt",
-        "topic-list": "sr-events"
-    })
-```
-
-## Configuration and Helper Usage Patterns
-
-### Pattern 1: Load and Use Source Configuration
-
-Load source configuration from YAML and build pipeline.
-
-```python
-from pyservicemaker import Pipeline, SourceConfig
-import platform
-
-def pipeline_from_source_config(source_config_file, pgie_config):
-    """Build pipeline from source configuration file"""
-    # Load source configuration
-    source_config = SourceConfig()
-    source_config.load(source_config_file)
-
-    # Create pipeline
-    pipeline = Pipeline("configured-pipeline")
-
-    # Add sources based on configuration
-    if source_config.source_type == "nvmultiurisrcbin":
-        # Multi-URI source bin
-        uri_list = ','.join([s.uri for s in source_config.sensor_list])
-        sensor_id_list = ','.join([s.sensor_id for s in source_config.sensor_list])
-        sensor_name_list = ','.join([s.sensor_name for s in source_config.sensor_list])
-
-        properties = dict(source_config.source_properties)
-        properties.update({
-            "uri-list": uri_list,
-            "sensor-id-list": sensor_id_list,
-            "sensor-name-list": sensor_name_list
-        })
-
-        pipeline.add("nvmultiurisrcbin", "source", properties)
-        pipeline.add("nvinfer", "pgie", {"config-file-path": pgie_config})
-        pipeline.link("source", "pgie")
-
-    elif source_config.source_type == "camerabin":
-        # Physical cameras
-        pipeline.add("nvstreammux", "mux", {
-            "batch-size": len(source_config.camera_list),
-            "width": 1920,
-            "height": 1080,
-            "live-source": 1
-        })
-
-        for i, camera in enumerate(source_config.camera_list):
-            src_name = f"src_{i}"
-
-            if camera.camera_type == "CSI":
-                pipeline.add("nvarguscamerasrc" if platform.machine() == "aarch64" else "videotestsrc",
-                           src_name, {"sensor-id": camera.camera_csi_sensor_id})
-            elif camera.camera_type == "V4L2":
-                device = f"/dev/video{camera.camera_v4l2_dev_node}"
-                pipeline.add("v4l2src", src_name, {"device": device})
-
-            pipeline.link((src_name, "mux"), ("", "sink_%u"))
-
-        pipeline.add("nvinfer", "pgie", {"config-file-path": pgie_config})
-        pipeline.link("mux", "pgie")
-
-    else:
-        # Individual URI sources
-        pipeline.add("nvstreammux", "mux", {
-            "batch-size": len(source_config.sensor_list),
-            "width": 1920,
-            "height": 1080
-        })
-
-        for i, sensor in enumerate(source_config.sensor_list):
-            src_name = f"src_{i}"
-            properties = dict(source_config.source_properties)
-            properties["uri"] = sensor.uri
-
-            pipeline.add(source_config.source_type, src_name, properties)
-            pipeline.link((src_name, "mux"), ("", "sink_%u"))
-
-        pipeline.add("nvinfer", "pgie", {"config-file-path": pgie_config})
-        pipeline.link("mux", "pgie")
-
-    # Add remaining elements
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-    pipeline.link("pgie", "osd", "sink")
-
-    # Start pipeline
-    pipeline.start().wait()
-
-# Use configuration file
-pipeline_from_source_config("sources.yaml", "pgie_config.yml")
-```
-
-### Pattern 2: Smart Recording with Configuration
-
-Set up smart recording using SmartRecordConfig.
-
-```python
-from pyservicemaker import Pipeline, Flow, SmartRecordConfig
-
-def pipeline_with_smart_recording(video_uris, pgie_config):
-    """Pipeline with smart recording enabled"""
-    # Create smart recording configuration
-    sr_config = SmartRecordConfig(
-        proto_lib="/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-        conn_str="localhost;9092",
-        msgconv_config_file="/opt/nvidia/deepstream/deepstream/sources/libs/kafka_protocol_adaptor/cfg_kafka.txt",
-        proto_config_file="/opt/nvidia/deepstream/deepstream/sources/libs/kafka_protocol_adaptor/cfg_kafka.txt",
-        topic_list="sr-events",
-        smart_rec_cache=30,
-        smart_rec_container=0,  # MP4
-        smart_rec_dir_path="./recordings",
-        smart_rec_mode=0  # Audio + Video
-    )
-
-    # Create pipeline with Flow API
-    pipeline = Pipeline("smart-recording-pipeline")
-    flow = Flow(pipeline)
-
-    # Build pipeline with smart recording
-    flow.batch_capture(video_uris)
-    flow.infer(pgie_config)
-    flow.smart_record(sr_config)  # Enable smart recording
-    flow.render()
-
-    # Execute
-    flow()
-
-# Run with smart recording
-video_sources = [
-    "rtsp://192.168.1.100/stream",
-    "rtsp://192.168.1.101/stream"
-]
-pipeline_with_smart_recording(video_sources, "pgie_config.yml")
-```
-
-### Pattern 3: Custom Postprocessing
-
-Implement custom postprocessing for inference outputs.
-
-```python
-from pyservicemaker import Pipeline, Probe, BatchMetadataOperator, postprocessing
-import torch
-
-class CustomDetectorConverter(postprocessing.ObjectDetectorOutputConverter):
-    """Custom object detector postprocessing"""
-    def __init__(self, threshold=0.5):
-        self.threshold = threshold
-
-    def __call__(self, output_layers):
-        outputs = []
-
-        # Extract your model's output tensors
-        bbox_layer = output_layers.get('bboxes')
-        conf_layer = output_layers.get('confidences')
-        class_layer = output_layers.get('classes')
-
-        if any(layer is None for layer in (bbox_layer, conf_layer, class_layer)):
-            return outputs
-
-        # Convert to PyTorch
-        bboxes = torch.utils.dlpack.from_dlpack(bbox_layer).cpu()
-        confs = torch.utils.dlpack.from_dlpack(conf_layer).cpu()
-        classes = torch.utils.dlpack.from_dlpack(class_layer).cpu()
-
-        # Process detections
-        for bbox, conf, cls in zip(bboxes, confs, classes):
-            if conf > self.threshold:
-                x1, y1, x2, y2 = bbox
-                outputs.append([
-                    int(cls),
-                    float(conf),
-                    float(x1), float(y1),
-                    float(x2), float(y2)
-                ])
-
-        return outputs
-
-class CustomPostprocessor(BatchMetadataOperator):
-    """Apply custom postprocessing to inference results"""
-    def __init__(self):
-        super().__init__()
-        self.converter = CustomDetectorConverter(threshold=0.6)
-
-    def handle_metadata(self, batch_meta):
-        for frame_meta in batch_meta.frame_items:
-            # Process tensor outputs
-            for tensor_meta in frame_meta.tensor_items:
-                output_layers = tensor_meta.as_tensor_output().get_layers()
-                detections = self.converter(output_layers)
-
-                # Create object metadata from detections
-                for det in detections:
-                    obj_meta = batch_meta.acquire_object_meta()
-                    obj_meta.class_id = det[0]
-                    obj_meta.confidence = det[1]
-                    obj_meta.rect_params.left = det[2]
-                    obj_meta.rect_params.top = det[3]
-                    obj_meta.rect_params.width = det[4] - det[2]
-                    obj_meta.rect_params.height = det[5] - det[3]
-                    frame_meta.append(obj_meta)
-
-def pipeline_with_custom_postprocessing(video_uri, config_path):
-    """Pipeline with custom postprocessing"""
-    pipeline = Pipeline("custom-postproc")
-
-    # Build pipeline
-    pipeline.add("nvurisrcbin", "src", {"uri": video_uri})
-    pipeline.add("nvstreammux", "mux", {"batch-size": 1, "width": 1920, "height": 1080})
-
-    # Enable tensor output
-    pipeline.add("nvinfer", "infer", {
-        "config-file-path": config_path,
-        "output-tensor-meta": 1  # Enable tensor output
-    })
-
-    pipeline.add("nvosdbin", "osd")
-    pipeline.add("nveglglessink", "sink")
-
-    pipeline.link(("src", "mux"), ("", "sink_%u"))
-    pipeline.link("mux", "infer", "osd", "sink")
-
-    # Attach custom postprocessor
-    pipeline.attach("infer", Probe("custom-postproc", CustomPostprocessor()))
-
-    pipeline.start().wait()
-
-pipeline_with_custom_postprocessing("file:///path/to/video.mp4", "config.yml")
-```
-
-### Pattern 4: Dynamic Sensor Management
-
-Manage sensors dynamically using SensorInfo. For combining this with performance monitoring, see Part 1 above (Pattern 3: Dynamic Source Monitoring).
-
-```python
-from pyservicemaker import Pipeline, SensorInfo, DynamicSourceMessage
-import time
-import threading
-
-def dynamic_sensor_management():
-    """Manage sensors dynamically"""
-    pipeline = Pipeline("dynamic-sensors", config_file="pipeline_config.yml")
-
-    # Sensor registry
-    active_sensors = {}
-
-    def on_message(message):
-        if isinstance(message, DynamicSourceMessage):
-            source_id = message.source_id
-
-            if message.source_added:
-                # Register new sensor
-                sensor = SensorInfo(
-                    sensor_id=message.sensor_id,
-                    sensor_name=message.sensor_name,
-                    uri=message.uri
-                )
-                active_sensors[source_id] = sensor
-                print(f"Added sensor: {sensor.sensor_name} ({sensor.sensor_id})")
-            else:
-                # Unregister sensor
-                if source_id in active_sensors:
-                    sensor = active_sensors[source_id]
-                    print(f"Removed sensor: {sensor.sensor_name}")
-                    del active_sensors[source_id]
-
-    pipeline.prepare(on_message)
-    pipeline.activate()
-    pipeline.wait()
-
-dynamic_sensor_management()
-```
-
-### Pattern 5: Factory-Based Plugin Creation
-
-Use CommonFactory to create custom plugins.
-
-```python
-from pyservicemaker import Pipeline, CommonFactory, signal
-
-def pipeline_with_factory_plugins(video_uris, config_path):
-    """Pipeline using factory-created plugins"""
-    pipeline = Pipeline("factory-pipeline")
-
-    # Build pipeline
-    pipeline.add("nvstreammux", "mux", {
-        "batch-size": len(video_uris),
-        "width": 1920,
-        "height": 1080
-    })
-
-    for i, uri in enumerate(video_uris):
-        pipeline.add("nvurisrcbin", f"src{i}", {"uri": uri})
-        pipeline.link((f"src{i}", "mux"), ("", "sink_%u"))
-
-    pipeline.add("nvinfer", "pgie", {"config-file-path": config_path})
-    pipeline.add("nvmsgbroker", "msgbroker", {
-        "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-        "conn-str": "localhost;9092",
-        "topic": "analytics"
-    })
-
-    pipeline.link("mux", "pgie", "msgbroker")
-
-    # Create smart recording controller using factory
-    sr_controller = CommonFactory.create("smart_recording_action", "sr_controller")
-
-    if sr_controller and isinstance(sr_controller, signal.Emitter):
-        # Configure smart recording
-        sr_controller.set({
-            "proto-lib": "/opt/nvidia/deepstream/deepstream/lib/libnvds_kafka_proto.so",
-            "conn-str": "localhost;9092",
-            "msgconv-config-file": "/path/to/msgconv_config.txt",
-            "proto-config-file": "/path/to/proto_config.txt",
-            "topic-list": "sr-events"
-        })
-
-        # Attach to sources
-        for i in range(len(video_uris)):
-            sr_controller.attach("start-sr", pipeline[f"src{i}"])
-            sr_controller.attach("stop-sr", pipeline[f"src{i}"])
-            pipeline.attach(f"src{i}", "smart_recording_signal", "sr", "sr-done")
-
-    pipeline.start().wait()
-
-video_sources = ["rtsp://cam1/stream", "rtsp://cam2/stream"]
-pipeline_with_factory_plugins(video_sources, "pgie_config.yml")
-```
-
-## Configuration and Helper Best Practices
-
-### 1. Use Configuration Files
-```python
-# Good: Externalize configuration
-source_config = SourceConfig()
-source_config.load("sources.yaml")
-
-# Avoid: Hardcoding configuration
-sensors = [
-    SensorInfo("001", "Camera 1", "rtsp://..."),
-    SensorInfo("002", "Camera 2", "rtsp://...")
-]
-```
-
-### 2. Validate Configuration
-```python
-source_config = SourceConfig()
-source_config.load("sources.yaml")
-
-if not source_config.sensor_list:
-    raise ValueError("No sensors configured")
-
-if source_config.source_type not in ["nvurisrcbin", "nvmultiurisrcbin"]:
-    raise ValueError(f"Unsupported source type: {source_config.source_type}")
-```
-
-### 3. Use Dataclasses for Configuration
-```python
-# Good: Use SmartRecordConfig dataclass
-sr_config = SmartRecordConfig(
-    proto_lib="/path/to/lib.so",
-    conn_str="localhost;9092",
-    # ... other parameters
-)
-
-# Avoid: Manual dictionary management
-sr_config = {
-    "proto-lib": "/path/to/lib.so",
-    "conn-str": "localhost;9092",
-    # ... other parameters
-}
-```
-
-### 4. Implement Proper Postprocessing
-```python
-class MyConverter(postprocessing.ObjectDetectorOutputConverter):
-    def __call__(self, output_layers):
-        # Always return a list of [class_id, conf, x1, y1, x2, y2] entries
-        outputs = []
-
-        # Process tensors
-        # ...
-
-        return outputs  # Return empty list if no detections
-```
-
-### 5. Handle Factory Creation Errors
-```python
-plugin = CommonFactory.create("plugin_name", "instance_name")
-
-if plugin is None:
-    print("Warning: Failed to create plugin")
-    # Handle gracefully
-else:
-    # Use plugin
-    plugin.set(properties)
-```
-
-## Related APIs
-
-- **Pipeline API**: See `service_maker_api.md`
-- **Flow API**: See `service_maker_api.md`
-- **Postprocessing**: See `service_maker_api.md`
-- **Smart Recording**: See `service_maker_api.md` and `kafka_messaging.md`
-
-## Configuration and Helper Summary
-
-The configuration and helper classes provide essential utilities for DeepStream application development:
-
-1. **SourceConfig**: Manage video sources and cameras from YAML
-2. **SensorInfo/CameraInfo**: Structured sensor and camera information
-3. **SmartRecordConfig**: Configure smart recording functionality
-4. **PostProcessing**: Base class for custom tensor postprocessing
-5. **ObjectDetectorOutputConverter**: Specialized postprocessing for object detection
-6. **CommonFactory**: Create custom plugins and objects
-
-Key features:
-- YAML-based configuration management
-- Structured data classes for type safety
-- Abstract base classes for custom implementations
-- Factory pattern for plugin creation
-- Smart recording configuration
-- Flexible postprocessing framework
-
-These utilities simplify configuration management, enable code reuse, and provide clean interfaces for extending DeepStream functionality.
diff --git a/skills/deepstream/deepstream-dev/skill-card.md b/skills/deepstream/deepstream-dev/skill-card.md
deleted file mode 100644
index b11d53eb..00000000
--- a/skills/deepstream/deepstream-dev/skill-card.md
+++ /dev/null
@@ -1,86 +0,0 @@
-## Description: <br>
-NVIDIA DeepStream SDK 9.0 development with Python pyservicemaker API. Use when building video analytics pipelines, GStreamer-based video processing, TensorRT inference integration, object detection/tracking, or Kafka/message broker integration. <br>
-
-This skill is ready for commercial/non-commercial use. <br>
-
-## Owner
-NVIDIA <br>
-
-### License/Terms of Use: <br>
-CC-BY-4.0 AND Apache-2.0 <br>
-## Use Case: <br>
-Developers and engineers building real-time video analytics pipelines, integrating TensorRT inference, multi-object tracking, and message broker connectivity using the NVIDIA DeepStream SDK 9.0 Python API. <br>
-
-### Deployment Geography for Use: <br>
-Global <br>
-
-## Known Risks and Mitigations: <br>
-Risk: Proposals could introduce incorrect or misleading guidance into skills, so review them before execution. <br>
-Mitigation: Review and scan skill before deployment. <br>
-
-## Reference(s): <br>
-- [GStreamer Plugins Reference](references/gstreamer_plugins.md) <br>
-- [Service Maker API Reference](references/service_maker_api.md) <br>
-- [Use Cases and Pipelines](references/use_cases_pipelines.md) <br>
-- [Kafka Messaging Integration](references/kafka_messaging.md) <br>
-- [Best Practices and Design Patterns](references/best_practices.md) <br>
-- [Buffer APIs](references/buffer_apis.md) <br>
-- [nvinfer Configuration](references/nvinfer_config.md) <br>
-- [Tracker Configuration](references/tracker_config.md) <br>
-- [Troubleshooting Guide](references/troubleshooting.md) <br>
-- [REST API and Dynamic Sources](references/rest_api_dynamic.md) <br>
-- [Docker Containers](references/docker_containers.md) <br>
-- [NVIDIA DeepStream SDK](https://developer.nvidia.com/deepstream-sdk) <br>
-- [DeepStream NGC Container](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/deepstream) <br>
-
-
-## Skill Output: <br>
-**Output Type(s):** [Code, Shell commands, Configuration instructions] <br>
-**Output Format:** [Markdown with inline Python and bash code blocks] <br>
-**Output Parameters:** [1D] <br>
-**Other Properties Related to Output:** [None] <br>
-
-## Evaluation Agents Used: <br>
-- Claude Code (`claude-code`) <br>
-- Codex (`codex`) <br>
-
-
-
-## Evaluation Tasks: <br>
-Evaluated against 7 internal evaluation tasks (5 positive skill-activation, 2 negative) with 2 attempts per task via NVSkills-Eval external profile. <br>
-
-## Evaluation Metrics Used: <br>
-Reported benchmark dimensions: <br>
-- Security: Checks whether skill-assisted execution avoids unsafe behavior such as secret leakage, destructive commands, or unauthorized access. <br>
-- Correctness: Checks whether the agent follows the expected workflow and produces the correct final output. <br>
-- Discoverability: Checks whether the agent loads the skill when relevant and avoids using it when irrelevant. <br>
-- Effectiveness: Checks whether the agent performs measurably better with the skill than without it. <br>
-- Efficiency: Checks whether the agent uses fewer tokens and avoids redundant work. <br>
-
-Underlying evaluation signals used in this run: <br>
-- `skill_execution`: Verifies that the agent loaded the expected skill and workflow. <br>
-- `skill_efficiency`: Checks routing quality, decoy avoidance, and redundant tool usage. <br>
-- `accuracy`: Grades final-answer correctness against the reference answer. <br>
-- `goal_accuracy`: Checks whether the overall user task completed successfully. <br>
-- `behavior_check`: Verifies expected behavior steps, including safety expectations. <br>
-- `token_efficiency`: Compares token usage with and without the skill. <br>
-
-
-
-## Evaluation Results: <br>
-| Dimension | Num | `claude-code` | `codex` |
-|---|---:|---:|---:|
-| Security | 8 | 74% (+9%) | 57% (-2%) |
-| Correctness | 8 | 94% (+6%) | 88% (+9%) |
-| Discoverability | 8 | 86% (+11%) | 76% (+9%) |
-| Effectiveness | 8 | 81% (+6%) | 78% (+9%) |
-| Efficiency | 8 | 72% (+12%) | 64% (+9%) |
-
-## Skill Version(s): <br>
-1.1.0 (source: frontmatter) <br>
-
-## Ethical Considerations: <br>
-NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal team to ensure this skill meets requirements for the relevant industry and use case and addresses unforeseen product misuse. <br>
-
-(For Release on NVIDIA Platforms Only) <br>
-Please report quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://app.intigriti.com/programs/nvidia/nvidiavdp/detail). <br>
diff --git a/skills/deepstream/deepstream-dev/skill.oms.sig b/skills/deepstream/deepstream-dev/skill.oms.sig
deleted file mode 100644
index 593d1120..00000000
--- a/skills/deepstream/deepstream-dev/skill.oms.sig
+++ /dev/null
@@ -1 +0,0 @@
-{"mediaType":"application/vnd.dev.sigstore.bundle.v0.3+json","verificationMaterial":{"x509CertificateChain":{"certificates":[{"rawBytes":"MIICgzCCAgmgAwIBAgIUKIyS7SxNteQIiWzK1dWj85E6520wCgYIKoZIzj0EAwMwVTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjEpMCcGA1UEAwwgTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBJQ0EgMDEwHhcNMjYwNDAxMDAwMDAwWhcNMjgwNDIyMTUzMzA5WjBUMQswCQYDVQQGEwJVUzEbMBkGA1UECgwSTlZJRElBIENvcnBvcmF0aW9uMSgwJgYDVQQDDB9OVklESUEgQWdlbnQgU2tpbGxzIFNpZ25pbmcgMDAxMHYwEAYHKoZIzj0CAQYFK4EEACIDYgAEYoRM9bQl/dGlwSRNi6bTpIJUXH8Nv9GciP6LSflJYYMLCc296kpyuTSsk5ddbAWiDcFX3C/ydX3jwc+qCLYP6uHy9XphyLjOQ27Yb2J6rBLVtRBS1mgGco/Gr7fL6ODco4GaMIGXMB0GA1UdDgQWBBRQ/5ZW3nJ6lmo9SVk7I15o7UGmpTAfBgNVHSMEGDAWgBRPGpILxMBBleJSsBGjrMKsby1CgjAMBgNVHRMBAf8EAjAAMA4GA1UdDwEB/wQEAwIHgDA3BggrBgEFBQcBAQQrMCkwJwYIKwYBBQUHMAGGG2h0dHA6Ly9vY3NwLm5kaXMubnZpZGlhLmNvbTAKBggqhkjOPQQDAwNoADBlAjAUygu/GiOCIXrgGr4SmLgeEVDcEitfFUv7ALbvLVGVyMysB3mxmO/uInZfXzWcJZsCMQDxuoxj4ZmO30jhkPIcCxGFCOvnUsnfU3TfGcouYm4M6iRpbKvtVnHPiy4bi6pcKf0="},{"rawBytes":"MIICiDCCAg6gAwIBAgIUZsIuSv9NkpJCNqtYEfCouVv5BzowCgYIKoZIzj0EAwMwUTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjElMCMGA1UEAwwcTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBDQTAgFw0yNjA0MDEwMDAwMDBaGA85OTk5MTIzMTIzNTk1OVowVTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjEpMCcGA1UEAwwgTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBJQ0EgMDEwdjAQBgcqhkjOPQIBBgUrgQQAIgNiAASI72cR3ctKGg4VWnB3bNja6g1Z2PnOmFEopkPof+QeIcPk9rT+g9MjJnq51EQXL93a7C2GJ9J985G4o2V85VD7wJ1RaXhluHW2rf3y8bQGeAYaKMr5s/hUgn+M3/9WlWejgaAwgZ0wHQYDVR0OBBYEFE8akgvEwEGV4lKwEaOswqxvLUKCMB8GA1UdIwQYMBaAFItnoAjjfuCEUvzyvWyI2vOGvwPjMBIGA1UdEwEB/wQIMAYBAf8CAQAwDgYDVR0PAQH/BAQDAgEGMDcGCCsGAQUFBwEBBCswKTAnBggrBgEFBQcwAYYbaHR0cDovL29jc3AubmRpcy5udmlkaWEuY29tMAoGCCqGSM49BAMDA2gAMGUCMQCeIMMfAbyzPDacw2MxG+Yt1cikrJX/DVxiGfXuHmkkXn6VgSzE79+lkqDErpVO2gYCMCNEColOyvUvkzZGUEI1hQ3PfMgi3FIo9tHoBKMw4/wGBLFpu/0ubtmbBXM6/UMOEw=="},{"rawBytes":"MIICRTCCAcygAwIBAgIUeJdY3rV86EdvFmG7L8LJBsyQFYkwCgYIKoZIzj0EAwMwUTELMAkGA1UEBhMCVVMxGzAZBgNVB
AoMEk5WSURJQSBDb3Jwb3JhdGlvbjElMCMGA1UEAwwcTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBDQTAgFw0yNjA0MDEwMDAwMDBaGA85OTk5MTIzMTIzNTk1OVowUTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjElMCMGA1UEAwwcTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBDQTB2MBAGByqGSM49AgEGBSuBBAAiA2IABAYpiXCDjJ9NT2eSDhyHJVSw1Tbze18cGG2F/578oWvHxg23eQAhNRYdq88i1iOshZSO6C29doKui5Xpmo/7Ctw9Sx4PP2RzOmIuOLCuTdNtKcTRwi4GEsd5BAFvWj42M6NjMGEwHQYDVR0OBBYEFItnoAjjfuCEUvzyvWyI2vOGvwPjMB8GA1UdIwQYMBaAFItnoAjjfuCEUvzyvWyI2vOGvwPjMA8GA1UdEwEB/wQFMAMBAf8wDgYDVR0PAQH/BAQDAgEGMAoGCCqGSM49BAMDA2cAMGQCMCwtAjWLaNwgGWNCgdyNoTyvNhqWRECRJV2r3+7w8g0PL6NHLOsbkgE09BH95h8XlgIwTaQmbbUh2ChAJ5TA1wRiVDnCcvbzHlZl2jM2FcwQQZlk19LOAbyGMRixbu2Ww/rj"}]},"tlogEntries":[]},"dsseEnvelope":{"payload":"ewogICJfdHlwZSI6ICJodHRwczovL2luLXRvdG8uaW8vU3RhdGVtZW50L3YxIiwKICAic3ViamVjdCI6IFsKICAgIHsKICAgICAgIm5hbWUiOiAiZGVlcHN0cmVhbS1kZXYiLAogICAgICAiZGlnZXN0IjogewogICAgICAgICJzaGEyNTYiOiAiOWIzMzAzN2RjM2Y1MzM1NTU0ZTVlMGUwZWQ1ZGZjOGJiMTRhOWYxNDI4NmQ4MWU3NTc4ODU5Yzg4ZTMwOGQ1MSIKICAgICAgfQogICAgfQogIF0sCiAgInByZWRpY2F0ZVR5cGUiOiAiaHR0cHM6Ly9tb2RlbF9zaWduaW5nL3NpZ25hdHVyZS92MS4wIiwKICAicHJlZGljYXRlIjogewogICAgInJlc291cmNlcyI6IFsKICAgICAgewogICAgICAgICJkaWdlc3QiOiAiNDcwNzlmNzVmMTdmMjI4OTY0YzQ3YjBmZjgxNWUxNTc4ZmYwNDg4Y2Q4M2FlYWRlZDQ4MGJmMDlkMmFhM2VjNCIsCiAgICAgICAgIm5hbWUiOiAiLmNsYXVkZS1wbHVnaW4vcGx1Z2luLmpzb24iLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9LAogICAgICB7CiAgICAgICAgImRpZ2VzdCI6ICI0YzVlYTIyNmQwZTE1NDBkM2RlYjYwM2I4MTYyZWQwZTE4N2JjYzVmM2IwODYyOGRhMmI3ZGM1ZjJjYjhmNDliIiwKICAgICAgICAibmFtZSI6ICJCRU5DSE1BUksubWQiLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9LAogICAgICB7CiAgICAgICAgImRpZ2VzdCI6ICIwYzY2M2E4OGU4ZTU3ZjJhOWNlZWNiNDhhNTY4ZWViMDJmZmJhYjdkMzY4YzI2YzZhN2ZlZDg1NTJlNTlhODA3IiwKICAgICAgICAibmFtZSI6ICJTS0lMTC5tZCIsCiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiZGlnZXN0IjogImQ4YjFhZjEwY2NiZWY0ODJiODRkMTliY2ZkOWRlZTlmNzlmOGQ2YzczNzQyOWNiMDNkMzZiZjQ2MmQ5ZjRkMTEiLAogICAgICAgICJuYW1lIjogImV2YWxzL2V2YWxzLmp
zb24iLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9LAogICAgICB7CiAgICAgICAgImRpZ2VzdCI6ICI4NTlmMjI2OTkwMWY4YTU3MTYzMDJlODA4ZGQxYWI2MTljNWVkNDZmNjg4ZGZlODUxNGE2NDI0NmNiOTU0N2FlIiwKICAgICAgICAibmFtZSI6ICJyZWZlcmVuY2VzL2Jlc3RfcHJhY3RpY2VzLm1kIiwKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIKICAgICAgfSwKICAgICAgewogICAgICAgICJkaWdlc3QiOiAiYzQwZThkMzg5MTJjMGJmZWZlNGZjZjBlNTczMzhlMzZjOWNlMGFkMzNlMTgwYjYxNWE0ODRmNjJhNzllYmQ0ZSIsCiAgICAgICAgIm5hbWUiOiAicmVmZXJlbmNlcy9idWZmZXJfYXBpcy5tZCIsCiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiZGlnZXN0IjogIjU4NTA1Y2ViNjg3NWM0ZTJlYTY4YzNhYmE0NTRkNjJlMzg2NTM0YzYzMjMzZTA0MjMyMmY3MzFmYzlhZWM5MTMiLAogICAgICAgICJuYW1lIjogInJlZmVyZW5jZXMvZG9ja2VyX2NvbnRhaW5lcnMubWQiLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9LAogICAgICB7CiAgICAgICAgImRpZ2VzdCI6ICJkY2JkMTNhNGI2NGFjM2QyMTY2NTFkM2M1ODE4ZDRiNDI1NWU5NWFlZTE4MjBlMjdiMGUyN2Q2NGJjNTFlZDE4IiwKICAgICAgICAibmFtZSI6ICJyZWZlcmVuY2VzL2dzdHJlYW1lcl9wbHVnaW5zLm1kIiwKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIKICAgICAgfSwKICAgICAgewogICAgICAgICJkaWdlc3QiOiAiYmEwM2Q0OWUzOTRkNjQ3NDI4YzFjMjNkNDcyNGMwMzY1Y2JlZjM3Y2NiMDg1YzJmZmUxNGU5M2ZmOWVmNjUzZCIsCiAgICAgICAgIm5hbWUiOiAicmVmZXJlbmNlcy9rYWZrYV9tZXNzYWdpbmcubWQiLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9LAogICAgICB7CiAgICAgICAgImRpZ2VzdCI6ICJiOWUxZTExZmQ5YWFjMTg0NDI0NzI0ZGY5MDFiYTdhOTc1ODRiZWMwNzIyZTc0MzA5MDA3YmRlOWQ4YmZhZDkzIiwKICAgICAgICAibmFtZSI6ICJyZWZlcmVuY2VzL21lZGlhX2V4dHJhY3Rvcl9hZHZhbmNlZC5tZCIsCiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiZGlnZXN0IjogImQ2MzdlZTk2YzU3MmJkNDkyNGYwOWFlOWIyOTdmMGI2MWZiNDg3OTRmZjY5MjU0ZWVhOWRjNTI2NDUxZWE5NzciLAogICAgICAgICJuYW1lIjogInJlZmVyZW5jZXMvbWV0YW11eF9jb25maWcubWQiLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9LAogICAgICB7CiAgICAgICAgImRpZ2VzdCI6ICI1ZTY3MmEzNWNmNzE1YzQ2M2FmYzU2MWQ2MzhmOTA3OTFmM2M5NTZkNjVlNjkzZDRkODRjMmM2NTNkYzA2MjcwIiwKICAgICAgICAibmFtZSI6ICJyZWZlcmVuY2VzL252aW5mZXJfY29uZmlnLm1kIiwKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIKICA
gICAgfSwKICAgICAgewogICAgICAgICJkaWdlc3QiOiAiYzMwM2FmNDZlNzlhZTJkMGRjOGVkYTYxYzZhYWIyYzU2MWQ2N2Q2NjUxOTQyZDM3MTIwNWE2MTE5Nzk2MjViYyIsCiAgICAgICAgIm5hbWUiOiAicmVmZXJlbmNlcy9yZXN0X2FwaV9keW5hbWljLm1kIiwKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIKICAgICAgfSwKICAgICAgewogICAgICAgICJkaWdlc3QiOiAiMjg5YTYyZTYyYzE3OWIxNmIwMDQxNGIyNDkwNDUxMDIzNDVjZTk2YzNiMzc0MzJmNGQ3MjNmY2JmYjE0NmFkMCIsCiAgICAgICAgIm5hbWUiOiAicmVmZXJlbmNlcy9zZXJ2aWNlX21ha2VyX2FwaS5tZCIsCiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiZGlnZXN0IjogIjhlZmY5MDYzMGJmYWZiMjFlYmU4MGY2MDEyZWY1ODFjMDQzZGNmYzEzNTg5OThlMTU1Mjk0ZWM2OWQxYzM5NDQiLAogICAgICAgICJuYW1lIjogInJlZmVyZW5jZXMvdHJhY2tlcl9jb25maWcubWQiLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9LAogICAgICB7CiAgICAgICAgImRpZ2VzdCI6ICIzYzNmZjExYmUyOTI4ZWQ0NmQwZmRhMDdiNWFhOGE0MTEwNDRlMGM1NzQ4NzQ0MGRjNDRmNDI5ZTU1YTI5MWE5IiwKICAgICAgICAibmFtZSI6ICJyZWZlcmVuY2VzL3Ryb3VibGVzaG9vdGluZy5tZCIsCiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiZGlnZXN0IjogIjgyMGFjODA0ZjRjMDFiMDliNTcxMGUyMTRmM2RkN2ZhODdhMTAxNWY0NDdjNGEzYWMwMWM3ZWQyMWJlMGIwNjciLAogICAgICAgICJuYW1lIjogInJlZmVyZW5jZXMvdXNlX2Nhc2VzX3BpcGVsaW5lcy5tZCIsCiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiZGlnZXN0IjogImE4M2UxMjk4MWM2OTczMmE0MWUzYjVjNjMzMjhhYzJjODI2OTIwNWFmNDQ4YjAzMDk3MzU2ZDkyNjhjZGY2MTUiLAogICAgICAgICJuYW1lIjogInJlZmVyZW5jZXMvdXRpbGl0aWVzX2NvbmZpZy5tZCIsCiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiZGlnZXN0IjogIjdmNjA2ZDg4MzVkYzIyYWIzODMzYjU2ZDA4MzlkZmRjODMyZGEyYjUzZTJjZWQzY2ZmOTg5NmZmODFjNDI0YjAiLAogICAgICAgICJuYW1lIjogInNraWxsLWNhcmQubWQiLAogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IgogICAgICB9CiAgICBdLAogICAgInNlcmlhbGl6YXRpb24iOiB7CiAgICAgICJhbGxvd19zeW1saW5rcyI6IGZhbHNlLAogICAgICAiaWdub3JlX3BhdGhzIjogWwogICAgICAgICIuZ2l0aHViIiwKICAgICAgICAiLmdpdGF0dHJpYnV0ZXMiLAogICAgICAgICIuZ2l0IiwKICAgICAgICAiLmdpdGlnbm9yZSIKICAgICAgXSwKICAgICAgIm1ldGhvZCI6ICJmaWxlcyIsCiAgICAgICJoYXNoX3R5cGUiOiAic2hhMjU2Igo
gICAgfQogIH0KfQ==","payloadType":"application/vnd.in-toto+json","signatures":[{"sig":"MGQCMCO7XGi15fvJv/NPbD5FOBWGtz1lLV//y4fxejoTJk7CwAU5fNKwWtAZphRZWOVVdAIwbj6ly64Y/KFZAomHESFVqgc9cC8g71fcZjXn8Dhc6leT1DbEJ72HaAXaXj2THFeo","keyid":""}]}}
\ No newline at end of file
diff --git a/skills/deepstream/deepstream-import-vision-model/BENCHMARK.md b/skills/deepstream/deepstream-import-vision-model/BENCHMARK.md
deleted file mode 100644
index 3df866f6..00000000
--- a/skills/deepstream/deepstream-import-vision-model/BENCHMARK.md
+++ /dev/null
@@ -1,112 +0,0 @@
-# Evaluation Report
-
-Evaluation of the `deepstream-import-vision-model` skill before publication through NVSkills-Eval.
-
-This benchmark summarizes the results of the NVSkills-Eval 3-tier evaluation of the skill. The goal is to document whether the skill is safe, discoverable, effective, and useful for agents before it is published for broader workflow use.
-
-## Evaluation Summary
-
-- Skill: `deepstream-import-vision-model`
-- Evaluation date: 2026-05-28
-- NVSkills-Eval profile: `external`
-- Environment: `local`
-- Dataset: 5 evaluation tasks
-- Attempts per task: 2
-- Pass threshold: 50%
-- Overall verdict: FAIL
-
-## Agents Used
-
-- `claude-code`
-- `codex`
-
-## Metrics Used
-
-Reported benchmark dimensions:
-
-- Security: checks whether skill-assisted execution avoids unsafe behavior such as secret leakage, destructive commands, or unauthorized access.
-- Correctness: checks whether the agent follows the expected workflow and produces the correct final output.
-- Discoverability: checks whether the agent loads the skill when relevant and avoids using it when irrelevant.
-- Effectiveness: checks whether the agent performs measurably better with the skill than without it.
-- Efficiency: checks whether the agent uses fewer tokens and avoids redundant work.
-
-Underlying evaluation signals used in this run:
-
-- `skill_execution` (Skill Execution): verifies that the agent loaded the expected skill and workflow.
-- `skill_efficiency` (Efficiency): checks routing quality, decoy avoidance, and redundant tool usage.
-- `accuracy` (Accuracy): grades final-answer correctness against the reference answer.
-- `goal_accuracy` (Goal Accuracy): checks whether the overall user task completed successfully.
-- `behavior_check` (Behavior Check): verifies expected behavior steps, including safety expectations.
-- `token_efficiency` (Token Efficiency): compares token usage with and without the skill.
-
-## Test Tasks
-
-The benchmark dataset contained 5 evaluation tasks:
-
-- Positive tasks: 3 tasks where the skill was expected to activate.
-- Negative tasks: 2 tasks where no skill was expected.
-- Unlabeled tasks: 0 tasks where positive/negative intent could not be inferred.
-
-Task composition is derived from the evaluation dataset when possible. Entries with `expected_skill` set are treated as positive skill-activation cases, while entries with `expected_skill: null` are treated as negative activation cases.
-
-## Results
-
-| Dimension | Num | `claude-code` | `codex` |
-|---|---:|---:|---:|
-| Security | 8 | 68% (+13%) | 72% (+18%) |
-| Correctness | 8 | 83% (-2%) | 89% (+13%) |
-| Discoverability | 8 | 61% (+0%) | 80% (+1%) |
-| Effectiveness | 8 | 80% (+2%) | 81% (+17%) |
-| Efficiency | 8 | 52% (+2%) | 70% (+2%) |
-
-Score values show skill-assisted performance. Values in parentheses show uplift versus the no-skill baseline when baseline data is available.
-
-## Tier 1: Static Validation Summary
-
-Tier 1 validation passed with observations. NVSkills-Eval ran 9 checks and found 12 total findings.
-
-Top findings:
-
-- MEDIUM QUALITY/quality_correctness: SKILL_SPEC recommended field missing: 'metadata.tags' (`skills/deepstream-import-vision-model/SKILL.md`)
-- MEDIUM SCHEMA/body_recommended_section: Missing recommended section: '## Instructions' (`skills/deepstream-import-vision-model/SKILL.md`)
-- MEDIUM SCHEMA/body_recommended_section: Missing recommended section: '## Examples' (`skills/deepstream-import-vision-model/SKILL.md`)
-- LOW QUALITY/quality_discoverability: Description very long (285 chars, recommend 50-150) (`skills/deepstream-import-vision-model/SKILL.md`)
-- LOW QUALITY/quality_discoverability: No '## Purpose' section (`skills/deepstream-import-vision-model/SKILL.md`)
-
-## Tier 2: Deduplication Summary
-
-Tier 2 validation reported findings. NVSkills-Eval ran 2 checks and found 7 total findings.
-
-Top findings:
-
-- HIGH DUPLICATE/duplicate: Duplicate content found across scripts/deepstream/benchmark-ds.sh and scripts/deepstream/ds-kitti-dump.sh and scripts/deepstream/ds-perf-run.sh and scripts/deepstream/ds-single-stream.sh and scripts/deepstream/ds-sweep.sh and scripts/deepstream/extract-frame.sh and scripts/engine/benchmark-trtexec.sh and scripts/model/cleanup.sh and scripts/model/hf-download-config.sh and scripts/model/hf-list-files.sh and scripts/model/ngc-download.sh and scripts/model/ngc-list-files.sh and scripts/model/safetensors-to-onnx.sh and scripts/report/md-to-pdf.sh:
-  "(comment)" in scripts/deepstream/benchmark-ds.sh (lines 3-16)
-  vs "(comment)" in scripts/deepstream/ds-kitti-dump.sh (lines 3-16)
-  vs "(comment)" in scripts/deepstream/ds-perf-run.sh (lines 3-16)
-  vs "(comment)" in scripts/deepstream/ds-single-stream.sh (lines 3-16)
-  vs "(comment)" in scripts/deepstream/ds-sweep.sh (lines 3-16)
-  vs "(comment)" in scripts/deepstream/extract-frame.sh (lines 3-16)
-  vs "(comment)" in scripts/engine/benchmark-trtexec.sh (lines 3-16)
-  vs "(comment)" in scripts/model/cleanup.sh (lines 3-16)
-  vs "(comment)" in scripts/model/hf-download-config.sh (lines 3-16)
-  vs "(comment)" in scripts/model/hf-list-files.sh (lines 3-16)
-  vs "(comment)" in scripts/model/ngc-download.sh (lines 3-16)
-  vs "(comment)" in scripts/model/ngc-list-files.sh (lines 3-16)
-  vs "(comment)" in scripts/model/safetensors-to-onnx.sh (lines 3-16)
-  vs "(comment)" in scripts/report/md-to-pdf.sh (lines 3-16) (`scripts/deepstream/benchmark-ds.sh:3`)
-- HIGH DUPLICATE/duplicate: Duplicate content found within references/pipeline-run.md:
-  "# Hard constraint: num_streams <= engine max batch size — always" in references/pipeline-run.md (lines 437-442)
-  vs "# Hard constraint: num_streams <= engine max batch size — always" in references/pipeline-run.md (lines 458-463) (`references/pipeline-run.md:437`)
-- HIGH DUPLICATE/duplicate: Duplicate content found across references/report-generation.md and scripts/deepstream/ds-perf-run.sh:
-  "# Capture stream-0 instantaneous FPS (\K after `**PERF:`) — 1 value per line — so" in references/report-generation.md (lines 136-136)
-  vs "(comment)" in scripts/deepstream/ds-perf-run.sh (lines 131-134) (`references/report-generation.md:136`)
-- HIGH DUPLICATE/duplicate: Duplicate content found within references/pipeline-run.md:
-  "# 2=DeepStream NMS (dense heads: YOLO, SSD). Use 4 if engine has fused NMS output" in references/pipeline-run.md (lines 225-244)
-  vs "# 2=DeepStream NMS (dense heads: YOLO, SSD). Use 4 if engine has fused NMS output" in references/pipeline-run.md (lines 401-414) (`references/pipeline-run.md:225`)
-- HIGH DUPLICATE/duplicate: Duplicate content found within references/model-acquire.md:
-  "#### 2b-vi: onnxsim — Run After Export When Needed" in references/model-acquire.md (lines 273-282)
-  vs "# Use the _sim.onnx for engine building if the original triggers ForeignNode errors" in references/model-acquire.md (lines 283-287) (`references/model-acquire.md:273`)
-
-## Publication Recommendation
-
-The skill should be reviewed before NVSkills-Eval publication. Skill owners should address the findings above and rerun NVSkills-Eval to refresh this benchmark.
diff --git a/skills/deepstream/deepstream-import-vision-model/SKILL.md b/skills/deepstream/deepstream-import-vision-model/SKILL.md
deleted file mode 100644
index 12705029..00000000
--- a/skills/deepstream/deepstream-import-vision-model/SKILL.md
+++ /dev/null
@@ -1,179 +0,0 @@
----
-name: deepstream-import-vision-model
-description: >
-  Use this skill to bring any vision model from HuggingFace or NVIDIA NGC into
-  an NVIDIA DeepStream pipeline with end-to-end automation: ONNX download,
-  SafeTensors export, TRT engine build, custom nvinfer bbox parser, multi-stream
-  benchmark, and PDF report. Object detection models only.
-license: CC-BY-4.0 AND Apache-2.0
-metadata:
-  author: NVIDIA CORPORATION
-  version: 1.2.1
----
-
-# DeepStream Import Vision Model
-
-When this skill is active, **read the relevant reference document before starting each phase**. Do not rely on memory — reference documents contain exact script paths, bash variable conventions, log filename contracts, and critical parsing rules.
-
-**Current scope:** Object detection models only. Fail fast on classification, segmentation, or other architectures detected in `config.json`.
-
-## Pipeline Overview
-
-| Step | Phase | Reference | What it does |
-|------|-------|-----------|--------------|
-| 1–3 | Model Acquire | [references/model-acquire.md](references/model-acquire.md) | Browse HF/NGC, detect format, download ONNX or export SafeTensors |
-| 4–5 | Engine Build  | [references/engine-build.md](references/engine-build.md) | Build dynamic TRT engine, run trtexec BS=1 and BS=MAX_BS |
-| 6–7 | DS Pipeline   | [references/pipeline-run.md](references/pipeline-run.md) | Custom bbox parser, nvinfer config, single-stream + multi-stream benchmarks |
-| 8   | Report        | [references/report-generation.md](references/report-generation.md) | 5 charts, HTML, PDF benchmark report |
-
-Run the full pipeline autonomously without pausing for confirmation at each step.
-
-## Pre-flight Checks
-
-Run before starting:
-
-```bash
-# 1. GPU and drivers
-nvidia-smi
-
-# 2. TensorRT version match (must match between builder and DS runtime)
-trtexec 2>&1 | head -3
-dpkg -l | grep libnvinfer-bin
-
-# 3. Shared Python venv — create once, reuse across all models
-mkdir -p build
-VENV=build/.venv_optimum
-if [ ! -x "$VENV/bin/python3" ]; then
-  python3 -m venv "$VENV"
-  "$VENV/bin/pip" install --upgrade pip -q
-  "$VENV/bin/pip" install "optimum[exporters]>=1.20,<2.0" "torch<2.12" \
-    transformers onnxruntime matplotlib numpy markdown -q
-fi
-
-# 4. System tools
-which wkhtmltopdf || apt-get install -y wkhtmltopdf
-which mediainfo    || apt-get install -y mediainfo
-which deepstream-app  # required for KITTI dump (Step 6g) and benchmark perf-measurement (Step 7c); shipped with DeepStream SDK
-
-# 5. Sample video — only check default path when user has not provided a custom DS_VIDEO
-if [ -z "$DS_VIDEO" ]; then
-  [ -f /opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4 ] || \
-    echo "WARNING: sample_720p.mp4 not found. Install DeepStream samples or set DS_VIDEO=/path/to/your.mp4"
-fi
-```
-
-## Mandatory Output Structure
-
-Create this tree as soon as `MODEL_NAME` is known (Step 1). Never write output files into a flat directory.
-
-```
-models/{model_name}/
-  model/           <- ONNX file(s)
-  parser/          <- .cpp, Makefile, .so
-  config/          <- nvinfer config, ds-app config, labels.txt
-  scripts/         <- run helper scripts
-  benchmarks/
-    engines/       <- _dynamic_b{MAX_BS}.engine, timing.cache, build logs
-    b1/            <- trtexec BS=1 log
-    b{MAX_BS}/     <- trtexec BS=MAX_BS log
-    ds/            <- DS benchmark logs
-  reports/         <- benchmark_report.md, .html, .pdf, benchmark_data.json
-    charts/        <- chart_*.png (5 charts)
-  samples/         <- output .mp4 or .ogv (theoraenc fallback), test frames
-    kitti_output/  <- KITTI detection .txt files
-```
-
-```bash
-mkdir -p models/$MODEL_NAME/{model,parser,config,scripts,benchmarks/engines,benchmarks/ds,reports/charts,samples/kitti_output}
-```
-
-## Critical Rules
-
-1. **Engine naming** — always `{model}_dynamic_b{MAX_BS}.engine`. Never bare `model_dynamic.engine`.
-2. **batch_size == num_streams** — in DS runs, `batch-size` and stream count are always equal.
-3. **Log filenames are fixed** — `trtexec_b1.log`, `trtexec_b${MAX_BS}.log`, `ds_s${N}_run1.log`, `ds_s${N}_run2.log`. No timestamps. Report generation reads exact paths.
-4. **Parser zero-init** — always `NvDsInferObjectDetectionInfo obj = {};`. Required for DS 9.0 OBB support; bare `obj;` leaves `rotation_angle` uninitialized, causing tilted bounding boxes.
-5. **KITTI validation gate** — do NOT proceed to Step 7 if KITTI frame count is zero or detection rate < 90%.
-6. **Shared venv** — `build/.venv_optimum` reused across all models. Never create per-model venvs.
-7. **trtexec `--noDataTransfers`** — GPU-only compute matches DeepStream's GPU-to-GPU data flow.
-8. **Report HTML+PDF** — always use `skills/deepstream-import-vision-model/scripts/report/md-to-html-pdf.py`. Never write a custom HTML generator or call `wkhtmltopdf` directly.
-9. **Object detection only** — reject non-detection architectures from `config.json` before building anything.
-10. **Encoder fallback (MANDATORY)** — `x264enc` and `openh264enc` are **prohibited**. On NVENC-unavailable systems, use `theoraenc + oggmux` (LGPL; ships in gst-plugins-base; output is `.ogv`). If `theoraenc`/`oggmux` are absent, skip video creation (`DS_SINGLE_STREAM_MODE=skipped`). Report which mode was used: `nvv4l2h264enc` / `theoraenc-fallback` / `skipped`.
-11. **Video source (MANDATORY)** — default is always `sample_720p.mp4` (1280×720). Never autonomously substitute `sample_1080p_h264.mp4` or any other file. Only use a different video when the user explicitly provides a path (via `DS_VIDEO` env var or script argument).
-
-## Pipeline Timing
-
-Wrap every step:
-
-```bash
-STEP_START=$(date +%s.%N)
-# ... step commands ...
-STEP_END=$(date +%s.%N)
-STEP_DURATION=$(echo "$STEP_END - $STEP_START" | bc)
-echo "[Step N] completed in ${STEP_DURATION}s"
-```
-
-Track `PIPELINE_START` (before Step 1) and `PIPELINE_END` (after Step 8). Report all durations in the benchmark report.
-
-## Report Output (MANDATORY — all 3 formats)
-
-1. `benchmark_report.md` — markdown source (12 mandatory sections)
-2. `benchmark_report.html` — styled HTML (charts base64-inlined, no local file access)
-3. `benchmark_report_{model_name}.pdf` — via `md-to-html-pdf.py`; verify charts are embedded by counting `data:image/png` occurrences in the HTML output: `grep -o 'data:image/png' benchmark_report.html | wc -l` should equal 5
-
-Run charts and report scripts with the shared venv active: `source build/.venv_optimum/bin/activate`.
-
-## Reference Documents
-
-**IMPORTANT**: Read the relevant reference before starting each phase. Do NOT generate code from memory.
-
-| Document | Use When |
-|----------|----------|
-| [references/model-acquire.md](references/model-acquire.md) | Steps 1–3: HF/NGC URL parsing, format detection, ONNX download, SafeTensors export, label extraction |
-| [references/engine-build.md](references/engine-build.md) | Steps 4–5: trtexec engine build, benchmarks, PEAK_GPU_STREAMS derivation, iterative scaling |
-| [references/pipeline-run.md](references/pipeline-run.md) | Steps 6–7: custom bbox parser, nvinfer config, single-stream validation, KITTI dump, multi-stream benchmark |
-| [references/report-generation.md](references/report-generation.md) | Step 8: benchmark_data.json, 5 charts, 12-section markdown report, HTML + PDF |
-
-## Scripts
-
-Located in `scripts/`.
-
-| Script | Phase | Purpose |
-|--------|-------|---------|
-| `model/hf-list-files.sh` | 1–3 | List HuggingFace repo files |
-| `model/hf-download-config.sh` | 1–3 | Download config.json from HF |
-| `model/ngc-list-files.sh` | 1–3 | List NGC model files |
-| `model/ngc-download.sh` | 1–3 | Download NGC model archive |
-| `model/safetensors-to-onnx.sh` | 1–3 | Export SafeTensors → ONNX via optimum-cli |
-| `model/inspect-onnx.py` | 1–5 | Inspect ONNX input/output shapes |
-| `model/make-static-batch-onnx.py` | 4–5 | Bake batch dim into ONNX |
-| `model/cleanup.sh` | Any | Remove staging dirs, preserve shared venv |
-| `engine/benchmark-trtexec.sh` | 4–5 | Run trtexec with standard flags |
-| `deepstream/ds-single-stream.sh` | 6–7 | Single-stream visual validation (NVENC primary; theoraenc+oggmux fallback; skip if neither) |
-| `deepstream/ds-sweep.sh` | 6–7 | 2-phase batch size sweep |
-| `deepstream/benchmark-ds.sh` | 6–7 | Fixed-stream DS benchmark |
-| `deepstream/ds-kitti-dump.sh` | 6–7 | KITTI detection dump via deepstream-app |
-| `deepstream/ds-perf-run.sh` | 7 | Step 7c two-run benchmark — wraps `deepstream-app` with `enable-perf-measurement=1`, writes fixed-name log for the report parser |
-| `deepstream/extract-frame.sh` | 6–7 | Extract sample frames from output video (`.mp4` NVENC path or `.ogv` theoraenc fallback) |
-| `report/generate-benchmark-charts.py` | 8 | Generate 5 benchmark PNG charts |
-| `report/md-to-html-pdf.py` | 8 | Markdown → styled HTML → PDF (canonical benchmark report path) |
-| `report/md-to-pdf.sh` | Any | Markdown → PDF via pandoc/pdflatex — for design docs and references only, NOT for benchmark reports (use md-to-html-pdf.py for those) |
-| `report/report-style.css` | 8 | CSS for HTML report |
-| `report/render-mermaid-for-pdf.py` | 8 | Mermaid diagram → PNG |
-| `report/mermaid-puppeteer.json` | 8 | Vetted Puppeteer config for Mermaid (sandboxed; non-root) |
-| `report/mermaid-puppeteer-root.json` | 8 | Vetted Puppeteer config for Mermaid (used when running as root) |
-
-## Quick Error Reference
-
-| Error | Fix |
-|-------|-----|
-| Tilted/diagonal bounding boxes | Parser struct not zero-initialized — use `NvDsInferObjectDetectionInfo obj = {};` |
-| Zero KITTI files | `gie-kitti-output-dir` not read by nvinfer — use `ds-kitti-dump.sh` (wraps `deepstream-app`) |
-| Engine rebuilds every DS run | `model-engine-file` path wrong — check relative path from `config/` dir |
-| `setDimensions` negative dims | Add `infer-dims=3;H;W` to nvinfer config for dynamic ONNX models |
-| `--memPoolSize` workspace 0.03 MiB | Use `M` suffix not `MiB` — e.g. `--memPoolSize=workspace:32768M` |
-| ForeignNode build failure (DETR) | Use dynamo export path or run `onnxsim` — see references/engine-build.md |
-| Zero detections | Wrong `net-scale-factor` — check model family table in references/pipeline-run.md |
-| `No module named 'pyservicemaker'` | Install into venv: `pip install /opt/nvidia/deepstream/.../pyservicemaker*.whl` |
-
-<!-- Signing refresh marker.  -->
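Note on Critical Rule 5 in the deleted `SKILL.md` above: the rule gates Step 7 on the KITTI dump (zero frames, or a detection rate below 90%, stops the pipeline). The following is a minimal sketch of such a gate, not the skill's own implementation. It assumes one KITTI `.txt` file per frame under `samples/kitti_output/`, and it assumes "detection rate" means the fraction of frames whose file contains at least one detection line. The function name `kitti_gate` and the synthetic demo files are hypothetical.

```python
from pathlib import Path
import tempfile


def kitti_gate(kitti_dir, min_rate=0.90):
    """Return (ok, frames, rate) for a directory of per-frame KITTI .txt files.

    Assumption: a frame counts as "detected" when its file has at least one
    non-blank line. The skill's scripts may define the rate differently.
    """
    files = sorted(Path(kitti_dir).glob("*.txt"))
    frames = len(files)
    if frames == 0:
        # Zero KITTI files always fails the gate.
        return False, 0, 0.0
    detected = sum(1 for f in files if f.read_text().strip())
    rate = detected / frames
    return rate >= min_rate, frames, rate


# Demo on synthetic data: 8 of 10 frames carry a detection line.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for i in range(10):
        line = "car 0.0 0 0.0 10 20 110 220 0 0 0 0 0 0 0 0.9\n" if i < 8 else ""
        (root / f"frame_{i:04d}.txt").write_text(line)
    empty_dir = root / "empty"
    empty_dir.mkdir()
    demo_ok, demo_frames, demo_rate = kitti_gate(root)
    empty_ok, empty_frames, empty_rate = kitti_gate(empty_dir)

print(demo_ok, demo_frames, demo_rate)
print(empty_ok, empty_frames, empty_rate)
```

In a pipeline run, a `False` first element would stop execution before the multi-stream benchmark instead of producing a report from an unvalidated detector.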
diff --git a/skills/deepstream/deepstream-import-vision-model/evals/evals.json b/skills/deepstream/deepstream-import-vision-model/evals/evals.json
deleted file mode 100644
index 69251089..00000000
--- a/skills/deepstream/deepstream-import-vision-model/evals/evals.json
+++ /dev/null
@@ -1,71 +0,0 @@
-[
-  {
-    "id": "deepstream-import-vision-model-001",
-    "question": "I want to import a HuggingFace object detection model into DeepStream. Describe the end-to-end workflow this skill should follow, including model acquisition, engine build, DeepStream validation, benchmarking, and report generation.",
-    "expected_skill": "deepstream-import-vision-model",
-    "expected_script": null,
-    "ground_truth": "The response should use the import-model workflow: inspect or download model assets, reject unsupported non-detection architectures, export or use ONNX, build TensorRT engines, create parser and nvinfer config, validate with a single-stream DeepStream run and KITTI output, run multi-stream benchmarks, and generate markdown, HTML, and PDF benchmark reports.",
-    "expected_behavior": [
-      "Read the relevant reference document before each phase rather than relying on memory.",
-      "Use the mandatory models/{model_name}/ directory structure.",
-      "Handle HuggingFace or NGC model acquisition and detect unsupported non-detection architectures early.",
-      "Build TensorRT engines with the prescribed naming pattern.",
-      "Run DeepStream validation before benchmarking.",
-      "Generate benchmark_report.md, benchmark_report.html, and benchmark_report_{model_name}.pdf."
-    ]
-  },
-  {
-    "id": "deepstream-import-vision-model-002",
-    "question": "A YOLO object detection model exported from HuggingFace has dynamic ONNX dimensions. Explain how to build and configure it for DeepStream so the engine and nvinfer config are stable.",
-    "expected_skill": "deepstream-import-vision-model",
-    "expected_script": null,
-    "ground_truth": "The answer should inspect the ONNX model, create a static batch variant if needed, build TensorRT engines with batch-specific names, set infer-dims in the nvinfer config, use DeepStream NMS for pre-NMS YOLO outputs, and keep batch-size equal to the number of streams during DeepStream runs.",
-    "expected_behavior": [
-      "Inspect ONNX input and output shapes before engine build.",
-      "Create or use a static batch ONNX when dynamic dimensions would break TensorRT or DeepStream.",
-      "Name engines as {model}_dynamic_b{MAX_BS}.engine.",
-      "Set infer-dims to the explicit C;H;W input dimensions.",
-      "Use cluster-mode 2 for dense pre-NMS YOLO-style outputs.",
-      "Keep DeepStream batch-size equal to the number of input streams."
-    ]
-  },
-  {
-    "id": "deepstream-import-vision-model-003",
-    "question": "During DeepStream validation for an imported detector, KITTI output has zero frames and NVENC is unavailable on the system. What should the skill do before producing a benchmark report?",
-    "expected_skill": "deepstream-import-vision-model",
-    "expected_script": null,
-    "ground_truth": "The skill should fail or stop before Step 7 when KITTI validation has zero frames or detection rate is below the threshold. For video output, it should use nvv4l2h264enc when available, fall back to theoraenc plus oggmux when NVENC is unavailable, or skip video creation if neither path is available, then report which mode was used.",
-    "expected_behavior": [
-      "Do not proceed to multi-stream benchmarking when KITTI frame count is zero.",
-      "Treat detection rate below 90 percent as a validation gate failure.",
-      "Do not use x264enc or openh264enc.",
-      "Use theoraenc plus oggmux as the fallback when NVENC is unavailable.",
-      "Skip video creation if neither NVENC nor theora fallback is available.",
-      "Report the selected video mode in the benchmark output."
-    ]
-  },
-  {
-    "id": "deepstream-import-vision-model-004-negative",
-    "question": "Optimize SQL queries for a PostgreSQL reporting dashboard and add Redis caching. No model import or DeepStream runtime changes are needed.",
-    "expected_skill": null,
-    "expected_script": null,
-    "ground_truth": "The deepstream-import-vision-model skill should not be selected because the request is unrelated to model acquisition, TensorRT build, or DeepStream pipeline validation.",
-    "expected_behavior": [
-      "Do not activate deepstream-import-vision-model for this request.",
-      "Avoid model import, TensorRT, and DeepStream benchmarking instructions.",
-      "Respond with a generic fallback or suggest a relevant database-focused workflow."
-    ]
-  },
-  {
-    "id": "deepstream-import-vision-model-005-negative",
-    "question": "How can I fine-tune a BERT model for sentiment analysis on my own dataset?",
-    "expected_skill": null,
-    "expected_script": null,
-    "ground_truth": "The deepstream-import-vision-model skill should not be selected because this request is unrelated to DeepStream object-detection model import or TensorRT/benchmark workflow.",
-    "expected_behavior": [
-      "Do not activate deepstream-import-vision-model for this request.",
-      "State that this is outside the DeepStream import-vision-model scope.",
-      "Suggest a relevant NLP model fine-tuning path instead."
-    ]
-  }
-]
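Note on the deleted `evals.json` above: every entry uses the same six keys, and the two negative cases carry an `-negative` id suffix together with `expected_skill: null`. The following is a minimal consistency check built on those observations. The helper name `check_evals` and the trimmed sample entries are hypothetical, and the suffix rule is an assumption inferred from the five entries shown, not a documented NVSkills-Eval schema.

```python
import json

REQUIRED_KEYS = {
    "id", "question", "expected_skill", "expected_script",
    "ground_truth", "expected_behavior",
}


def check_evals(entries, skill_name):
    """Return a list of problem strings; an empty list means no issue was found."""
    problems = []
    seen = set()
    for entry in entries:
        eid = entry.get("id", "<missing id>")
        missing = REQUIRED_KEYS - entry.keys()
        if missing:
            problems.append(f"{eid}: missing keys {sorted(missing)}")
        if eid in seen:
            problems.append(f"{eid}: duplicate id")
        seen.add(eid)
        expected = entry.get("expected_skill")
        if eid.endswith("-negative"):
            # Assumed convention: negative cases must not select any skill.
            if expected is not None:
                problems.append(f"{eid}: negative case names a skill")
        elif expected != skill_name:
            problems.append(f"{eid}: expected_skill is {expected!r}")
    return problems


# Trimmed, hypothetical sample in the same shape as the file above.
sample = json.loads("""
[
  {"id": "deepstream-import-vision-model-001", "question": "q",
   "expected_skill": "deepstream-import-vision-model", "expected_script": null,
   "ground_truth": "g", "expected_behavior": ["b"]},
  {"id": "deepstream-import-vision-model-004-negative", "question": "q",
   "expected_skill": null, "expected_script": null,
   "ground_truth": "g", "expected_behavior": ["b"]}
]
""")

good = check_evals(sample, "deepstream-import-vision-model")
broken = check_evals([{"id": "x-negative", "expected_skill": "some-skill"}],
                     "deepstream-import-vision-model")
print(good)
print(len(broken))
```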
diff --git a/skills/deepstream/deepstream-import-vision-model/references/engine-build.md b/skills/deepstream/deepstream-import-vision-model/references/engine-build.md
deleted file mode 100644
index 4d99962b..00000000
--- a/skills/deepstream/deepstream-import-vision-model/references/engine-build.md
+++ /dev/null
@@ -1,318 +0,0 @@
-
-# NV Engine Build -- Steps 4-5
-
-Build a TensorRT engine from ONNX and derive PEAK_GPU_STREAMS for DeepStream sizing.
-
-The ONNX model path is: `$ARGUMENTS`
-
-## Pre-flight: Validate Inputs and Extract Variables
-
-Before anything else, derive all variables from `$ARGUMENTS` and verify the environment:
-```bash
-ONNX_PATH="$ARGUMENTS"
-
-# Derive MODEL_NAME from directory structure: [prefix/]models/{MODEL_NAME}/model/...
-MODEL_NAME=$(echo "$ONNX_PATH" | sed 's|.*models/\([^/]*\)/.*|\1|')
-
-# Derive MODEL_FILENAME as the ONNX basename without extension
-MODEL_FILENAME=$(basename "$ONNX_PATH" .onnx)
-
-# MAX_BS drives --optShapes, --maxShapes, and the engine filename postfix
-# Starting value is 64 — will double iteratively in Step 5 if PEAK_GPU_STREAMS > 64
-MAX_BS=64
-
-echo "Model:    $MODEL_NAME"
-echo "File:     $MODEL_FILENAME"
-echo "ONNX:     $ONNX_PATH"
-echo "Engine:   models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine"
-
-# Verify ONNX file exists
-ls -lh "$ONNX_PATH" || { echo "ERROR: ONNX file not found at $ONNX_PATH"; exit 1; }
-
-# Verify trtexec is available and check TRT version
-TRTEXEC=$(which trtexec) || { echo "ERROR: trtexec not found in PATH — install TensorRT or check PATH"; exit 1; }
-$TRTEXEC --help 2>&1 | head -3
-dpkg -l | grep libnvinfer-bin
-
-# Verify GPU is available
-nvidia-smi --query-gpu=name,memory.total --format=csv,noheader
-```
-
-If the ONNX file doesn't exist, inform the user to run Steps 1-3 first (see references/model-acquire.md).
-
-> All subsequent commands use `$MODEL_NAME`, `$MODEL_FILENAME`, `$MAX_BS`, and `$TRTEXEC` — never hardcoded paths or template placeholders.
-
-Inspect the ONNX model and auto-parse input name and spatial dimensions:
-```bash
-INSPECT_OUT=$(python3 skills/deepstream-import-vision-model/scripts/model/inspect-onnx.py "$ONNX_PATH")
-echo "$INSPECT_OUT"
-
-INPUT_NAME=$(echo "$INSPECT_OUT" | grep -oP 'input_name:\s*\K\S+')
-H=$(echo "$INSPECT_OUT"          | grep -oP 'height:\s*\K[0-9]+')
-W=$(echo "$INSPECT_OUT"          | grep -oP 'width:\s*\K[0-9]+')
-
-echo "INPUT_NAME=$INPUT_NAME  H=$H  W=$W"
-[ -z "$INPUT_NAME" ] && { echo "ERROR: could not parse INPUT_NAME from inspect output"; exit 1; }
-# If H/W are empty (dynamic spatial dims), set them manually before proceeding:
-#   H=640; W=640   # or whatever the model's expected input resolution is
-#   Check the model card on HuggingFace or config.json image_size field
-[ -z "$H" ] && { echo "ERROR: H not detected — model has dynamic spatial dims. Set H manually: H=<height>"; exit 1; }
-[ -z "$W" ] && { echo "ERROR: W not detected — model has dynamic spatial dims. Set W manually: W=<width>";  exit 1; }
-```
-
-## Step 4: Build TensorRT Engine
-
-Build one dynamic engine optimized for BS=64. `opt=max=64` ensures TRT optimizes kernels for
-the exact batch size used for benchmarking and DeepStream. `min=1` handles single-stream validation.
-
-```bash
-STEP4_START=$(date +%s.%N)
-TIMESTAMP=$(date +%Y%m%d_%H%M%S)
-# benchmarks/engines/ already exists from nv-model-acquire;
-# mkdir -p kept here as a safety net for standalone use
-mkdir -p models/$MODEL_NAME/benchmarks/engines models/$MODEL_NAME/benchmarks/b1 models/$MODEL_NAME/benchmarks/b${MAX_BS}
-
-$TRTEXEC \
-  --onnx="$ONNX_PATH" \
-  --minShapes=$INPUT_NAME:1x3x${H}x${W} \
-  --optShapes=$INPUT_NAME:${MAX_BS}x3x${H}x${W} \
-  --maxShapes=$INPUT_NAME:${MAX_BS}x3x${H}x${W} \
-  --fp16 \
-  --skipInference \
-  --memPoolSize=workspace:32768M \
-  --timingCacheFile=models/$MODEL_NAME/benchmarks/engines/timing.cache \
-  --saveEngine="models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine" \
-  2>&1 | tee models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_build_${TIMESTAMP}.log
-
-# Verify engine was created — trtexec exit code is lost through the pipe, so check the file
-[ -f "models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine" ] || \
-  { echo "ERROR: Engine file not created — check build log for errors"; exit 1; }
-
-STEP4_END=$(date +%s.%N)
-STEP4_DURATION=$(echo "$STEP4_END - $STEP4_START" | bc)
-echo "[Step 4] Engine build completed in ${STEP4_DURATION}s"
-```
-
-Set the ENGINE variable — used by all subsequent trtexec and DeepStream runs:
-```bash
-ENGINE="models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine"
-```
-
-## Step 5: Benchmark — 2 Runs Only
-
-Run exactly **2 trtexec benchmarks** using the Step 4 engine. No sweep needed.
-- BS=1 → latency baseline (single-stream worst case)
-- BS=64 → peak throughput → `PEAK_GPU_STREAMS`
-
-```bash
-STEP5_START=$(date +%s.%N)
-```
-
-### Run 5a — Latency baseline (BS=1)
-
-> Log filename is **fixed** — no timestamp, no variation. Always `trtexec_b1.log`. This ensures the nv-import-vision-model-report skill can find it with an exact path, not a wildcard.
-
-```bash
-$TRTEXEC \
-  --loadEngine="$ENGINE" \
-  --shapes=$INPUT_NAME:1x3x${H}x${W} \
-  --noDataTransfers --duration=10 --warmUp=1000 \
-  2>&1 | tee models/$MODEL_NAME/benchmarks/b1/trtexec_b1.log
-```
-
-### Run 5b — Peak throughput (BS=MAX_BS)
-
-> Log filename is **fixed**: always `trtexec_b${MAX_BS}.log`. If the iterative scaling loop raises `MAX_BS`, it writes a new log under the same naming pattern.
-
-```bash
-$TRTEXEC \
-  --loadEngine="$ENGINE" \
-  --shapes=$INPUT_NAME:${MAX_BS}x3x${H}x${W} \
-  --noDataTransfers --duration=10 --warmUp=1000 \
-  2>&1 | tee models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log
-```
-
-### Parse results and compute PEAK_GPU_STREAMS
-```bash
-QPS_BS1=$(grep -oP 'Throughput:\s*\K[0-9.]+' \
-  models/$MODEL_NAME/benchmarks/b1/trtexec_b1.log | tail -1)
-GPU_MEAN_BS1=$(grep -oP 'GPU Compute Time:.*mean = \K[0-9.]+' \
-  models/$MODEL_NAME/benchmarks/b1/trtexec_b1.log | tail -1)
-
-QPS_BS_MAX=$(grep -oP 'Throughput:\s*\K[0-9.]+' \
-  models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log | tail -1)
-GPU_MEAN_BS_MAX=$(grep -oP 'GPU Compute Time:.*mean = \K[0-9.]+' \
-  models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log | tail -1)
-GPU_P99_BS_MAX=$(grep -oP 'GPU Compute Time:.*percentile\(99%\) = \K[0-9.]+' \
-  models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log | tail -1)
-
-read IMGS_PER_SEC PEAK_GPU_STREAMS < <(python3 -c "
-import math
-imgs = float('$QPS_BS_MAX') * $MAX_BS
-streams = int(math.floor(imgs / 30))
-print(round(imgs, 2), streams)
-")
-
-echo "BS=1:       QPS=$QPS_BS1  GPU mean=${GPU_MEAN_BS1}ms"
-echo "BS=$MAX_BS: QPS=$QPS_BS_MAX  imgs/s=$IMGS_PER_SEC  GPU mean=${GPU_MEAN_BS_MAX}ms  P99=${GPU_P99_BS_MAX}ms"
-echo "PEAK_GPU_STREAMS=$PEAK_GPU_STREAMS  (floor($IMGS_PER_SEC / 30))"
-
-STEP5_END=$(date +%s.%N)
-STEP5_DURATION=$(echo "$STEP5_END - $STEP5_START" | bc)
-echo "[Step 5] Benchmarks completed in ${STEP5_DURATION}s"
-```
-
-`PEAK_GPU_STREAMS` is the GPU-only upper bound on real-time 30fps stream count. DeepStream will always achieve fewer streams due to NVDEC, mux, and GStreamer overhead (typically 10–40%). Use `PEAK_GPU_STREAMS` as the starting stream count for DS Run 1 (calibration).
-
-### Iterative Engine Scaling (PEAK_GPU_STREAMS > MAX_BS)
-
-If `PEAK_GPU_STREAMS > MAX_BS`, the engine's max batch size is the bottleneck — DeepStream cannot run more streams than `MAX_BS`. **Double MAX_BS and rebuild**, then re-run trtexec and recompute `PEAK_GPU_STREAMS`. Repeat until `PEAK_GPU_STREAMS ≤ MAX_BS`.
-
-**Why doubling, not jumping to PEAK directly**: Jumping from 64→512 based on an extrapolated projection wastes GPU memory if the projection was off. Doubling (64→128→256→512) makes incremental, verifiable steps — each trtexec run gives real throughput data before committing to a larger rebuild.
-
-```bash
-while [ "$PEAK_GPU_STREAMS" -gt "$MAX_BS" ]; do
-  NEW_MAX_BS=$(python3 -c "print($MAX_BS * 2)")  # STRICT DOUBLING — do not change to ceil(log2(PEAK))
-  echo "Rebuilding engine: PEAK_GPU_STREAMS=$PEAK_GPU_STREAMS > MAX_BS=$MAX_BS — doubling to: $NEW_MAX_BS"
-
-  mkdir -p models/$MODEL_NAME/benchmarks/b${NEW_MAX_BS}
-
-  $TRTEXEC \
-    --onnx="$ONNX_PATH" \
-    --minShapes=$INPUT_NAME:1x3x${H}x${W} \
-    --optShapes=$INPUT_NAME:${NEW_MAX_BS}x3x${H}x${W} \
-    --maxShapes=$INPUT_NAME:${NEW_MAX_BS}x3x${H}x${W} \
-    --fp16 --skipInference \
-    --memPoolSize=workspace:32768M \
-    --timingCacheFile=models/$MODEL_NAME/benchmarks/engines/timing.cache \
-    --saveEngine="models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${NEW_MAX_BS}.engine" \
-    2>&1 | tee models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_build_b${NEW_MAX_BS}_${TIMESTAMP}.log
-
-  [ -f "models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${NEW_MAX_BS}.engine" ] || \
-    { echo "ERROR: Engine b${NEW_MAX_BS} not created — check build log"; exit 1; }
-
-  # Update ENGINE and MAX_BS — re-run trtexec at new BS and recompute PEAK_GPU_STREAMS
-  ENGINE="models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${NEW_MAX_BS}.engine"
-  MAX_BS=$NEW_MAX_BS
-
-  $TRTEXEC \
-    --loadEngine="$ENGINE" \
-    --shapes=$INPUT_NAME:${MAX_BS}x3x${H}x${W} \
-    --noDataTransfers --duration=10 --warmUp=1000 \
-    2>&1 | tee models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log
-
-  QPS_BS_MAX=$(grep -oP 'Throughput:\s*\K[0-9.]+' \
-    models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log | tail -1)
-  GPU_MEAN_BS_MAX=$(grep -oP 'GPU Compute Time:.*mean = \K[0-9.]+' \
-    models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log | tail -1)
-  GPU_P99_BS_MAX=$(grep -oP 'GPU Compute Time:.*percentile\(99%\) = \K[0-9.]+' \
-    models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log | tail -1)
-  read IMGS_PER_SEC PEAK_GPU_STREAMS < <(python3 -c "
-import math
-imgs = float('$QPS_BS_MAX') * $MAX_BS
-print(round(imgs, 2), int(math.floor(imgs / 30)))
-")
-  echo "Recomputed: BS=$MAX_BS  imgs/s=$IMGS_PER_SEC  PEAK_GPU_STREAMS=$PEAK_GPU_STREAMS"
-done
-
-echo "PEAK_GPU_STREAMS ($PEAK_GPU_STREAMS) <= MAX_BS ($MAX_BS) — engine scaling complete."
-```
-
-**Engine count summary:**
-
-| Scenario | Example | Engines | trtexec runs |
-|----------|---------|---------|-------------|
-| PEAK_GPU_STREAMS ≤ 64 (transformer/large models) | RT-DETR, OWL-ViT | **1** (`b64`) | **2** |
-| PEAK_GPU_STREAMS > 64, ≤ 128 (mid models) | TrafficCamNet | **2** (`b64` + `b128`) | **3** |
-| PEAK_GPU_STREAMS > 128, ≤ 256 (fast models) | YOLO26n | **3** (`b64`+`b128`+`b256`) | **4** |
-| PEAK_GPU_STREAMS > 256 (very fast nano models) | — | **4+** (keep doubling) | **5+** |
-
-## trtexec Flags Reference
-
-### Recommended Flags
-| Flag | Purpose | When to use |
-|------|---------|-------------|
-| `--duration=10` | Longer run for stable numbers | All benchmark runs (5a, 5b) |
-| `--warmUp=1000` | 1s warmup before measurement | All benchmark runs (5a, 5b) |
-| `--noDataTransfers` | GPU-only compute (matches DS reality) | Always |
-
-### Why GPU-Only Timing (`--noDataTransfers`)
-In DeepStream, frames are decoded on GPU (`nvv4l2decoder`) and stay on GPU through `nvinfer` — no H2D transfer. Standard trtexec transfers synthetic data from host, which is not representative. Do NOT report H2D/D2H latency.
-
-### Flags That Do NOT Help (tested)
-| Flag | Result | Why |
-|------|--------|-----|
-| `--best` | No improvement | The engine is already built with `--fp16`; a runtime flag does not change precision |
-| `--exposeDMA` | **45% WORSE** throughput | Serializes DMA transfers — kills pipelining |
-| `--infStreams=4` | +2% QPS max | GPU already saturated |
-
-### Key Metrics to Report from trtexec
-- **Throughput (QPS)** and **Images/s** (QPS × batch_size)
-- **GPU Compute mean (ms)** and **GPU Compute P99 (ms)**
-- **GPU Compute per image (ms)** (GPU Compute mean / batch_size)
-- Do **NOT** report: H2D latency, D2H latency, Host Latency, transfer overhead
-
-## Engine Version Compatibility -- CRITICAL
-
-TensorRT engine files are **not portable** across TensorRT versions.
-
-### Pre-flight Version Check (MANDATORY before building engines)
-This check already runs in the pre-flight section at the top of this document, via `$TRTEXEC --help` and `dpkg -l | grep libnvinfer-bin`. Do not repeat it.
-
-### Docker vs Host Engine Builds
-Docker-built engines may silently fail at runtime when loaded by host DeepStream (symptom: 0% GPU, pipeline stuck). **Always build engines on the host** using the same `libnvinfer` version as DeepStream (`dpkg -l | grep libnvinfer-bin`). Never mix TRT versions between engine builder and runtime.
-
-## Known Issues and Workarounds
-
-### `--memPoolSize` Flag Format — `M` vs `MiB` (CRITICAL silent failure)
-
-- **Correct**: `--memPoolSize=workspace:32768M` (suffix `M` = Mebibytes)
-- **WRONG**: `--memPoolSize=workspace:32768MiB` — trtexec interprets `MiB` as bytes, so `32768MiB` becomes 32 KB. All tactics fail with "insufficient workspace". There is no parse warning; the only symptom is `Memory Pools: workspace: 0.03125 MiB` in the build log.
-- Valid suffixes: `B`, `K`, `M`, `G`, or no suffix (default MiB).
-
-### Deformable Attention Models (RT-DETR, DDETR, Deformable DETR)
-
-Models using `MultiscaleDeformableAttnPlugin_TRT` build correctly on TRT 10.16 **provided workspace is sufficient**.
-- **Required**: `--memPoolSize=workspace:32768M` (not the default 8GB) — deformable attention at BS=64 needs substantial workspace for ForeignNode fusion tactics.
-- `--builderOptimizationLevel=4` (default) works; do not lower it unless necessary.
-- Typical footprint at BS=64 on H100: activation ~4266 MiB, peak memory ~7809 MiB, build time ~825s. The compiler backend phase after engine generation can take 5-10 minutes with no log output — this is normal, not a hang.
-- Error "Could not find any implementation for node {ForeignNode[...]} due to insufficient workspace" is a genuine signal to raise the workspace.
-
-### DETR / DETR-family Backbone Mask ForeignNode Failure (TRT 10.16)
-
-HF-exported DETR/DDETR models contain a dynamic backbone mask path (`Cast → Resize → Sigmoid`) that TRT 10.16 fuses into a ForeignNode with no valid tactic: `"Could not find any implementation for node {ForeignNode[.../Cast_2.../Sigmoid]}"`.
-
-- **Preferred fix** (TRT 10.16.01+, PyTorch 2.11+, transformers 5.5+): use the dynamo export path with `torch.export.Dim("batch", min=1, max=N)` in `dynamic_shapes`. The dynamo exporter produces a different graph that does NOT trigger the ForeignNode failure. TRT converts it directly as a dynamic-batch engine.
-- **Fallback for older toolchains**: run `onnxsim.simplify(model, input_shapes={'pixel_values': [BS, 3, H, W]})` first. This folds the mask into constants but bakes batch size, requiring per-batch ONNX + engine files.
-- **Secondary workaround**: lower `builder_optimization_level` to 2 via the Python TRT API (`config.builder_optimization_level = 2`). Prevents over-aggressive fusion; engines built this way are still compatible with `trtexec --loadEngine`.
-
-### Dynamic Engine Batch-Size Anomalies (transformer models)
-
-Dynamic-shape engines for transformer models (DETR, RT-DETR) can show **non-monotonic throughput** — specific non-power-of-2 batch sizes (e.g., BS=17-19) perform dramatically worse than neighboring values. Cause: TRT tactic selection for attention layers at non-optimal shapes. When DS at `N` streams shows surprisingly low FPS, test `N±8` before concluding the GPU is saturated. Prefer power-of-2 batch sizes for production.
-
-## Output Summary
-
-```bash
-TOTAL_DURATION=$(echo "$STEP4_DURATION + $STEP5_DURATION" | bc)
-```
-
-When complete, print:
-```
-=== TRT Engine Build Complete ===
-Model:   $MODEL_NAME
-Engine:  models/$MODEL_NAME/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine
-         (single engine — used for trtexec baseline and all DS runs)
-
-trtexec Results:
-  BS=1:       $QPS_BS1 QPS  |  GPU mean: ${GPU_MEAN_BS1}ms
-  BS=$MAX_BS: $QPS_BS_MAX QPS  |  $IMGS_PER_SEC img/s  |  GPU mean: ${GPU_MEAN_BS_MAX}ms  P99: ${GPU_P99_BS_MAX}ms
-
-PEAK_GPU_STREAMS (GPU-only upper bound): $PEAK_GPU_STREAMS streams @30fps
-
-Timing:
-  Step 4 (engine build): ${STEP4_DURATION}s
-  Step 5 (benchmarks):   ${STEP5_DURATION}s
-  Total Steps 4-5:       ${TOTAL_DURATION}s
-
-Ready for: Steps 6-7 — read references/pipeline-run.md models/$MODEL_NAME/
-```
diff --git a/skills/deepstream/deepstream-import-vision-model/references/model-acquire.md b/skills/deepstream/deepstream-import-vision-model/references/model-acquire.md
deleted file mode 100644
index c5305afe..00000000
--- a/skills/deepstream/deepstream-import-vision-model/references/model-acquire.md
+++ /dev/null
@@ -1,419 +0,0 @@
-
-# NV Model Acquire — Steps 1-3
-
-Acquire an ONNX model from Hugging Face, creating the mandatory model folder structure.
-
-## MANDATORY: Model Folder Structure
-
-Create this layout at the start of Step 2 (once `$MODEL_NAME` is set by Step 1):
-```
-models/{model_name}/
-  model/       config/       parser/       scripts/
-  benchmarks/engines/
-  reports/charts/      samples/
-```
-```bash
-mkdir -p models/$MODEL_NAME/{model,parser,config,scripts,benchmarks/engines,reports/charts,samples}
-```
-Temporary staging dirs (`hf_model/`, `ngc_download/`, `build/`) are created inline where needed and cleaned up afterward — they are NOT part of this structure.
-
-## Step 1: Parse the Model Source URL
-
-Accept a model URL or ID in one of these formats and extract the required fields:
-
-```bash
-[ -z "$ARGUMENTS" ] && { echo "ERROR: No model URL or ID provided. Usage: /deepstream-import-vision-model <url>"; exit 1; }
-INPUT="${ARGUMENTS}"
-
-if echo "$INPUT" | grep -q "catalog.ngc.nvidia.com"; then
-  # NGC catalog URL
-  # e.g. https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/models/trafficcamnet_transformer_lite/files?version=deployable_resnet50_v2.0
-  MODEL_SOURCE="ngc"
-  NGC_ORG=$(echo "$INPUT"    | sed 's|.*/orgs/\([^/]*\)/.*|\1|')
-  NGC_TEAM=$(echo "$INPUT"   | sed 's|.*/teams/\([^/]*\)/.*|\1|')
-  MODEL_NAME=$(echo "$INPUT" | sed 's|.*/models/\([^/]*\)/.*|\1|')
-  NGC_VERSION=$(echo "$INPUT" | sed 's|.*version=\([^&]*\).*|\1|')
-  echo "Source: NGC  Org: $NGC_ORG  Team: $NGC_TEAM  Model: $MODEL_NAME  Version: $NGC_VERSION"
-else
-  # HuggingFace full URL or short ID (e.g. https://huggingface.co/onnx-community/yolov8n or onnx-community/yolov8n)
-  MODEL_SOURCE="hf"
-  SLUG=$(echo "$INPUT" | sed 's|https://huggingface.co/||' | sed 's|/resolve/.*||' | sed 's|/$||')
-  HF_ORG=$(echo "$SLUG"    | cut -d/ -f1)
-  MODEL_NAME=$(echo "$SLUG" | cut -d/ -f2)
-  echo "Source: HF  Org: $HF_ORG  Model: $MODEL_NAME"
-fi
-```
-
-- `MODEL_SOURCE` (`hf` or `ngc`) drives category selection in Step 2
-- `MODEL_NAME` is used as the folder name throughout (`models/{MODEL_NAME}/`)
-- Proceed to Step 2 with these variables set
-
-## Step 2: Detect Model Source and Format
-
-First, create the model directory structure (required for all sources), then route by source:
-```bash
-# Create permanent model directory structure (all sources — HF and NGC)
-mkdir -p models/$MODEL_NAME/{model,parser,config,scripts,benchmarks/engines,reports/charts,samples}
-
-# Route based on MODEL_SOURCE set in Step 1
-if [ "$MODEL_SOURCE" = "ngc" ]; then
-  echo "NGC model detected — skipping HF repo browse, proceeding to Step 2d"
-  # Skip to Step 2d directly — do not run any HF curl commands below
-fi
-# The following HF browse, config download, and labels extraction only runs for MODEL_SOURCE=hf
-```
-
-- Browse the HF repository and classify available model files using the vetted helper script
-  (validates inputs, uses HTTPS+TLSv1.2 only, honors `$HF_TOKEN`):
-  ```bash
-  FILES="$(bash skills/deepstream-import-vision-model/scripts/model/hf-list-files.sh "$HF_ORG" "$MODEL_NAME")"
-  ONNX_FILES=$(echo "$FILES" | grep -E '\.onnx$' || true)
-  ST_FILES=$(echo "$FILES" | grep -E '\.(safetensors|bin)$' || true)
-  echo "ONNX files:      ${ONNX_FILES:-none}"
-  echo "SafeTensors/bin: ${ST_FILES:-none}"
-  echo "All files:       $FILES"
-
-  # If ONNX list is empty in root, also check /onnx subdirectory
-  if [ -z "$ONNX_FILES" ]; then
-      ONNX_SUB="$(bash skills/deepstream-import-vision-model/scripts/model/hf-list-files.sh "$HF_ORG" "$MODEL_NAME" onnx | grep -E '\.onnx$' || true)"
-      echo "ONNX in /onnx subdir: ${ONNX_SUB:-none}"
-  fi
-  ```
-- Classify the repo into one of these categories:
-  - **Category A: ONNX files available** -> proceed to Step 2a (select ONNX variant)
-  - **Category B: SafeTensors/PyTorch only (no ONNX)** -> proceed to Step 2b (export to ONNX)
-  - **Category C: No usable model files** -> inform user, suggest alternative repos
-  - **Category D: NGC model (not on HuggingFace)** -> proceed to Step 2d (NGC download)
-
-- Download `config.json` — required for architecture detection and label extraction.
-  Uses the vetted helper script (validated inputs, HTTPS+TLS, honors `$HF_TOKEN`):
-  ```bash
-  # HF: download from API via vetted helper. NGC: extracted from archive in Step 2d.
-  if [ "$MODEL_SOURCE" = "hf" ]; then
-    bash skills/deepstream-import-vision-model/scripts/model/hf-download-config.sh \
-        "$HF_ORG" "$MODEL_NAME" "models/$MODEL_NAME/config/config.json"
-  else
-    echo "NGC model — config.json will be extracted from the downloaded archive in Step 2d"
-  fi
-  # Note: models/$MODEL_NAME/config/ already exists from the MANDATORY mkdir at the top of Step 2
-  ```
-- Inspect `config.json` to identify:
-  - Model type (e.g., `grounding-dino`, `detr`, `yolos`, `resnet`, `swin`)
-  - Architecture class (e.g., `GroundingDinoForObjectDetection`)
-  - Number of inputs (single input vs multi-modal)
-
-- **Reject non-detection architectures (fail fast)**: Check the `architectures` field in `config.json` before continuing. If the architecture class ends in a non-detection suffix such as `ForImageClassification`, `ForSemanticSegmentation`, `ForInstanceSegmentation`, `ForPanopticSegmentation`, `ForDepthEstimation`, `ForMaskedLM`, `ForTokenClassification`, or `ForCausalLM`, **abort the pipeline with a clear error and exit non-zero**: `"deepstream-import-vision-model currently supports object detection models only. Detected architecture: {arch_class}. Classification, segmentation, and other vision tasks are not yet supported."` Do not prompt the user. Detection architectures end in `ForObjectDetection` (or, for some DETR-family variants, `ForConditionalDetection` / `ForZeroShotObjectDetection`).
-
-- **Extract `labels.txt` from `config.json`** — run this immediately after `config.json` is in place (for HF models that is now; for NGC models this runs at the end of Step 2d):
-  ```bash
-  python3 - <<EOF
-  import json, sys
-  with open("models/$MODEL_NAME/config/config.json") as f:
-      cfg = json.load(f)
-
-  # Primary: id2label (standard HF detection/classification format)
-  if "id2label" in cfg:
-      labels = [cfg["id2label"][str(i)] for i in range(len(cfg["id2label"]))]
-  # Fallback 1: label2id reversed
-  elif "label2id" in cfg:
-      labels = [k for k, v in sorted(cfg["label2id"].items(), key=lambda x: x[1])]
-  # Fallback 2: names dict/list (some YOLO HF repos)
-  elif "names" in cfg:
-      names = cfg["names"]
-      labels = [names[str(i)] for i in range(len(names))] if isinstance(names, dict) else list(names)
-  else:
-      print("ERROR: No label map found in config.json -- cannot create labels.txt", file=sys.stderr)
-      sys.exit(1)
-
-  with open("models/$MODEL_NAME/config/labels.txt", "w") as f:
-      f.write("\n".join(labels) + "\n")
-  print(f"labels.txt: {len(labels)} classes")
-  print("  " + ", ".join(labels[:5]) + (" ..." if len(labels) > 5 else ""))
-  EOF
-  ```
-  If the script exits with error (no label map found), **fail the pipeline with a clear error and exit** — do not prompt the user, and never fall back to hardcoded COCO, ImageNet, or any other default list. This same script runs for HF and NGC — the only requirement is that `config.json` exists at `models/$MODEL_NAME/config/config.json`.
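-
-The fail-fast architecture check described earlier in this step can be sketched in the same style as the label extraction. This is a minimal sketch, not the skill's actual implementation: the function names are hypothetical, and treating a missing `architectures` field as "no rejection" is an assumption the text above does not settle.
-
-```python
-# Suffixes listed in the rejection rule for non-detection architectures
-NON_DETECTION_SUFFIXES = (
-    "ForImageClassification", "ForSemanticSegmentation",
-    "ForInstanceSegmentation", "ForPanopticSegmentation",
-    "ForDepthEstimation", "ForMaskedLM",
-    "ForTokenClassification", "ForCausalLM",
-)
-
-def unsupported_architecture(cfg):
-    """Return the first non-detection architecture class in cfg, or None."""
-    for arch in cfg.get("architectures", []):  # assumption: absent field passes
-        if arch.endswith(NON_DETECTION_SUFFIXES):
-            return arch
-    return None
-
-def rejection_message(arch_class):
-    """Build the error text quoted in the rejection rule."""
-    return (
-        "deepstream-import-vision-model currently supports object detection "
-        f"models only. Detected architecture: {arch_class}. Classification, "
-        "segmentation, and other vision tasks are not yet supported."
-    )
-
-bad = unsupported_architecture({"architectures": ["ResNetForImageClassification"]})
-if bad is not None:
-    print(rejection_message(bad))
-```
-
-A caller would print the message to stderr and exit non-zero when the function returns a class name.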
-
-### Step 2a: Select ONNX Variant (Category A)
-- Identify available quantization variants (fp32, fp16, int8, int4, quantized, etc.)
-- **Default preference: fp16**. Apply this logic:
-  1. If fp16 variant exists -> **select it silently**, log: `"Selected: fp16 (default). All available: [list]"`
-  2. If fp16 does NOT exist -> **auto-select deterministically** in this priority order: fp32 > int8 > int4 > quantized > first ONNX alphabetically. Log: `"Selected: {variant} (fp16 unavailable). All available: [list]"`. Do not prompt the user.
-  3. If only one ONNX file exists -> log it and proceed without asking
-- **Construct the resolved download URL** for the selected variant from the tree listing:
-  ```bash
-  # The tree API returns entries with a "path" field (relative to repo root)
-  # Construct the download URL as:
-  PATH_FROM_TREE="<path field from tree listing, e.g. onnx/model_fp16.onnx>"
-  ONNX_URL="https://huggingface.co/$HF_ORG/$MODEL_NAME/resolve/main/$PATH_FROM_TREE"
-  # Example: path="onnx/model_fp16.onnx" -> URL ends in /resolve/main/onnx/model_fp16.onnx
-  # Store this URL for use in Step 3
-  ```
-- After URL construction, proceed to **Step 3** (download ONNX)
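-
-The selection order in Step 2a is deterministic, so it can be expressed as a small function. A minimal sketch, assuming the variant appears as a separate token in the file name (for example `model_fp16.onnx`); that naming convention and the function names are assumptions, and an untagged file such as `model.onnx` is only reached by the alphabetical fallback:
-
-```python
-import re
-
-# Priority from Step 2a: fp16 first, then fp32 > int8 > int4 > quantized
-PRIORITY = ["fp16", "fp32", "int8", "int4", "quantized"]
-
-def variant_of(path):
-    """Return the variant tag found in the file name, or None if untagged."""
-    name = path.rsplit("/", 1)[-1].lower()
-    tokens = re.split(r"[^a-z0-9]+", name)  # "model_fp16.onnx" -> model, fp16, onnx
-    for tag in PRIORITY:
-        if tag in tokens:
-            return tag
-    return None
-
-def select_onnx_variant(paths):
-    """Pick one ONNX path by priority, else the first path alphabetically."""
-    if not paths:
-        raise ValueError("no ONNX files to choose from")
-    ordered = sorted(paths)
-    for tag in PRIORITY:
-        for path in ordered:
-            if variant_of(path) == tag:
-                return path, tag
-    return ordered[0], "unlabeled"
-
-print(select_onnx_variant(["onnx/model.onnx", "onnx/model_int8.onnx", "onnx/model_fp16.onnx"]))
-```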
-
-### Step 2b: Export SafeTensors to ONNX (Category B)
-
-When the repo only has `.safetensors` (or `.bin`) files and no ONNX export, convert to ONNX using an **isolated virtual environment** to avoid polluting the host system.
-
-#### 2b-i: Setup Isolated Virtual Environment
-- **ALWAYS** use a dedicated venv for export tools. Never install optimum/transformers/torch system-wide.
-- Use a **single shared venv** at `build/.venv_optimum` across all models. `optimum`, `transformers`, `torch`, and `safetensors` are heavy (~2-5 GB) and identical from one model to the next, so creating one venv per model wastes minutes of install time and gigabytes of disk on every run. The `skills/deepstream-import-vision-model/scripts/model/safetensors-to-onnx.sh` helper is built around this shared venv; align the skill-driven path with it.
-  ```bash
-  mkdir -p build
-  VENV=build/.venv_optimum
-  if [ ! -x "$VENV/bin/optimum-cli" ]; then
-    python3 -m venv "$VENV"
-    source "$VENV/bin/activate"
-    pip install --upgrade pip
-    pip install 'optimum[exporters]' torch transformers safetensors onnxruntime matplotlib numpy markdown
-  else
-    source "$VENV/bin/activate"
-  fi
-  ```
-- For a new model that needs **extra packages** (e.g. `timm` for DETR-family backbones, `onnxsim`, or a different `optimum` pin), `pip install` them **into the existing shared venv** rather than creating a new one:
-  ```bash
-  source build/.venv_optimum/bin/activate
-  pip install timm   # or: pip install 'optimum[exporters]<2.1'
-  ```
-- The venv lives under `build/.venv_optimum` at the repo root, which keeps `models/` clean; the venv is excluded from git via the root `.gitignore`
-- All subsequent Python/pip commands in Step 2b must run inside this venv
-- Legacy per-model venvs at `build/.venv_$MODEL_NAME` from older runs are still cleaned up by `skills/deepstream-import-vision-model/scripts/model/cleanup.sh "$MODEL_NAME"` for backward compatibility
-
-#### 2b-ii: Download Required Files
-- Download from the HF repo into `models/$MODEL_NAME/hf_model/` using `-P` to avoid changing the working directory:
-  ```bash
-  mkdir -p models/$MODEL_NAME/hf_model
-  HF_BASE="https://huggingface.co/$HF_ORG/$MODEL_NAME/resolve/main"
-  # Download model files
-  wget -P models/$MODEL_NAME/hf_model "$HF_BASE/model.safetensors"
-  wget -P models/$MODEL_NAME/hf_model "$HF_BASE/config.json"
-  wget -P models/$MODEL_NAME/hf_model "$HF_BASE/preprocessor_config.json"
-  # For text+vision models, also download tokenizer files (failures are non-fatal):
-  wget -P models/$MODEL_NAME/hf_model "$HF_BASE/tokenizer.json"         || true
-  wget -P models/$MODEL_NAME/hf_model "$HF_BASE/tokenizer_config.json"  || true
-  wget -P models/$MODEL_NAME/hf_model "$HF_BASE/vocab.txt"              || true
-  wget -P models/$MODEL_NAME/hf_model "$HF_BASE/special_tokens_map.json" || true
-  ```
-- For sharded models (multiple `.safetensors` files), also download `model.safetensors.index.json` and all shards
-
-#### 2b-iii: Try optimum-cli Export (Preferred) -- Max 3 Retries
-
-> **optimum 2.1.0 removed the `onnx` subcommand.** If `optimum-cli export onnx` exits with "unknown command", pin an older version (`pip install 'optimum[exporters]<2.1'`) or skip straight to **Step 2b-iv** (manual `torch.onnx.export`). The `optimum.exporters.onnx` Python module is also gone in 2.1+.
-
-- Attempt export using optimum-cli:
-  ```bash
-  source build/.venv_optimum/bin/activate
-  optimum-cli export onnx \
-    --model models/$MODEL_NAME/hf_model \
-    --task object-detection \
-    --opset 17 \
-    models/$MODEL_NAME/onnx_export/
-  ```
-- Common `--task` values for detection/vision models:
-  - `object-detection` -- DETR, YOLOS, Conditional DETR
-  - `image-classification` -- ResNet, ViT, Swin, ConvNeXt
-  - `image-segmentation` -- Mask2Former, SAM
-  - `semantic-segmentation` -- SegFormer, UperNet
-  - `zero-shot-object-detection` -- OWL-ViT, Grounding DINO (if supported)
-- If export succeeds, copy the ONNX file to the `model/` subdirectory:
-  ```bash
-  cp models/$MODEL_NAME/onnx_export/model.onnx models/$MODEL_NAME/model/$MODEL_NAME.onnx
-  ```
-- **Retry policy**: If the export fails, retry up to **3 times total** with adjustments between attempts:
-  - **Retry 1**: Try a different `--task` value if the error suggests wrong task type
-  - **Retry 2**: Try a different `--opset` version (e.g., 14 or 16 instead of 17)
-  - **Retry 3**: Try with `--no-post-process` or other flags relevant to the error
-  - After 3 failed attempts with optimum-cli, fall back to **Step 2b-iv** (manual torch.onnx.export)
-
-#### 2b-iv: Fallback -- Manual torch.onnx.export (If optimum fails) -- Max 3 Retries
-- If optimum-cli fails after 3 retries (unsupported architecture), use manual export:
-  ```bash
-  source build/.venv_optimum/bin/activate
-  python3 -c "
-  from transformers import AutoModelForObjectDetection
-  import torch
-
-  model = AutoModelForObjectDetection.from_pretrained('models/$MODEL_NAME/hf_model')
-  model.eval()
-
-  # Dummy input: 800x800 is a placeholder; use the size from preprocessor_config.json
-  dummy = torch.randn(1, 3, 800, 800)
-
-  torch.onnx.export(model, dummy, 'models/$MODEL_NAME/model/$MODEL_NAME.onnx',
-    export_params=True, opset_version=17, do_constant_folding=True,
-    input_names=['pixel_values'],
-    output_names=['logits', 'pred_boxes'],
-    dynamic_axes={'pixel_values': {0: 'batch'},
-                  'logits': {0: 'batch'},
-                  'pred_boxes': {0: 'batch'}})
-  "
-  ```
-- Adjust input/output names and shapes based on the model architecture
-- **Retry policy**: If manual export fails, retry up to **3 times total** with adjustments:
-  - **Retry 1**: Try a different `AutoModel` class (e.g., `AutoModel`, `AutoModelForImageClassification`)
-  - **Retry 2**: Try a different opset version or simplify dynamic_axes
-  - **Retry 3**: Try with `torch.onnx.export(..., operator_export_type=torch.onnx.OperatorExportTypes.ONNX_ATEN_FALLBACK)`
-  - After 3 failed attempts, **stop and generate a failure report**
-
-> **Gotchas for recent PyTorch/transformers**:
-> - PyTorch 2.11+ with onnxscript installed auto-upgrades opset to 18 even when `opset_version=17` is requested. The resulting opset-18 ONNX is compatible with TRT 10.16 — accept it.
-> - The dynamo backend (`dynamo=True`) may silently ignore `dynamic_axes` for transformer models where attention reshape patterns bake the batch dimension into the graph. Verify exported input shapes with `onnx.load()`. For DETR-family models on TRT 10.16, prefer the dynamo path with `torch.export.Dim("batch", min=1, max=N)` — it avoids the backbone-mask ForeignNode failure described in `nv-engine-build`.
-> - The legacy TorchScript path (`dynamo=False`) crashes with transformers 5.5+ due to `create_bidirectional_mask` incompatibility.
-> - **External data files**: `torch.onnx.export` may produce `model.onnx.data` alongside the `.onnx`. Consolidate before TRT conversion: `m = onnx.load(path, load_external_data=True); onnx.save(m, consolidated_path)`.
-
-#### 2b-v: Handle Multi-Modal Models (e.g., Grounding DINO)
-- Models that take **both image AND text** inputs need special handling for DeepStream (nvinfer only supports image input)
-- Strategy: **freeze the text prompt** into the ONNX graph as a constant
-  1. Run the model once with a fixed text prompt (e.g., "person . car . truck .")
-  2. Export ONNX with the text embeddings baked in as constants
-  3. The resulting ONNX model only needs `pixel_values` as input
-- If freezing is not possible, check `onnx-community/` for pre-converted single-input versions
-- **Inform the user** about the frozen text prompt and its implications (fixed detection classes)
-
-#### 2b-vi: onnxsim — Run After Export When Needed
-
-If the model has dynamic shape paths that cause TRT `ForeignNode` fusion issues, simplify the ONNX graph with `onnxsim` **before** engine building:
-
-```bash
-source build/.venv_optimum/bin/activate
-pip install onnxsim
-python3 -m onnxsim \
-  models/$MODEL_NAME/model/$MODEL_NAME.onnx \
-  models/$MODEL_NAME/model/${MODEL_NAME}_sim.onnx
-# Use the _sim.onnx for engine building if the original triggers ForeignNode errors
-```
-
-Only run `onnxsim` if the TRT build fails with `ForeignNode` errors; most models do not need it.
-
-#### 2b-vii: Validate ONNX Output
-- After export, validate the ONNX file:
-  ```bash
-  source build/.venv_optimum/bin/activate
-  python3 -c "
-  import onnx
-  m = onnx.load('models/$MODEL_NAME/model/$MODEL_NAME.onnx')
-  onnx.checker.check_model(m)
-  print('Inputs:')
-  for i in m.graph.input:
-    dims = [d.dim_param or d.dim_value for d in i.type.tensor_type.shape.dim]
-    print(f'  {i.name}: {dims}')
-  print('Outputs:')
-  for o in m.graph.output:
-    dims = [d.dim_param or d.dim_value for d in o.type.tensor_type.shape.dim]
-    print(f'  {o.name}: {dims}')
-  print('ONNX validation passed!')
-  "
-  ```
-- Verify:
-  - Single image input (no text/mask inputs -- remove if needed)
-  - Output shapes match expected detection format
-  - Dynamic batch dimension is present
-
-#### 2b-viii: Cleanup
-- Deactivate the venv after export is complete:
-  ```bash
-  deactivate
-  ```
-- **Keep `build/.venv_optimum` across runs** — it is shared by every SafeTensors → ONNX export and rebuilding it for each model costs minutes and GBs. `cleanup.sh` intentionally does not remove it.
-- `cleanup.sh` removes per-model artifacts (`models/$MODEL_NAME/hf_model`, `models/$MODEL_NAME/onnx_export`, and any legacy `build/.venv_$MODEL_NAME` left over from older runs):
-  ```bash
-  # Validated script; will refuse unsafe paths. Shared .venv_optimum is preserved.
-  bash skills/deepstream-import-vision-model/scripts/model/cleanup.sh "$MODEL_NAME"
-  # Preview without removing:
-  # bash skills/deepstream-import-vision-model/scripts/model/cleanup.sh "$MODEL_NAME" --dry-run
-  ```
-- The ONNX file is now at `models/$MODEL_NAME/model/$MODEL_NAME.onnx` -- proceed to engine building
-
-### Step 2d: NGC Model Download (Category D)
-
-When the model comes from NVIDIA NGC (not HuggingFace), download it with the vetted helper script, which uses the `ngc` CLI if it is installed and otherwise falls back to `curl` over HTTPS:
-
-```bash
-# Vetted helper: prefers ngc CLI if installed, else falls back to authenticated
-# HTTPS+TLS via curl against the public NGC catalog API. All inputs validated
-# against ^[A-Za-z0-9._-]+$. See skills/deepstream-import-vision-model/scripts/model/ngc-download.sh for details.
-bash skills/deepstream-import-vision-model/scripts/model/ngc-download.sh \
-    "$NGC_ORG" "$NGC_TEAM" "$MODEL_NAME" "$NGC_VERSION" \
-    "models/$MODEL_NAME/ngc_download"
-
-# Inspect downloaded files
-echo "Downloaded files:"
-ls -lhR models/$MODEL_NAME/ngc_download/
-```
-
-- Identify the ONNX file(s) in the downloaded archive (often inside a subdirectory named after the model version)
-- If the download contains a `.etlt` or `.engine` file only (TAO encrypted format), check if a plain ONNX is also provided; if not, use the TAO-provided engine directly and skip Step 4 (engine build)
-- Copy the ONNX to the model directory:
-  ```bash
-  NGC_ONNX=$(find models/$MODEL_NAME/ngc_download -name "*.onnx" | head -1)
-  cp "$NGC_ONNX" models/$MODEL_NAME/model/$MODEL_NAME.onnx
-  echo "ONNX: $NGC_ONNX -> models/$MODEL_NAME/model/$MODEL_NAME.onnx"
-  ```
-- Extract `config.json` from the archive and build `labels.txt` (same logic as HF path):
-  ```bash
-  NGC_CONFIG=$(find models/$MODEL_NAME/ngc_download -name "config.json" | head -1)
-  if [ -z "$NGC_CONFIG" ]; then
-    echo "ERROR: config.json not found in NGC archive — cannot create labels.txt"
-    echo "Cannot proceed without a label map — aborting. Provide an NGC archive that contains config.json."
-    exit 1
-  else
-    cp "$NGC_CONFIG" models/$MODEL_NAME/config/config.json
-    echo "config.json extracted from: $NGC_CONFIG"
-    # Now run the same labels.txt extraction as the HF path
-    python3 - <<EOF
-import json, sys
-with open("models/$MODEL_NAME/config/config.json") as f:
-    cfg = json.load(f)
-if "id2label" in cfg:
-    labels = [cfg["id2label"][str(i)] for i in range(len(cfg["id2label"]))]
-elif "label2id" in cfg:
-    labels = [k for k, v in sorted(cfg["label2id"].items(), key=lambda x: x[1])]
-elif "names" in cfg:
-    names = cfg["names"]
-    labels = [names[str(i)] for i in range(len(names))] if isinstance(names, dict) else list(names)
-else:
-    print("ERROR: No label map found in config.json -- cannot create labels.txt", file=sys.stderr)
-    sys.exit(1)
-with open("models/$MODEL_NAME/config/labels.txt", "w") as f:
-    f.write("\n".join(labels) + "\n")
-print(f"labels.txt: {len(labels)} classes")
-print("  " + ", ".join(labels[:5]) + (" ..." if len(labels) > 5 else ""))
-EOF
-  fi
-  ```
-
-## Step 3: Download the ONNX Model
-
-The model directory structure was already created by the MANDATORY `mkdir -p` at the start of Step 2. Do NOT run `mkdir -p` again here; just download the file:
-
-```bash
-wget -O "models/$MODEL_NAME/model/$MODEL_NAME.onnx" "${ONNX_URL}"
-```
-
-Where `$ONNX_URL` is the resolved URL constructed at the end of Step 2a. Step 3 applies to Category A only: Categories B and D already write the ONNX directly to `models/$MODEL_NAME/model/$MODEL_NAME.onnx` during export (Step 2b) or copy (Step 2d).
-- Also download any external data files if the ONNX model references them (files with `.onnx_data` extension or similar)
-- Verify the download completed successfully and report file size
-
-## Timing
-
-Record wall-clock time at the start and end of this skill:
-```bash
-STEP_START=$(date +%s.%N)
-# ... all steps ...
-STEP_END=$(date +%s.%N)
-STEP_DURATION=$(echo "$STEP_END - $STEP_START" | bc)
-```
-
-## Output Summary
-
-When complete, print:
-```
-=== HF Model Acquire Complete ===  [Steps 1-3: ${STEP_DURATION}s]
-Model:  $MODEL_NAME
-ONNX:   models/$MODEL_NAME/model/$MODEL_NAME.onnx ({size} MB)
-Input:  {input_name} {input_shape}
-Output: {output_names} {output_shapes}
-Labels: {num_classes} classes -> models/$MODEL_NAME/config/labels.txt
-Ready for: Steps 4-5 — read references/engine-build.md models/$MODEL_NAME/model/$MODEL_NAME.onnx
-```
-(`{size}`, `{input_name}`, `{input_shape}`, `{output_names}`, `{output_shapes}`, `{num_classes}` are filled from the ONNX inspection output — all other fields use bash variables.)
diff --git a/skills/deepstream/deepstream-import-vision-model/references/pipeline-run.md b/skills/deepstream/deepstream-import-vision-model/references/pipeline-run.md
deleted file mode 100644
index 92bda31e..00000000
--- a/skills/deepstream/deepstream-import-vision-model/references/pipeline-run.md
+++ /dev/null
@@ -1,529 +0,0 @@
-
-# DS Run Pipeline -- Steps 6-7
-
-Integrate a TensorRT model into DeepStream with parser, validation, and multi-stream benchmarks.
-
-The model directory is: `$ARGUMENTS`
-
-## Pre-flight: Extract Variables
-
-```bash
-[ -z "$ARGUMENTS" ] && { echo "ERROR: No model directory provided. Usage: /deepstream-import-vision-model models/<model_name>/"; exit 1; }
-MODEL_DIR="${ARGUMENTS%/}"
-MODEL_NAME=$(basename "$MODEL_DIR")
-
-# Find ONNX file (exclude _dynamic variants created during export)
-ONNX_FILE=$(ls models/$MODEL_NAME/model/*.onnx 2>/dev/null | grep -v '_dynamic' | head -1)
-[ -z "$ONNX_FILE" ] && { echo "ERROR: No ONNX file found in models/$MODEL_NAME/model/ — run Steps 1-3 first (references/model-acquire.md)"; exit 1; }
-MODEL_FILENAME=$(basename "$ONNX_FILE" .onnx)
-
-# Find TRT engine from nv-engine-build
-ENGINE=$(ls models/$MODEL_NAME/benchmarks/engines/*_dynamic_b*.engine 2>/dev/null | head -1)
-[ -z "$ENGINE" ] && { echo "ERROR: No engine found in models/$MODEL_NAME/benchmarks/engines/ — run Steps 4-5 first (references/engine-build.md)"; exit 1; }
-MAX_BS=$(echo "$ENGINE" | grep -oP '_b\K[0-9]+(?=\.engine)')
-
-# Read PEAK_GPU_STREAMS from trtexec Step 5b log — fixed filename, no timestamp, no wildcard
-TRTEXEC_LOG="models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log"
-[ -f "$TRTEXEC_LOG" ] || { echo "ERROR: trtexec log not found at $TRTEXEC_LOG — run Steps 4-5 first (references/engine-build.md)"; exit 1; }
-QPS_BS_MAX=$(grep -oP 'Throughput:\s*\K[0-9.]+' "$TRTEXEC_LOG" | tail -1)
-read IMGS_PER_SEC PEAK_GPU_STREAMS < <(python3 -c "
-import math
-imgs = float('$QPS_BS_MAX') * $MAX_BS
-print(round(imgs, 2), int(math.floor(imgs / 30)))
-")
-
-# Read spatial dimensions from ONNX inspection
-INSPECT_OUT=$(python3 skills/deepstream-import-vision-model/scripts/model/inspect-onnx.py "$ONNX_FILE")
-INPUT_NAME=$(echo "$INSPECT_OUT" | grep -oP 'input_name:\s*\K\S+')
-H=$(echo "$INSPECT_OUT"          | grep -oP 'height:\s*\K[0-9]+')
-W=$(echo "$INSPECT_OUT"          | grep -oP 'width:\s*\K[0-9]+')
-[ -z "$INPUT_NAME" ] && { echo "ERROR: could not parse INPUT_NAME from inspect output"; exit 1; }
-[ -z "$H" ]          && { echo "ERROR: could not parse H — dynamic spatial dims? Set H manually"; exit 1; }
-[ -z "$W" ]          && { echo "ERROR: could not parse W — dynamic spatial dims? Set W manually"; exit 1; }
-
-# Detect installed CUDA version for parser compilation
-CUDA_VER=$(ls /usr/local/ 2>/dev/null | grep -oP '^cuda-\K[0-9]+\.[0-9]+$' | sort -V | tail -1)
-[ -z "$CUDA_VER" ] && CUDA_VER=12.8
-echo "CUDA_VER=$CUDA_VER"
-
-# Count labels
-[ -f "models/$MODEL_NAME/config/labels.txt" ] || { echo "ERROR: labels.txt not found — run Steps 1-3 first (references/model-acquire.md)"; exit 1; }
-NUM_LABELS=$(wc -l < models/$MODEL_NAME/config/labels.txt)
-
-# Parser function suffix: PascalCase of MODEL_NAME, sanitized for C++ identifiers
-# e.g. yolov8n→Yolov8n  rtdetr-l→RtdetrL  grounding-dino-base→GroundingDinoBase
-PARSER_FUNC_SUFFIX=$(python3 -c "
-import re
-parts = re.sub(r'[^a-zA-Z0-9]', ' ', '$MODEL_NAME').split()
-print(''.join(p.capitalize() for p in parts))
-")
-# Sanitize MODEL_NAME for use in C++ source/library filenames — mirrors PARSER_FUNC_SUFFIX logic.
-# e.g. rtdetr-l → rtdetr_l  grounding-dino-base → grounding_dino_base
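-# printf avoids the trailing newline that echo adds, which tr -c would turn into a stray '_'
-MODEL_NAME_SAFE=$(printf '%s' "$MODEL_NAME" | tr -c 'A-Za-z0-9' '_')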
-
-# Video source — default is sample_720p.mp4 (MANDATORY). Never autonomously substitute
-# sample_1080p_h264.mp4 or any other file. DS_VIDEO may only be set when the user explicitly
-# provides a custom video path; it is not a licence to pick a different resolution.
-VIDEO="${DS_VIDEO:-/opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4}"
-[ -f "$VIDEO" ] || {
-  echo "ERROR: Video file not found: $VIDEO"
-  echo "  Fix 1: Set DS_VIDEO=/path/to/sample_720p.mp4 before running"
-  echo "  Fix 2: Install DeepStream samples (replace 9.0 with your installed minor version): apt-get install deepstream-9.0-samples"
-  exit 1
-}
-
-echo "Model:            $MODEL_NAME"
-echo "ONNX:             $ONNX_FILE  (input=$INPUT_NAME, ${H}x${W})"
-echo "Engine:           $ENGINE  (MAX_BS=$MAX_BS)"
-echo "PEAK_GPU_STREAMS: $PEAK_GPU_STREAMS  (floor($IMGS_PER_SEC img/s / 30))"
-echo "Labels:           $NUM_LABELS classes"
-```
-
-> All subsequent commands use these variables — never hardcoded paths or template placeholders.
-
-## Step 6: DeepStream Integration
-
-```bash
-STEP6_START=$(date +%s.%N)
-```
-
-### 6a: Inspect Model Output Format
-
-Verify output tensor shapes and value ranges before writing the parser:
-```bash
-python3 -c "
-import onnxruntime as ort, numpy as np
-sess = ort.InferenceSession('$ONNX_FILE')
-inp = sess.get_inputs()[0]
-out = sess.get_outputs()
-print(f'Input: {inp.name} shape={inp.shape}')
-for o in out: print(f'Output: {o.name} shape={o.shape}')
-dummy = np.random.randn(*[d if isinstance(d,int) else 1 for d in inp.shape]).astype(np.float32)
-result = sess.run(None, {inp.name: dummy})
-for i,r in enumerate(result): print(f'Output[{i}] range: [{r.min():.4f}, {r.max():.4f}]')
-"
-```
-
-**CRITICAL**: Determine the correct `net-scale-factor` from the input range the model expects, using the model family and the output ranges above as evidence:
-
-| Model expects | net-scale-factor | Notes |
-|---------------|-----------------|-------|
-| 0–255 input (OpenCV Zoo) | `1.0` | No normalization |
-| 0–1 normalized | `0.00392156862745098` (1/255) | Standard |
-| ImageNet normalized | `0.01752` + offsets | Rare in DS |
-
-Wrong scale factor = zero detections. Always verify with KITTI dump (Step 6g) before benchmarks.
-
-### 6b: Write Custom Bounding Box Parser
-
-Create `models/$MODEL_NAME/parser/nvdsinfer_custombboxparser_${MODEL_NAME_SAFE}.cpp`:
-```cpp
-extern "C"
-bool NvDsInferParseCustom${PARSER_FUNC_SUFFIX}(
-    std::vector<NvDsInferLayerInfo> const &outputLayersInfo,
-    NvDsInferNetworkInfo const &networkInfo,
-    NvDsInferParseDetectionParams const &detectionParams,
-    std::vector<NvDsInferObjectDetectionInfo> &objectList);
-
-CHECK_CUSTOM_PARSE_FUNC_PROTOTYPE(NvDsInferParseCustom${PARSER_FUNC_SUFFIX});
-```
-
-Parser implementation rules:
-- Include `nvdsinfer_custom_impl.h` and use `NvDsInferObjectDetectionInfo` (classId, left, top, width, height, detectionConfidence)
-- Decode model-specific output format into pixel-space bounding boxes:
-  - YOLOX-style `[N, num_anchors, 5+C]`: decode grid offsets, exp(w/h), objectness×class_score
-  - SSD-style `[N, num_dets, 6]`: extract class, confidence, normalized → pixel coords
-  - YOLO with BatchedNMS: parse keepCount, bboxes, scores, classes from 4 output layers
-- **Clip all coordinates** to `[0, networkInfo.width-1]` and `[0, networkInfo.height-1]`
-- Use `detectionParams.perClassPreclusterThreshold` for confidence filtering
-- **NMS**: Dense heads → `cluster-mode=2` (DeepStream NMS). Fused TRT NMS → `cluster-mode=4`
-- **Sanity check for undecoded output**: if bbox values land in [0, 3], the parser is reading grid-space offsets. Most models need `(raw + grid_offset) * stride` for cx/cy and `exp(raw) * stride` for w/h. Verify raw output ranges with Python/ONNX Runtime before writing the parser.
-- Reference: `/opt/nvidia/deepstream/deepstream/sources/libs/nvdsinfer_customparser/nvdsinfer_custombboxparser.cpp`; Header: `sources/includes/nvdsinfer_custom_impl.h`
-
-#### Model-family parser patterns
-
-- **DETR / Conditional DETR**: outputs `logits [B, num_queries, num_classes+1]` and `pred_boxes [B, num_queries, 4]`. Boxes are `(cx, cy, w, h)` normalized to `[0,1]` — convert to `(left, top, width, height)` in pixels. Use **softmax** (not sigmoid) on logits. **Background class is the LAST index** (e.g., index 91 for a 92-class DETR, despite `config.json` showing `"0": "N/A"`). Skip the background class when iterating. DETR uses Hungarian matching — NMS is not needed; set `cluster-mode=4` (not `nms-iou-threshold=0.0`, which is a legacy key).
-- **OWL-ViT / CLIP-based zero-shot detectors**: outputs `logits [B, num_patches, num_classes]` and `pred_boxes [B, num_patches, 4]`. **Sigmoid** activation (per-class independent scoring, not softmax). Boxes are `(cx, cy, w, h)` normalized `[0,1]`. Use `cluster-mode=2` (NMS with IoU threshold). CLIP preprocessing: `net-scale-factor=0.01459`, `offsets=122.77;116.75;104.09`. Confidence threshold 0.10 works well for general detection; lower to 0.05 for recall-focused tasks.
-- **HF RT-DETR preprocessing quirk**: `RTDetrImageProcessor` may have `do_normalize=false` even though `image_mean`/`image_std` fields exist. When `do_normalize=false`, the model expects `[0,1]` scaled input — set `net-scale-factor=1/255` with no offsets. The ONNX export does NOT bake normalization into the first Conv layer. Verify with ONNX Runtime on a real frame before debugging nvinfer.
-
-#### NGC TAO models — use the built-in parser library
-
-NVIDIA NGC TAO models (trafficcamnet, peoplenet, TrafficCamNet Transformer Lite, etc.) ship with TAO-specific parsers pre-compiled into a system library:
-- **Library path**: `/opt/nvidia/deepstream/deepstream/lib/libnvds_infercustomparser.so` — NOT `libnvds_infercustomparser_tao.so` (even if the NGC YAML config suggests it).
-- Custom parse function names: `NvDsInferParseCustomDDETRTAO`, `NvDsInferParseCustomRTDETRTAO`, etc.
-- **No custom parser compilation needed** — point `custom-lib-path` at the system library and `parse-bbox-func-name` at the TAO function.
-- KITTI dump from `deepstream-app` may emit zero-valued bbox coordinates for DETR/RT-DETR parsers even when detections are correct. Verify visually with JPEG frame extraction instead.
-
-### `network-type` vs `model-type` — use `network-type=0`
-
-- `model-type` is a legacy/unknown key — nvinfer ignores it with a warning.
-- `network-type=0` (Detector) is required to invoke `parse-bbox-func-name`.
-- `network-type=100` (Other) does NOT invoke the custom bbox parser — it requires `output-tensor-meta=1` for external post-processing.
-- **Symptom of the wrong key**: custom parse function is never called (zero detections, no parser debug output) — check that `network-type=0` is set.
-
-### 6c: Create Makefile
-
-Write `models/$MODEL_NAME/parser/Makefile` using Python to guarantee literal TAB characters in recipe lines (heredoc in bash can produce spaces, which break make):
-```bash
-python3 - << EOF
-model = '$MODEL_NAME'
-model_safe = '$MODEL_NAME_SAFE'
-content = (
-    "DEEPSTREAM_DIR ?= /opt/nvidia/deepstream/deepstream\n"
-    "CUDA_VER ?= 12.8\n"
-    "CC := g++\n"
-    "CFLAGS := -Wall -std=c++11 -shared -fPIC\n"
-    "CFLAGS += -I\$(DEEPSTREAM_DIR)/sources/includes -I/usr/local/cuda-\$(CUDA_VER)/include\n"
-    "LIBS := -lnvinfer\n"
-    "LFLAGS := -Wl,--start-group \$(LIBS) -Wl,--end-group\n"
-    f"SRCFILES := nvdsinfer_custombboxparser_{model_safe}.cpp\n"
-    f"TARGET_LIB := libnvdsinfer_{model_safe}_parser.so\n"
-    "\n"
-    "all: \$(TARGET_LIB)\n"
-    "\$(TARGET_LIB): \$(SRCFILES)\n"
-    "\t\$(CC) -o \$@ \$^ \$(CFLAGS) \$(LFLAGS)\n"   # TAB required by make
-    "clean:\n"
-    "\trm -rf \$(TARGET_LIB)\n"                       # TAB required by make
-)
-with open(f'models/{model}/parser/Makefile', 'w') as f:
-    f.write(content)
-print(f"Makefile written: models/{model}/parser/Makefile")
-EOF
-```
-
-### 6d: Build Parser Library
-
-```bash
-make -C models/$MODEL_NAME/parser \
-  DEEPSTREAM_DIR=/opt/nvidia/deepstream/deepstream \
-  CUDA_VER=$CUDA_VER
-
-# Verify the symbol is exported
-nm -D models/$MODEL_NAME/parser/libnvdsinfer_${MODEL_NAME_SAFE}_parser.so | grep NvDsInferParseCustom
-```
-
-### 6e: Create nvinfer Config File
-
-```bash
-cat > models/$MODEL_NAME/config/config_infer_primary_${MODEL_NAME}.txt << EOF
-[property]
-gpu-id=0
-net-scale-factor=0.00392156862745098
-model-color-format=0
-onnx-file=../model/${MODEL_FILENAME}.onnx
-model-engine-file=../benchmarks/engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine
-labelfile-path=labels.txt
-batch-size=1
-network-mode=2
-num-detected-classes=${NUM_LABELS}
-process-mode=1
-interval=0
-gie-unique-id=1
-network-type=0
-custom-lib-path=../parser/libnvdsinfer_${MODEL_NAME_SAFE}_parser.so
-parse-bbox-func-name=NvDsInferParseCustom${PARSER_FUNC_SUFFIX}
-# 2=DeepStream NMS (dense heads: YOLO, SSD). Use 4 if engine has fused NMS output
-cluster-mode=2
-infer-dims=3;${H};${W}
-maintain-aspect-ratio=1
-
-[class-attrs-all]
-topk=200
-nms-iou-threshold=0.45
-pre-cluster-threshold=0.25
-EOF
-```
-
-> **Path note**: All paths are relative to the `config/` directory where this file lives.
-> The template above sets `net-scale-factor` to `1/255`; change it to `1.0` if the model expects 0–255 input (verify via Step 6a).
-
-Verify label count matches:
-```bash
-echo "labels.txt: $NUM_LABELS classes -> num-detected-classes=$NUM_LABELS"
-```
-
-### 6f: Single-Stream Visual Validation
-
-> **ENCODER RULE:**
-> Primary encoder is `nvv4l2h264enc` (NVENC via V4L2) → `.mp4`. `x264enc` and `openh264enc` are **prohibited**.
-> On systems where `/dev/v4l2-nvenc` is unavailable, the approved fallback is `theoraenc + oggmux`
-> (LGPL; both ship in gst-plugins-base) → `.ogv`. If `theoraenc`/`oggmux` are absent, video creation is skipped.
-> Use `skills/deepstream-import-vision-model/scripts/deepstream/ds-single-stream.sh` which handles this automatically
-> and emits a `DS_SINGLE_STREAM_MODE=` marker the report parser reads.
-
-**Primary (NVENC available):**
-
-```bash
-mkdir -p models/$MODEL_NAME/samples
-
-GST_DEBUG=1 gst-launch-1.0 \
-  filesrc location=$VIDEO ! \
-  qtdemux ! queue leaky=downstream ! h264parse ! queue ! nvv4l2decoder ! queue ! \
-  m.sink_0 nvstreammux name=m batch-size=1 width=1280 height=720 ! queue ! \
-  nvinfer config-file-path=models/$MODEL_NAME/config/config_infer_primary_${MODEL_NAME}.txt ! queue ! \
-  nvvideoconvert ! 'video/x-raw(memory:NVMM),format=RGBA' ! \
-  nvdsosd ! nvvideoconvert ! 'video/x-raw(memory:NVMM),format=NV12' ! \
-  nvv4l2h264enc ! h264parse ! mp4mux ! \
-  filesink location=models/$MODEL_NAME/samples/${MODEL_NAME}_output.mp4 sync=0
-```
-
-**Fallback (NVENC unavailable — `/dev/v4l2-nvenc` missing, `theoraenc`/`oggmux` present):**
-
-Output extension switches from `.mp4` to `.ogv` (Ogg/Theora container). `theoraenc` consumes planar `I420`, not `NV12`.
-
-```bash
-GST_DEBUG=1 gst-launch-1.0 \
-  filesrc location=$VIDEO ! \
-  qtdemux ! queue leaky=downstream ! h264parse ! queue ! nvv4l2decoder ! queue ! \
-  m.sink_0 nvstreammux name=m batch-size=1 width=1280 height=720 ! queue ! \
-  nvinfer config-file-path=models/$MODEL_NAME/config/config_infer_primary_${MODEL_NAME}.txt ! queue ! \
-  nvvideoconvert ! nvdsosd ! nvvideoconvert ! \
-  "video/x-raw, format=I420" ! theoraenc quality=48 ! oggmux ! \
-  filesink location=models/$MODEL_NAME/samples/${MODEL_NAME}_output.ogv sync=0
-```
-
-Extract a frame to visually confirm bounding boxes — auto-detect which output file exists:
-
-```bash
-SAMPLE_OUT=$(ls models/$MODEL_NAME/samples/${MODEL_NAME}_output.{mp4,ogv} 2>/dev/null | head -1)
-
-case "$SAMPLE_OUT" in
-  *.mp4)
-    gst-launch-1.0 \
-      filesrc location="$SAMPLE_OUT" ! \
-      qtdemux ! h264parse ! nvv4l2decoder ! videoconvert ! "video/x-raw,format=RGB" ! \
-      jpegenc quality=95 ! \
-      multifilesink location=models/$MODEL_NAME/samples/frame_%04d.jpg max-files=3
-    ;;
-  *.ogv)
-    gst-launch-1.0 \
-      filesrc location="$SAMPLE_OUT" ! \
-      oggdemux ! theoradec ! videoconvert ! "video/x-raw,format=RGB" ! \
-      jpegenc quality=95 ! \
-      multifilesink location=models/$MODEL_NAME/samples/frame_%04d.jpg max-files=3
-    ;;
-esac
-```
-
-If **no detections appear**, the most common cause is wrong `net-scale-factor` — update the config and re-run.
-
-### 6g: KITTI Dump — Verify Detections Programmatically
-
-Run a KITTI dump to confirm detections exist before multi-stream benchmarks.
-
-> **Note:** `gie-kitti-output-dir` is a `deepstream-app` `[application]`
-> property — it is **not** read by `nvinfer` directly. Appending it to the
-> nvinfer config and running a `gst-launch-1.0 ... nvinfer ...` pipeline
-> silently produces zero KITTI files. Use the `ds-kitti-dump.sh` helper,
-> which wraps `deepstream-app` with the correct `[application]` section.
-
-```bash
-mkdir -p models/$MODEL_NAME/samples/kitti_output
-
-bash skills/deepstream-import-vision-model/scripts/deepstream/ds-kitti-dump.sh \
-  models/$MODEL_NAME/config/config_infer_primary_${MODEL_NAME}.txt \
-  models/$MODEL_NAME/samples/kitti_output \
-  100 \
-  "$VIDEO"
-
-# Summarise detection results
-KITTI_FILES=$(ls models/$MODEL_NAME/samples/kitti_output/*.txt 2>/dev/null | wc -l)
-echo "KITTI frames written: $KITTI_FILES"
-echo "Top detected classes:"
-cat models/$MODEL_NAME/samples/kitti_output/*.txt 2>/dev/null \
-  | awk '{print $1}' | sort | uniq -c | sort -rn | head -10
-```
-
-**Validation gate**: If `KITTI_FILES == 0` or all files are empty, detections are broken. Do NOT proceed to Step 7.
-
-```bash
-# MANDATORY hard stop — do not comment out or remove this check
-if [ "$KITTI_FILES" -eq 0 ]; then
-  echo "ERROR: KITTI validation FAILED — zero detection files written."
-  echo "Fix net-scale-factor, parser output format, or config before retrying."
-  echo "Do NOT proceed to Step 7 benchmarks with broken detections."
-  exit 1
-fi
-FRAMES_WITH_DETECTIONS=$(grep -rl '.' models/$MODEL_NAME/samples/kitti_output/ 2>/dev/null | wc -l)
-DETECTION_RATE=$(python3 -c "print(round($FRAMES_WITH_DETECTIONS/$KITTI_FILES*100,1))")
-echo "Detection rate: $FRAMES_WITH_DETECTIONS / $KITTI_FILES frames = ${DETECTION_RATE}%"
-if python3 -c "exit(0 if $FRAMES_WITH_DETECTIONS/$KITTI_FILES >= 0.9 else 1)"; then
-  echo "KITTI validation PASSED (>= 90% frames with detections)"
-else
-  echo "ERROR: Detection rate ${DETECTION_RATE}% < 90% threshold. Fix parser before proceeding."
-  exit 1
-fi
-```
-
-```bash
-STEP6_END=$(date +%s.%N)
-STEP6_DURATION=$(echo "$STEP6_END - $STEP6_START" | bc)
-echo "[Step 6] completed in ${STEP6_DURATION}s"
-```
-
-### DeepStream Troubleshooting
-
-| Symptom | Fix |
-|---------|-----|
-| Zero detections | Wrong `net-scale-factor` — check model family table in Step 6a |
-| Engine rebuilds every run | `model-engine-file` path wrong — verify relative path from `config/` |
-| Parser crash | Output tensor shape mismatch — re-check Step 6a output shapes |
-| Wrong bounding box positions | Grid/stride decoding mismatch — verify model architecture docs |
-| `"layers num: 0"` | Harmless for dynamic-shape engines — do not debug |
-| deepstream-app segfaults | Use `gst-launch-1.0` instead (transformer models) |
-
-## Step 7: Multi-Stream DeepStream Benchmark
-
-### 7b: Create DS Benchmark Config
-
-Create one nvinfer config for all DS benchmark runs. `batch-size` is overridden at runtime via the nvinfer GStreamer element property:
-
-```bash
-mkdir -p models/$MODEL_NAME/benchmarks/ds
-
-cat > models/$MODEL_NAME/benchmarks/ds/config_infer_ds_${MODEL_NAME}.txt << EOF
-[property]
-gpu-id=0
-net-scale-factor=0.00392156862745098
-model-color-format=0
-onnx-file=../../model/${MODEL_FILENAME}.onnx
-model-engine-file=../engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine
-labelfile-path=../../config/labels.txt
-batch-size=${MAX_BS}
-network-mode=2
-num-detected-classes=${NUM_LABELS}
-process-mode=1
-interval=0
-gie-unique-id=1
-network-type=0
-custom-lib-path=../../parser/libnvdsinfer_${MODEL_NAME_SAFE}_parser.so
-parse-bbox-func-name=NvDsInferParseCustom${PARSER_FUNC_SUFFIX}
-# 2=DeepStream NMS (dense heads: YOLO, SSD). Use 4 if engine has fused NMS output
-cluster-mode=2
-infer-dims=3;${H};${W}
-maintain-aspect-ratio=1
-
-[class-attrs-all]
-topk=200
-nms-iou-threshold=0.45
-pre-cluster-threshold=0.25
-EOF
-```
-
-> **Path note**: Paths are relative to `benchmarks/ds/` where this config lives.
-
-### Queue Placement Rules (MANDATORY)
-
-Every pipeline stage must be separated by `queue` elements. Use `leaky=downstream` after `qtdemux` to drop excess frames under GPU saturation; all other queues use no leaky setting (threading only). Always set `batched-push-timeout=-1` on `nvstreammux`. **Never include** `nvmultistreamtiler`, `nvdsosd`, or extra `nvvideoconvert` in benchmark runs — only use for single-stream visual validation (Step 6f).
-
-### 7c: Two-Run DS Benchmark
-
-Only **2 DS pipeline runs** characterise DS overhead vs trtexec.
-
-Both runs go through `deepstream-app` with `[application] enable-perf-measurement=1` (wrapped by `skills/deepstream-import-vision-model/scripts/deepstream/ds-perf-run.sh`). FPS is parsed from the canonical `**PERF:` lines DeepStream emits at the configured measurement interval. This replaces the older `gst-launch-1.0 ... ! fpsdisplaysink` path so the runtime no longer depends on `gstreamer1.0-plugins-bad`.
-
-> **PERF line format**: `**PERF: <fps_run> (<fps_avg>)`, with one `<fps_run> (<fps_avg>)` pair per active source. The helper script averages the per-stream instantaneous FPS across the last few measurement windows; the parser below mirrors that contract by averaging the first (stream-0) `<fps_run>` value from the last 10 `**PERF:` lines.
-
-**DS Run 1 — Calibration at PEAK_GPU_STREAMS streams:**
-
-> **CRITICAL**: Use `$PEAK_GPU_STREAMS` directly. Do NOT pre-apply any efficiency discount (no ×0.6, ×0.7, etc.). Run 1 *measures* the real overhead — do not guess it.
-
-> Log filenames are **fixed** — no timestamp variation. Always `ds_s${N}_run1.log` and `ds_s${N}_run2.log` in `benchmarks/ds/`. The nv-import-vision-model-report skill reads these exact paths.
-
-```bash
-# Hard constraint: num_streams <= engine max batch size — always
-N=$(python3 -c "print(min($PEAK_GPU_STREAMS, $MAX_BS))")
-LOG_RUN1="models/$MODEL_NAME/benchmarks/ds/ds_s${N}_run1.log"
-
-STEP7_RUN1_START=$(date +%s.%N)
-bash skills/deepstream-import-vision-model/scripts/deepstream/ds-perf-run.sh \
-  models/$MODEL_NAME/benchmarks/ds/config_infer_ds_${MODEL_NAME}.txt \
-  "$N" \
-  "$LOG_RUN1" \
-  "$VIDEO"
-
-FPS_RUN1=$(grep -oP '\*\*PERF:\s*\K[0-9.]+' "$LOG_RUN1" | tail -10 | python3 -c "
-import sys; vals=[float(l) for l in sys.stdin if l.strip()]; print(round(sum(vals)/len(vals),2) if vals else 0)")
-python3 -c "exit(0 if float('$FPS_RUN1') > 0 else 1)" || \
-  { echo "ERROR: FPS parsing failed for Run 1 — check $LOG_RUN1"; exit 1; }
-
-TOTAL_FPS_RUN1=$(python3 -c "print(round(float('$FPS_RUN1') * $N, 2))")
-RT_STREAMS=$(python3 -c "import math; print(min(int(math.floor(float('$TOTAL_FPS_RUN1') / 30)), $MAX_BS))")
-echo "DS Run 1: $N streams | FPS/stream=$FPS_RUN1 | total=$TOTAL_FPS_RUN1 img/s | RT_STREAMS=$RT_STREAMS"
-STEP7_RUN1_END=$(date +%s.%N)
-STEP7_RUN1_DURATION=$(echo "$STEP7_RUN1_END - $STEP7_RUN1_START" | bc)
-echo "[Step 7 Run 1] completed in ${STEP7_RUN1_DURATION}s"
-```
-
-**DS Run 2 — Validation at RT_STREAMS:**
-```bash
-N=$RT_STREAMS
-LOG_RUN2="models/$MODEL_NAME/benchmarks/ds/ds_s${N}_run2.log"
-
-STEP7_RUN2_START=$(date +%s.%N)
-bash skills/deepstream-import-vision-model/scripts/deepstream/ds-perf-run.sh \
-  models/$MODEL_NAME/benchmarks/ds/config_infer_ds_${MODEL_NAME}.txt \
-  "$N" \
-  "$LOG_RUN2" \
-  "$VIDEO"
-
-FPS_RUN2=$(grep -oP '\*\*PERF:\s*\K[0-9.]+' "$LOG_RUN2" | tail -10 | python3 -c "
-import sys; vals=[float(l) for l in sys.stdin if l.strip()]; print(round(sum(vals)/len(vals),2) if vals else 0)")
-python3 -c "exit(0 if float('$FPS_RUN2') > 0 else 1)" || \
-  { echo "ERROR: FPS parsing failed for Run 2 — check $LOG_RUN2"; exit 1; }
-
-TOTAL_FPS_RUN2=$(python3 -c "print(round(float('$FPS_RUN2') * $N, 2))")
-RT_CONFIRMED=$(python3 -c "print('YES' if float('$FPS_RUN2') >= 30 else 'NO')")
-echo "DS Run 2: $N streams | FPS/stream=$FPS_RUN2 | total=$TOTAL_FPS_RUN2 img/s | Real-time: $RT_CONFIRMED"
-STEP7_RUN2_END=$(date +%s.%N)
-STEP7_RUN2_DURATION=$(echo "$STEP7_RUN2_END - $STEP7_RUN2_START" | bc)
-echo "[Step 7 Run 2] completed in ${STEP7_RUN2_DURATION}s"
-```
-
-> **NVDEC saturation on fast nano models**: very fast models (YOLO-nano family, etc.) can saturate NVDEC before GPU. Symptom: DS aggregate FPS plateaus at the same value regardless of stream count (e.g., 6,976 at 128 streams, 7,060 at 200 streams). In this case, `PEAK_GPU_STREAMS` from trtexec is an overestimate — Run 1 at that count will show fps/stream well below 30. The `RT_STREAMS = floor(TOTAL_FPS_RUN1 / 30)` formula above produces the correct NVDEC-limited ceiling. Do not pre-apply an efficiency factor to `PEAK_GPU_STREAMS` to compensate — the 2-run method measures overhead, it does not guess it.
-
-**If Run 2 is still not real-time** (FPS/stream < 30): halve RT_STREAMS and retry once:
-```bash
-if [ "$RT_CONFIRMED" = "NO" ]; then
-  RT_STREAMS=$(python3 -c "import math; print(max(1, int(math.floor($RT_STREAMS / 2))))")
-  echo "Run 2 not real-time — retrying at $RT_STREAMS streams"
-  N=$RT_STREAMS
-  LOG_RUN2="models/$MODEL_NAME/benchmarks/ds/ds_s${N}_run2.log"
-  bash skills/deepstream-import-vision-model/scripts/deepstream/ds-perf-run.sh \
-    models/$MODEL_NAME/benchmarks/ds/config_infer_ds_${MODEL_NAME}.txt \
-    "$N" \
-    "$LOG_RUN2" \
-    "$VIDEO"
-  FPS_RUN2=$(grep -oP '\*\*PERF:\s*\K[0-9.]+' "$LOG_RUN2" | tail -10 | python3 -c "
-import sys; vals=[float(l) for l in sys.stdin if l.strip()]; print(round(sum(vals)/len(vals),2) if vals else 0)")
-  TOTAL_FPS_RUN2=$(python3 -c "print(round(float('$FPS_RUN2') * $N, 2))")
-  RT_CONFIRMED=$(python3 -c "print('YES' if float('$FPS_RUN2') >= 30 else 'NO')")
-  echo "Retry: $N streams | FPS/stream=$FPS_RUN2 | Real-time: $RT_CONFIRMED"
-fi
-```
-
-**CONSTRAINT**: `num_streams <= engine_max_bs` always. Already enforced above: Run 1 uses `N = min(PEAK_GPU_STREAMS, MAX_BS)`, and `RT_STREAMS` is clamped with `min(..., MAX_BS)`.
-
-```bash
-TRTEXEC_QPS=$(grep -oP 'Throughput:\s*\K[0-9.]+' "$TRTEXEC_LOG" | tail -1)
-TRTEXEC_IMGS=$(python3 -c "print(round(float('$TRTEXEC_QPS') * $MAX_BS, 2))")
-DS_EFF_RUN1=$(python3 -c "print(round(float('$TOTAL_FPS_RUN1') / float('$TRTEXEC_IMGS') * 100, 1))")
-DS_EFF_RUN2=$(python3 -c "print(round(float('$TOTAL_FPS_RUN2') / float('$TRTEXEC_IMGS') * 100, 1))")
-```
-
-## Timing and Output Summary
-
-```bash
-TOTAL_67_DURATION=$(echo "$STEP6_DURATION + $STEP7_RUN1_DURATION + $STEP7_RUN2_DURATION" | bc)
-```
-
-When complete, print:
-```
-=== DeepStream Integration Complete ===
-Model: $MODEL_NAME | Engine: $ENGINE
-trtexec: $TRTEXEC_IMGS img/s @ BS=$MAX_BS
-DS Run 1 (PEAK): $PEAK_GPU_STREAMS streams | $FPS_RUN1 fps/stream | eff $DS_EFF_RUN1%
-DS Run 2 (RT):   $RT_STREAMS streams | $FPS_RUN2 fps/stream | RT: $RT_CONFIRMED | eff $DS_EFF_RUN2%
-Timing: Step6=${STEP6_DURATION}s Run1=${STEP7_RUN1_DURATION}s Run2=${STEP7_RUN2_DURATION}s Total=${TOTAL_67_DURATION}s
-Ready for: Step 8 — read references/report-generation.md models/$MODEL_NAME/
-```
diff --git a/skills/deepstream/deepstream-import-vision-model/references/report-generation.md b/skills/deepstream/deepstream-import-vision-model/references/report-generation.md
deleted file mode 100644
index 2f386c81..00000000
--- a/skills/deepstream/deepstream-import-vision-model/references/report-generation.md
+++ /dev/null
@@ -1,519 +0,0 @@
-
-# NV Import Vision Model Report -- Step 8
-
-Generate benchmark report with charts, HTML, and PDF from completed benchmarks.
-
-The model directory is: `$ARGUMENTS`
-
-> ## ⛔ STRICT HTML+PDF RULE — NO EXCEPTIONS, NO DEVIATIONS
->
-> **HTML and PDF MUST be generated via the canonical pipeline script. Do NOT write your own HTML generator.**
->
-> **The ONLY permitted way to generate the HTML + PDF:**
-> ```bash
-> python3 skills/deepstream-import-vision-model/scripts/report/md-to-html-pdf.py \
->   models/$MODEL_NAME/reports/benchmark_report.md \
->   skills/deepstream-import-vision-model/scripts/report/report-style.css \
->   models/$MODEL_NAME/reports/ \
->   $MODEL_NAME
-> ```
-> This produces:
-> - `models/$MODEL_NAME/reports/benchmark_report.html` — styled with `report-style.css`, charts embedded as base64
-> - `models/$MODEL_NAME/reports/benchmark_report_${MODEL_NAME}.pdf` — via wkhtmltopdf
->
-> **FORBIDDEN — never do any of these:**
-> - Write your own `generate_html.py` or any custom markdown-to-HTML converter script
-> - Call `wkhtmltopdf` directly — use `md-to-html-pdf.py` which already calls it correctly
-> - Use `md-to-pdf.sh` — GFM+Mermaid design doc tool only, wrong CSS
-> - Use `pandoc`, `pdflatex`, or any other converter
->
-> `report-style.css` provides the ONLY correct CSS (dark navy headers #283593, alternating rows #e8eaf6, dark code blocks #263238). Any other CSS produces wrong-looking reports.
-
-## 8a: Report Structure — 12 Mandatory Sections
-
-The report must contain exactly these 12 sections in order:
-
-1. **Model Configuration** — model name, source (HF repo / NGC), architecture, ONNX source, input/output shapes, classes, custom parser name, cluster mode, precision, engine profile
-2. **System Configuration** — GPU (name + VRAM), Driver, CUDA, TensorRT, DeepStream, OS, Python, PyTorch, ONNX versions
-3. **Preprocessing** — net-scale-factor, offsets, color format, normalization details (with reference to the preprocessing table in deepstream-import-vision-model/SKILL.md)
-4. **Engine Build Summary** — source format, conversion path, engine filename (with max_bs postfix), engine size (MB), FP16 flag, builder_optimization_level if non-default, timing cache path
-5. **trtexec Results** — two runs (BS=1 and BS=MAX_BS) with: QPS, Images/s, GPU Compute mean/P99 (ms). Do NOT include H2D/D2H latency or Host Latency. Show PEAK_GPU_STREAMS derivation:
-   ```
-   PEAK_GPU_STREAMS = floor(QPS_at_MAX_BS × MAX_BS / 30)
-                    = floor(imgs_per_sec_at_MAX_BS / 30)
-   ```
-6. **PEAK_GPU_STREAMS Derivation** — explicit calculation block showing formula, inputs, and result. If a second engine was built, show both PEAK_GPU_STREAMS computations.
-7. **Single-Stream Validation** — KITTI frame count, frames with detections, top-10 detected classes (from KITTI dump), validation result (PASS/FAIL)
-8. **DeepStream Benchmark Results** — two runs:
-   - **DS Run 1 (Calibration at PEAK_GPU_STREAMS)**: streams, batch, FPS/stream, total img/s, real-time (YES/NO)
-   - **DS Run 2 (Validation at RT_STREAMS)**: streams, batch, FPS/stream, total img/s, real-time (YES)
-9. **trtexec vs DeepStream Comparison** — 3-column table: trtexec | DS Run 1 | DS Run 2, rows: engine, batch/streams, total imgs/s, FPS/stream, real-time ≥30fps, DS Efficiency %
-10. **Efficiency Analysis** — efficiency formula, Run 1 and Run 2 percentages, breakdown of the gap (NVDEC + mux + GStreamer overhead), GPU-bound vs pipeline-bound verdict
-11. **Pipeline Timing** — per-step wall-clock duration and total:
-    | Step | Description | Duration |
-    |------|-------------|----------|
-    | 1-3  | HF Model Acquire (download + inspect ONNX) | {time}s |
-    | 4    | Engine build | {time}s |
-    | 5    | trtexec BS=1 + BS=MAX_BS | {time}s |
-    | 6    | Parser + config + visual validation + KITTI | {time}s |
-    | 7 Run 1 | DS Calibration (PEAK_GPU_STREAMS streams) | {time}s |
-    | 7 Run 2 | DS Validation (RT_STREAMS streams) | {time}s |
-    | 8    | Report generation | {time}s |
-    | **Total** | **End-to-end** | **{total}s** |
-12. **Reference Commands** — exact reproducible commands:
-    - trtexec engine build (full command with all flags and paths)
-    - trtexec benchmark BS=1 and BS=MAX_BS
-    - DeepStream single-stream validation (`gst-launch-1.0` with filesink + OSD)
-    - DeepStream multi-stream benchmark (`deepstream-app` with `enable-perf-measurement=1` via `ds-perf-run.sh`, PEAK_GPU_STREAMS and RT_STREAMS variants)
-    - nvinfer config key fields (as an ini code block)
-    - Custom parser build command (`make` with DEEPSTREAM_DIR and CUDA_VER)
-    - Use actual absolute paths from the model directory, never placeholders
-
-## Pre-flight: Extract Variables from Benchmark Logs
-
-Before generating any output, derive all variables by reading completed benchmark files. These variables are used by every section below.
-
-```bash
-STEP8_START=$(date +%s.%N)
-
-MODEL_DIR="${ARGUMENTS%/}"
-MODEL_NAME=$(basename "$MODEL_DIR")
-
-# Locate engine — pick the LARGEST batch engine (sort -V ensures numeric sort, tail picks highest)
-ENGINE=$(ls models/$MODEL_NAME/benchmarks/engines/*_dynamic_b*.engine 2>/dev/null | sort -V | tail -1)
-[ -z "$ENGINE" ] && { echo "ERROR: No engine found in models/$MODEL_NAME/benchmarks/engines/ — run Steps 4-5 first (references/engine-build.md)"; exit 1; }
-MAX_BS=$(echo "$ENGINE" | grep -oP '_b\K[0-9]+(?=\.engine)')
-MODEL_FILENAME=$(basename "$ENGINE" | sed 's/_dynamic_b[0-9]*\.engine$//')
-echo "Using engine: $ENGINE (MAX_BS=$MAX_BS)"
-
-# Extract input name and spatial dims from ONNX (needed for reference commands in the report)
-ONNX_FILE=$(ls models/$MODEL_NAME/model/*.onnx 2>/dev/null | grep -v '_dynamic' | head -1)
-if [ -n "$ONNX_FILE" ]; then
-  INSPECT_OUT=$(python3 skills/deepstream-import-vision-model/scripts/model/inspect-onnx.py "$ONNX_FILE" 2>/dev/null)
-  INPUT_NAME=$(echo "$INSPECT_OUT" | grep -oP 'input_name:\s*\K\S+')
-  H=$(echo "$INSPECT_OUT" | grep -oP 'height:\s*\K[0-9]+')
-  W=$(echo "$INSPECT_OUT" | grep -oP 'width:\s*\K[0-9]+')
-fi
-INPUT_NAME=${INPUT_NAME:-"images"}  # fallback
-H=${H:-"640"}; W=${W:-"640"}       # fallback — update if model uses different resolution
-
-# Parse trtexec BS=1 log — fixed filename trtexec_b1.log (no timestamp, no wildcard needed)
-TRTEXEC_LOG_BS1="models/$MODEL_NAME/benchmarks/b1/trtexec_b1.log"
-[ -f "$TRTEXEC_LOG_BS1" ] || { echo "ERROR: $TRTEXEC_LOG_BS1 not found — run Steps 4-5 first (references/engine-build.md)"; exit 1; }
-QPS_BS1=$(grep -oP 'Throughput:\s*\K[0-9.]+' "$TRTEXEC_LOG_BS1" | tail -1)
-GPU_MEAN_BS1=$(grep -oP 'GPU Compute Time:.*mean = \K[0-9.]+' "$TRTEXEC_LOG_BS1" | tail -1)
-
-# Parse trtexec BS=MAX_BS log — fixed filename trtexec_b${MAX_BS}.log
-TRTEXEC_LOG_BSMAX="models/$MODEL_NAME/benchmarks/b${MAX_BS}/trtexec_b${MAX_BS}.log"
-[ -f "$TRTEXEC_LOG_BSMAX" ] || { echo "ERROR: $TRTEXEC_LOG_BSMAX not found — run Steps 4-5 first (references/engine-build.md)"; exit 1; }
-QPS_BS_MAX=$(grep -oP 'Throughput:\s*\K[0-9.]+' "$TRTEXEC_LOG_BSMAX" | tail -1)
-GPU_MEAN_BS_MAX=$(grep -oP 'GPU Compute Time:.*mean = \K[0-9.]+' "$TRTEXEC_LOG_BSMAX" | tail -1)
-GPU_P99_BS_MAX=$(grep -oP 'GPU Compute Time:.*percentile\(99%\) = \K[0-9.]+' "$TRTEXEC_LOG_BSMAX" | tail -1)
-[ -z "$QPS_BS_MAX" ] && { echo "ERROR: Could not parse Throughput from $TRTEXEC_LOG_BSMAX — log may be empty or malformed"; exit 1; }
-[ -z "$MAX_BS" ] && { echo "ERROR: Could not parse batch size from engine filename: $ENGINE"; exit 1; }
-
-read IMGS_PER_SEC PEAK_GPU_STREAMS < <(python3 -c "
-import math
-imgs = float('$QPS_BS_MAX') * $MAX_BS
-print(round(imgs, 2), int(math.floor(imgs / 30)))
-")
-
-# Parse DeepStream Run 1 and Run 2 FPS from logs written by ds-perf-run.sh (Step 7)
-# Fixed filename pattern: benchmarks/ds/ds_s{N}_run1.log and ds_s{N}_run2.log
-# Use glob to find them (N varies per model) then extract N from filename
-DS_LOG_RUN1=$(ls models/$MODEL_NAME/benchmarks/ds/ds_s*_run1.log 2>/dev/null | head -1)
-DS_LOG_RUN2=$(ls models/$MODEL_NAME/benchmarks/ds/ds_s*_run2.log 2>/dev/null | head -1)
-[ -z "$DS_LOG_RUN1" ] && { echo "ERROR: No DS Run 1 log found at benchmarks/ds/ds_s*_run1.log — run Steps 6-7 first (references/pipeline-run.md)"; exit 1; }
-[ -z "$DS_LOG_RUN2" ] && { echo "ERROR: No DS Run 2 log found at benchmarks/ds/ds_s*_run2.log — run Steps 6-7 first (references/pipeline-run.md)"; exit 1; }
-
-N_RUN1=$(basename "$DS_LOG_RUN1" | grep -oP 'ds_s\K[0-9]+(?=_run1)')
-N_RUN2=$(basename "$DS_LOG_RUN2" | grep -oP 'ds_s\K[0-9]+(?=_run2)')
-[[ "$N_RUN1" =~ ^[0-9]+$ ]] || { echo "ERROR: Could not parse stream count from $(basename "$DS_LOG_RUN1") — expected filename pattern ds_s<N>_run1.log"; exit 1; }
-[[ "$N_RUN2" =~ ^[0-9]+$ ]] || { echo "ERROR: Could not parse stream count from $(basename "$DS_LOG_RUN2") — expected filename pattern ds_s<N>_run2.log"; exit 1; }
-RT_STREAMS=$N_RUN2
-
-# deepstream-app **PERF: format is `**PERF: fps_run0 (fps_avg0)  fps_run1 (fps_avg1)  ...`
-# Capture stream-0 instantaneous FPS (\K after `**PERF:`) — 1 value per line — so
-# tail -10 always covers exactly 10 measurement windows regardless of stream count.
-# Multiply by stream count for total throughput.
-FPS_RAW_RUN1=$(grep -oP '\*\*PERF:\s*\K[0-9.]+' "$DS_LOG_RUN1" | tail -10 | python3 -c "
-import sys; vals=[float(l) for l in sys.stdin if l.strip()]; print(round(sum(vals)/len(vals),2) if vals else 0)")
-FPS_RAW_RUN2=$(grep -oP '\*\*PERF:\s*\K[0-9.]+' "$DS_LOG_RUN2" | tail -10 | python3 -c "
-import sys; vals=[float(l) for l in sys.stdin if l.strip()]; print(round(sum(vals)/len(vals),2) if vals else 0)")
-
-TOTAL_FPS_RUN1=$(python3 -c "print(round(float('$FPS_RAW_RUN1') * $N_RUN1, 2))")
-TOTAL_FPS_RUN2=$(python3 -c "print(round(float('$FPS_RAW_RUN2') * $N_RUN2, 2))")
-
-echo "=== Report Variables ==="
-echo "MODEL_NAME=$MODEL_NAME  MAX_BS=$MAX_BS"
-echo "BS=1:       QPS=$QPS_BS1  GPU mean=${GPU_MEAN_BS1}ms"
-echo "BS=$MAX_BS: QPS=$QPS_BS_MAX  imgs/s=$IMGS_PER_SEC  PEAK_GPU_STREAMS=$PEAK_GPU_STREAMS"
-echo "DS Run 1:   FPS/stream=$FPS_RAW_RUN1  streams=$N_RUN1  total=$TOTAL_FPS_RUN1 img/s"
-echo "DS Run 2:   FPS/stream=$FPS_RAW_RUN2  streams=$N_RUN2  total=$TOTAL_FPS_RUN2 img/s  RT_STREAMS=$RT_STREAMS"
-```
-
-Then immediately write `benchmark_data.json` before generating charts (so charts can load it if needed):
-
-```bash
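-mkdir -p models/$MODEL_NAME/reports && export MODEL_NAME ENGINE MAX_BS QPS_BS1 GPU_MEAN_BS1 QPS_BS_MAX GPU_MEAN_BS_MAX GPU_P99_BS_MAX IMGS_PER_SEC PEAK_GPU_STREAMS N_RUN1 N_RUN2 TOTAL_FPS_RUN1 TOTAL_FPS_RUN2 FPS_RAW_RUN1 FPS_RAW_RUN2  # os.environ only sees exported variables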
-python3 << 'EOF'
-import json, os
-
-def to_num(v, cast=float):
-    """Return cast(v) or None if v is empty/invalid — prevents malformed JSON."""
-    try:
-        return cast(v) if v and str(v).strip() else None
-    except (ValueError, TypeError):
-        return None
-
-data = {
-    "model_name":       os.environ.get("MODEL_NAME", ""),
-    "engine":           os.environ.get("ENGINE", ""),
-    "max_bs":           to_num(os.environ.get("MAX_BS"), int),
-    "trtexec": {
-        "bs1":   {
-            "qps":         to_num(os.environ.get("QPS_BS1")),
-            "gpu_mean_ms": to_num(os.environ.get("GPU_MEAN_BS1"))
-        },
-        "bsmax": {
-            "qps":         to_num(os.environ.get("QPS_BS_MAX")),
-            "gpu_mean_ms": to_num(os.environ.get("GPU_MEAN_BS_MAX")),
-            "p99_ms":      to_num(os.environ.get("GPU_P99_BS_MAX")),
-            "imgs_per_sec": to_num(os.environ.get("IMGS_PER_SEC"))
-        }
-    },
-    "peak_gpu_streams": to_num(os.environ.get("PEAK_GPU_STREAMS"), int),
-    "deepstream": {
-        "run1": {
-            "streams":        to_num(os.environ.get("N_RUN1"), int),
-            "total_fps":      to_num(os.environ.get("TOTAL_FPS_RUN1")),
-            "fps_per_stream": to_num(os.environ.get("FPS_RAW_RUN1"))
-        },
-        "run2": {
-            "streams":        to_num(os.environ.get("N_RUN2"), int),
-            "total_fps":      to_num(os.environ.get("TOTAL_FPS_RUN2")),
-            "fps_per_stream": to_num(os.environ.get("FPS_RAW_RUN2"))
-        }
-    }
-}
-out_path = os.path.join("models", os.environ.get("MODEL_NAME", "unknown"),
-                        "reports", "benchmark_data.json")
-with open(out_path, "w") as f:
-    json.dump(data, f, indent=2)
-print("benchmark_data.json written")
-EOF
-```
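-> `<< 'EOF'` (quoted) prevents bash expansion, so Python reads every value via `os.environ.get()`. Each variable must therefore be exported (`export VAR`) before this step, because plain shell variables are not visible to the child `python3` process. `to_num()` converts values safely (an unset or invalid variable becomes `None`, serialized as `null`, instead of producing malformed JSON), and `json.dump` guarantees valid output.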
-
-## 8c-1: Chart Generation (MANDATORY)
-
-All Python scripts in this step run inside the **shared venv** at `build/.venv_optimum` (which holds `matplotlib`, `numpy`, `markdown`, and `onnxruntime`). Activate it once before running any report scripts:
-
-```bash
-source build/.venv_optimum/bin/activate
-```
-
-Generate exactly **5 charts** using `matplotlib` in `models/{model_name}/reports/charts/`. Use the script at `skills/deepstream-import-vision-model/scripts/report/generate-benchmark-charts.py` or generate manually. Chart names are fixed — do not rename them.
-
-| Filename | Content | Chart type |
-|----------|---------|------------|
-| `chart_trtexec_bs1_vs_bsmax.png` | Bar chart: QPS at BS=1 vs BS=MAX_BS (side by side) | Grouped bar |
-| `chart_trtexec_throughput.png` | GPU-only images/sec at MAX_BS, with PEAK_GPU_STREAMS annotation (dashed line at y=PEAK_GPU_STREAMS×30) | Single bar or line |
-| `chart_ds_streams_vs_fps.png` | Line chart: X=stream count (PEAK_GPU_STREAMS, RT_STREAMS), Y=FPS/stream. Red dashed line at 30fps threshold. | Line + markers |
-| `chart_trt_vs_ds.png` | Grouped bars: trtexec total imgs/s \| DS Run 1 total imgs/s \| DS Run 2 total imgs/s | Grouped bar |
-| `chart_efficiency.png` | DS efficiency %: 2 bars (Run 1 efficiency, Run 2 efficiency), dashed line at 100% | Bar |
-
-Do NOT generate H2D/D2H transfer overhead charts.
-
-Chart style requirements:
-- Figure size: `figsize=(10, 6)`, DPI: 150
-- Title: two-line format via `two_line_title(model_name, subtitle)` — model name on line 1, chart description on line 2 (prevents long titles from clipping outside figure bounds)
-- Axis labels: `fontsize=13`; bar value labels: bold, `fontsize` 12-13, positioned above bars (matplotlib font sizes are in points, not pixels)
-- Grid: `axis='y', alpha=0.3`; `plt.tight_layout()` before save
-- Use `matplotlib.use('Agg')` (no display needed)
-
-## 8c-1b: Markdown Report (MANDATORY)
-
-Generate `benchmark_report.md` before the HTML. This file must contain all 12 sections filled with actual values — no placeholders allowed.
-
-First, gather system info not already captured in pre-flight:
-
-```bash
-GPU_INFO=$(nvidia-smi --query-gpu=name,memory.total --format=csv,noheader | head -1)
-GPU_NAME=$(echo "$GPU_INFO" | cut -d, -f1 | xargs)
-GPU_VRAM=$(echo "$GPU_INFO" | cut -d, -f2 | xargs)
-DRIVER_VER=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader | head -1 | xargs)
-CUDA_VER=$(nvcc --version 2>/dev/null | grep -oP 'release \K[0-9.]+' || echo "N/A")
-TRT_VER=$(trtexec 2>&1 | head -3 | grep -oP 'TensorRT v\K[0-9.]+' || echo "N/A")
-DS_VER=$(deepstream-app --version-all 2>/dev/null | grep -oP 'DeepStreamSDK \K[0-9.]+' || echo "N/A")
-ENGINE_SIZE_MB=$(du -m "$ENGINE" | cut -f1)
-IMGS_PER_SEC_BS1=$(python3 -c "print(round(float('$QPS_BS1') * 1, 2))")
-GPU_P99_BS1=$(grep -oP 'GPU Compute Time:.*percentile\(99%\) = \K[0-9.]+' "$TRTEXEC_LOG_BS1" | tail -1)
-GPU_P99_BS1=${GPU_P99_BS1:-"N/A"}  # fallback if log too short to have P99
-EFFICIENCY_RUN1=$(python3 -c "print(round(float('$TOTAL_FPS_RUN1') / float('$IMGS_PER_SEC') * 100, 1))")
-EFFICIENCY_RUN2=$(python3 -c "print(round(float('$TOTAL_FPS_RUN2') / float('$IMGS_PER_SEC') * 100, 1))")
-RT_LABEL_RUN1=$(python3 -c "print('YES' if float('$FPS_RAW_RUN1') >= 30 else 'NO')")
-RT_LABEL_RUN2=$(python3 -c "print('YES' if float('$FPS_RAW_RUN2') >= 30 else 'NO')")
-```
-
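-Then write the markdown (use unquoted `<< MDEOF` so bash expands variables; every literal backtick in the body is escaped with a backslash so bash does not treat it as command substitution):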
-
-```bash
-cat > models/$MODEL_NAME/reports/benchmark_report.md << MDEOF
-# ${MODEL_NAME} Benchmark Report
-
-Generated: $(date '+%Y-%m-%d %H:%M:%S')
-
----
-
-## 1. Model Configuration
-
-| Parameter | Value |
-|-----------|-------|
-| **Model Name** | ${MODEL_NAME} |
-| **Source** | (fill from Steps 1-3 log) |
-| **Architecture** | (fill from config.json model_type) |
-| **ONNX Source** | models/${MODEL_NAME}/model/ |
-| **Precision** | FP16 |
-| **Engine File** | $(basename $ENGINE) |
-| **Engine Profile** | min=1x3x${H}x${W}  opt=${MAX_BS}x3x${H}x${W}  max=${MAX_BS}x3x${H}x${W} |
-| **Custom Parser** | libnvdsinfer_${MODEL_NAME}_parser.so |
-| **Cluster Mode** | (fill from nvinfer config) |
-
-## 2. System Configuration
-
-| Parameter | Value |
-|-----------|-------|
-| **GPU** | ${GPU_NAME} |
-| **VRAM** | ${GPU_VRAM} |
-| **Driver** | ${DRIVER_VER} |
-| **CUDA** | ${CUDA_VER} |
-| **TensorRT** | ${TRT_VER} |
-| **DeepStream** | ${DS_VER} |
-
-## 3. Preprocessing
-
-| Parameter | Value |
-|-----------|-------|
-| **net-scale-factor** | (fill from nvinfer config) |
-| **offsets** | (fill from nvinfer config) |
-| **Color Format** | (fill from nvinfer config) |
-| **Input Resolution** | ${W}×${H} |
-
-## 4. Engine Build Summary
-
-| Parameter | Value |
-|-----------|-------|
-| **Source Format** | ONNX |
-| **Engine File** | $(basename $ENGINE) |
-| **Engine Size** | ${ENGINE_SIZE_MB} MB |
-| **FP16** | Enabled |
-| **MAX Batch Size** | ${MAX_BS} |
-| **Workspace** | 32768 MiB |
-| **Timing Cache** | models/${MODEL_NAME}/benchmarks/engines/timing.cache |
-
-## 5. trtexec Results
-
-| Metric | BS=1 | BS=${MAX_BS} |
-|--------|------|------|
-| **QPS (queries/s)** | ${QPS_BS1} | ${QPS_BS_MAX} |
-| **Images/s** | ${IMGS_PER_SEC_BS1} | ${IMGS_PER_SEC} |
-| **GPU Compute Mean (ms)** | ${GPU_MEAN_BS1} | ${GPU_MEAN_BS_MAX} |
-| **GPU Compute P99 (ms)** | ${GPU_P99_BS1} | ${GPU_P99_BS_MAX} |
-
-> Note: H2D/D2H latency excluded — trtexec run with \`--noDataTransfers\` to match DeepStream (GPU-to-GPU data flow, no host transfers).
-
-![trtexec BS=1 vs BS=${MAX_BS}](charts/chart_trtexec_bs1_vs_bsmax.png)
-
-## 6. PEAK_GPU_STREAMS Derivation
-
-\`\`\`
-PEAK_GPU_STREAMS = floor(imgs_per_sec_at_MAX_BS / 30)
-                = floor(${IMGS_PER_SEC} / 30)
-                = ${PEAK_GPU_STREAMS} streams
-\`\`\`
-
-![trtexec throughput at BS=${MAX_BS}](charts/chart_trtexec_throughput.png)
-
-## 7. Single-Stream Validation
-
-| Parameter | Value |
-|-----------|-------|
-| **Video Source** | sample_720p.mp4 (1280×720) |
-| **KITTI Output Dir** | models/${MODEL_NAME}/samples/kitti_output/ |
-| **Total Frames** | (fill from kitti dump) |
-| **Frames with Detections** | (fill from kitti dump) |
-| **Detection Rate** | (fill — must be ≥ 90%) |
-| **Visual Capture Mode** | (fill: \`nvv4l2h264enc MP4\` OR \`theoraenc OGV (NVENC unavailable)\` OR \`skipped (no encoder available)\`) |
-| **Visual Capture Artifact** | (fill: \`samples/${MODEL_NAME}_output.mp4\` for NVENC path; \`samples/${MODEL_NAME}_output.ogv\` for theoraenc fallback; \`N/A\` if skipped) |
-| **Validation Result** | PASS |
-
-> **Encoder reporting rule (MANDATORY):** The Visual Capture Mode field MUST be exactly one of:
-> - \`nvv4l2h264enc MP4\`: NVENC succeeded; artifact is \`.mp4\`
-> - \`theoraenc OGV (NVENC unavailable)\`: if \`DS_SINGLE_STREAM_MODE=theoraenc-fallback\`; use \`.ogv\` path from \`DS_SINGLE_STREAM_OUTPUT=\`
-> - \`skipped (no encoder available)\`: if \`DS_SINGLE_STREAM_MODE=skipped\`; no artifact file
-> \`x264enc\` and \`openh264enc\` are prohibited and must never appear in this field.
-
-## 8. DeepStream Benchmark Results
-
-### DS Run 1 — Calibration at PEAK_GPU_STREAMS (${N_RUN1} streams)
-
-| Metric | Value |
-|--------|-------|
-| **Streams** | ${N_RUN1} |
-| **Batch Size** | ${N_RUN1} |
-| **FPS / Stream** | ${FPS_RAW_RUN1} |
-| **Total Images/s** | ${TOTAL_FPS_RUN1} |
-| **Real-Time (≥30 fps/stream)** | ${RT_LABEL_RUN1} |
-
-### DS Run 2 — Validation at RT_STREAMS (${N_RUN2} streams)
-
-| Metric | Value |
-|--------|-------|
-| **Streams** | ${N_RUN2} |
-| **Batch Size** | ${N_RUN2} |
-| **FPS / Stream** | ${FPS_RAW_RUN2} |
-| **Total Images/s** | ${TOTAL_FPS_RUN2} |
-| **Real-Time (≥30 fps/stream)** | ${RT_LABEL_RUN2} |
-
-![DeepStream FPS/stream vs stream count](charts/chart_ds_streams_vs_fps.png)
-
-## 9. trtexec vs DeepStream Comparison
-
-| Metric | trtexec BS=${MAX_BS} | DS Run 1 (${N_RUN1} streams) | DS Run 2 (${N_RUN2} streams) |
-|--------|---------------------|------------------------------|------------------------------|
-| **Engine** | $(basename $ENGINE) | $(basename $ENGINE) | $(basename $ENGINE) |
-| **Batch / Streams** | BS=${MAX_BS} | ${N_RUN1} streams | ${N_RUN2} streams |
-| **Total imgs/s** | ${IMGS_PER_SEC} | ${TOTAL_FPS_RUN1} | ${TOTAL_FPS_RUN2} |
-| **FPS / stream** | $(python3 -c "print(round(float('$IMGS_PER_SEC')/${MAX_BS},1))") | ${FPS_RAW_RUN1} | ${FPS_RAW_RUN2} |
-| **Real-Time ≥30fps** | YES | ${RT_LABEL_RUN1} | ${RT_LABEL_RUN2} |
-| **DS Efficiency %** | — | ${EFFICIENCY_RUN1}% | ${EFFICIENCY_RUN2}% |
-
-![trtexec vs DeepStream total throughput](charts/chart_trt_vs_ds.png)
-
-## 10. Efficiency Analysis
-
-\`\`\`
-DS Efficiency = DS_total_imgs_per_sec / trtexec_imgs_per_sec × 100
-Run 1: ${TOTAL_FPS_RUN1} / ${IMGS_PER_SEC} × 100 = ${EFFICIENCY_RUN1}%
-Run 2: ${TOTAL_FPS_RUN2} / ${IMGS_PER_SEC} × 100 = ${EFFICIENCY_RUN2}%
-\`\`\`
-
-Efficiency gap breakdown: NVDEC decode overhead (~5-10%), GStreamer mux/queue overhead (~5-10%), CPU scheduler jitter (~2-5%).
-
-Interpretation notes for the numbers above:
-
-- **Well-balanced pipeline**: GPU=99-100%, NVDEC=99-100%, CPU=30-40% with no single core pinned. The ~50% DS/trtexec gap at this utilization is physically irreducible — it's the cost of real decode + memory transfers that trtexec skips with \`--noDataTransfers\`.
-- **DS efficiency above 100% is expected for ViT / transformer models**: the TRT compiler backend (opt-level 4) often produces bimodal GPU latency with two alternating execution paths (e.g., 1.5ms and 4.0ms modes for OWL-ViT). trtexec reports high variance and a conservative median; DeepStream's pipelined scheduling smooths the bimodal pattern and can achieve 100-110% of the trtexec baseline. This is not a measurement error.
-- **1080p tends to saturate NVDEC** while GPU has headroom. The pipeline is pinned to 720p (\`sample_720p.mp4\`) specifically to keep benchmarks comparable across models.
-
-![DeepStream efficiency vs trtexec baseline](charts/chart_efficiency.png)
-
-## 11. Pipeline Timing
-
-| Step | Description | Duration |
-|------|-------------|----------|
-| 1-3 | HF Model Acquire (download + inspect ONNX) | (fill from step timing) |
-| 4 | Engine build | (fill from step timing) |
-| 5 | trtexec BS=1 + BS=${MAX_BS} | (fill from step timing) |
-| 6 | Parser + config + visual validation + KITTI | (fill from step timing) |
-| 7 Run 1 | DS Calibration (${N_RUN1} streams) | (fill from step timing) |
-| 7 Run 2 | DS Validation (${N_RUN2} streams) | (fill from step timing) |
-| 8 | Report generation | (fill) |
-| **Total** | **End-to-end** | **(fill)** |
-
-## 12. Reference Commands
-
-### Engine Build
-\`\`\`bash
-trtexec --onnx=models/${MODEL_NAME}/model/${MODEL_FILENAME}.onnx \\
-  --saveEngine=models/${MODEL_NAME}/benchmarks/engines/${MODEL_FILENAME}_dynamic_b${MAX_BS}.engine \\
-  --minShapes=${INPUT_NAME}:1x3x${H}x${W} \\
-  --optShapes=${INPUT_NAME}:${MAX_BS}x3x${H}x${W} \\
-  --maxShapes=${INPUT_NAME}:${MAX_BS}x3x${H}x${W} \\
-  --fp16 --memPoolSize=workspace:32768M \\
-  --timingCacheFile=models/${MODEL_NAME}/benchmarks/engines/timing.cache
-\`\`\`
-
-### trtexec Benchmark
-\`\`\`bash
-# BS=1
-trtexec --loadEngine=$(basename $ENGINE) --shapes=${INPUT_NAME}:1x3x${H}x${W} \\
-  --noDataTransfers --warmUp=1000 --duration=10
-
-# BS=${MAX_BS}
-trtexec --loadEngine=$(basename $ENGINE) --shapes=${INPUT_NAME}:${MAX_BS}x3x${H}x${W} \\
-  --noDataTransfers --warmUp=1000 --duration=10
-\`\`\`
-
-### DeepStream Single-Stream Validation
-\`\`\`bash
-# See models/${MODEL_NAME}/scripts/ for full gst-launch-1.0 command
-\`\`\`
-
-### DeepStream Multi-Stream Benchmark
-\`\`\`bash
-# DS Run 1: ${N_RUN1} streams — see models/${MODEL_NAME}/scripts/
-# DS Run 2: ${N_RUN2} streams — see models/${MODEL_NAME}/scripts/
-\`\`\`
-
-### Custom Parser Build
-\`\`\`bash
-cd models/${MODEL_NAME}/parser && make DEEPSTREAM_DIR=/opt/nvidia/deepstream/deepstream CUDA_VER=12
-\`\`\`
-MDEOF
-echo "benchmark_report.md written: $(wc -l < models/$MODEL_NAME/reports/benchmark_report.md) lines"
-```
-
-> **Note on "fill" fields**: Fields marked `(fill from ...)` must be replaced with actual values from the step logs before finalizing. Search the step output logs for the exact values and substitute them. Do not leave any `(fill ...)` placeholder in the final report.
-
-## 8c-2 + 8c-3: HTML + PDF Report (MANDATORY — ONE COMMAND)
-
-Before generating HTML+PDF, verify all 5 charts exist:
-
-```bash
-CHART_DIR="models/$MODEL_NAME/reports/charts"
-MISSING_CHARTS=0
-for CHART in chart_trtexec_bs1_vs_bsmax.png chart_trtexec_throughput.png \
-             chart_ds_streams_vs_fps.png chart_trt_vs_ds.png chart_efficiency.png; do
-  [ ! -f "$CHART_DIR/$CHART" ] && { echo "ERROR: Missing $CHART_DIR/$CHART"; MISSING_CHARTS=$((MISSING_CHARTS+1)); }
-done
-[ "$MISSING_CHARTS" -gt 0 ] && { echo "ERROR: $MISSING_CHARTS chart(s) missing — re-run 8c-1"; exit 1; }
-echo "All 5 charts verified OK"
-```
-
-Then run the canonical pipeline script — this generates BOTH the HTML and PDF correctly:
-
-```bash
-python3 skills/deepstream-import-vision-model/scripts/report/md-to-html-pdf.py \
-  models/$MODEL_NAME/reports/benchmark_report.md \
-  skills/deepstream-import-vision-model/scripts/report/report-style.css \
-  models/$MODEL_NAME/reports/ \
-  $MODEL_NAME
-```
-
-This script uses `report-style.css` (navy `#283593` headers, `#e8eaf6` rows, `#263238` code blocks), embeds charts as base64 data URIs, calls `wkhtmltopdf` internally, and outputs `benchmark_report.html` + `benchmark_report_{model_name}.pdf`.
-
-> **NAMING RULES:**
-> - HTML: always `benchmark_report.html` (no model name suffix)
-> - PDF: always `benchmark_report_{model_name}.pdf` (model name postfix required)
-
-Verify PDF size is >500 KB (confirms charts embedded). Run all python commands with the shared venv active (`source build/.venv_optimum/bin/activate`); `markdown` and `matplotlib` are already installed there.
-
-## 8c-4: Final Report Checklist and Timing
-
-After generating markdown, HTML, and PDF, record step timing:
-
-```bash
-STEP8_END=$(date +%s.%N)
-STEP8_DURATION=$(echo "$STEP8_END - $STEP8_START" | bc)
-echo "[Step 8] Report generation completed in ${STEP8_DURATION}s"
-```
-
-Before marking the report as complete, verify ALL of these exist:
-- [ ] `reports/benchmark_report.md` — markdown source (12 sections)
-- [ ] `reports/benchmark_report.html` — styled HTML (charts/ alongside)
-- [ ] `reports/benchmark_report_{model_name}.pdf` — PDF >500 KB (confirms charts embedded)
-- [ ] `reports/benchmark_data.json` — raw benchmark numbers
-- [ ] `reports/charts/` — all 5 PNGs: `chart_trtexec_bs1_vs_bsmax.png`, `chart_trtexec_throughput.png`, `chart_ds_streams_vs_fps.png`, `chart_trt_vs_ds.png`, `chart_efficiency.png`
-- **Charts**: fixed filenames above — never rename or add model name suffix to charts
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/benchmark-ds.sh b/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/benchmark-ds.sh
deleted file mode 100755
index 6103bbce..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/benchmark-ds.sh
+++ /dev/null
@@ -1,93 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-set -euo pipefail
-################################################################################
-# DeepStream benchmark using gst-launch-1.0
-# Rule of thumb: batch_size == num_streams (always equal).
-# Measures total throughput by timing full video processing with fakesink.
-#
-# Usage: ./benchmark-ds.sh <config_file> <num_streams> [input_video]
-# Example: ./benchmark-ds.sh config_infer_primary_b21.txt 21 video.mp4
-#
-# batch_size in the nvinfer config must match num_streams.
-################################################################################
-
-CONFIG="${1:-}"
-NUM_STREAMS="${2:-}"
-VIDEO="${3:-/opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4}"
-MUXER_W=1280
-MUXER_H=720
-NS_PER_SEC=$(( 1000 * 1000 * 1000 ))
-
-if [ -z "$CONFIG" ] || [ -z "$NUM_STREAMS" ]; then
-    echo "Usage: $0 <config_file> <num_streams> [input_video]"
-    exit 1
-fi
-
-# Detect video FPS via mediainfo; fall back to 30 if mediainfo is missing or reports no usable rate (|| true keeps set -e/pipefail from aborting)
-VIDEO_FPS=$(mediainfo --Inform="Video;%FrameRate%" "${VIDEO}" 2>/dev/null | awk '$1+0 > 0 {printf "%.0f", $1}' || true)
-VIDEO_FPS="${VIDEO_FPS:-30}"
-
-# Detect actual frame count; fall back to 1440 if mediainfo unavailable or fails
-if [ -n "${3:-}" ]; then
-    FRAMES_PER_STREAM=$(mediainfo --Inform="Video;%FrameCount%" "${VIDEO}" 2>/dev/null || true)
-    if ! echo "$FRAMES_PER_STREAM" | grep -qE '^[0-9]+$' || [ "$FRAMES_PER_STREAM" -eq 0 ]; then
-        echo "Warning: mediainfo failed, falling back to 1440 frames" >&2
-        FRAMES_PER_STREAM=1440
-    fi
-else
-    # Default sample_720p.mp4 is ~1440 frames at 30fps
-    FRAMES_PER_STREAM=1440
-fi
-TOTAL_FRAMES=$((FRAMES_PER_STREAM * NUM_STREAMS))
-
-echo "=== DeepStream Benchmark ==="
-echo "Config:  $CONFIG"
-echo "Streams: $NUM_STREAMS"
-echo "Frames/stream: $FRAMES_PER_STREAM"
-echo "Total frames:  $TOTAL_FRAMES"
-echo ""
-
-# Build source elements
-SOURCES=""
-for i in $(seq 0 $((NUM_STREAMS - 1))); do
-    SOURCES+="filesrc location=${VIDEO} ! qtdemux ! queue ! h264parse ! queue ! nvv4l2decoder ! queue ! mux.sink_${i} "
-done
-
-PIPELINE="${SOURCES} nvstreammux name=mux batch-size=${NUM_STREAMS} width=${MUXER_W} height=${MUXER_H} batched-push-timeout=-1 ! \
-    queue ! nvinfer config-file-path=${CONFIG} ! queue ! fakesink sync=0"
-
-echo "Starting pipeline..."
-START_TIME=$(date +%s%N)
-
-GST_DEBUG=0 gst-launch-1.0 -e ${PIPELINE} 2>&1 | grep -v "^$" || true
-
-END_TIME=$(date +%s%N)
-ELAPSED_NS=$((END_TIME - START_TIME))
-ELAPSED_SEC=$(echo "scale=2; $ELAPSED_NS / $NS_PER_SEC" | bc)
-FPS=$(echo "scale=1; $TOTAL_FRAMES / $ELAPSED_SEC" | bc)
-REALTIME=$(echo "scale=2; $FPS / (${NUM_STREAMS} * ${VIDEO_FPS})" | bc)
-
-echo ""
-echo "=== Results ==="
-echo "Wall time:     ${ELAPSED_SEC}s"
-echo "Total frames:  ${TOTAL_FRAMES}"
-echo "Throughput:    ${FPS} img/s"
-echo "Per-stream:    $(echo "scale=1; $FPS / $NUM_STREAMS" | bc) fps"
-echo "Real-time factor: ${REALTIME}x (${NUM_STREAMS} streams @ ${VIDEO_FPS}fps)"
-echo "==============="
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-kitti-dump.sh b/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-kitti-dump.sh
deleted file mode 100755
index 8321686d..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-kitti-dump.sh
+++ /dev/null
@@ -1,151 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-################################################################################
-# Step 6: KITTI dump using deepstream-app (built-in KITTI support)
-# Generates a temporary deepstream-app config, runs for N frames, dumps KITTI.
-#
-# Usage: ./ds-kitti-dump.sh <nvinfer_config> <kitti_output_dir> [num_frames] [input_video]
-# Example: ./ds-kitti-dump.sh config_infer_primary_yolox.txt kitti_output 100
-################################################################################
-set -euo pipefail
-
-NVINFER_CONFIG="${1:-}"
-KITTI_DIR="${2:-}"
-NUM_FRAMES="${3:-100}"
-VIDEO="${4:-/opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4}"
-
-if [ -z "$NVINFER_CONFIG" ] || [ -z "$KITTI_DIR" ]; then
-    echo "Usage: $0 <nvinfer_config> <kitti_output_dir> [num_frames] [input_video]"
-    exit 1
-fi
-
-# Validate inputs before resolving paths
-[ -f "$NVINFER_CONFIG" ] || { echo "ERROR: nvinfer config not found: $NVINFER_CONFIG"; exit 1; }
-[ -f "$VIDEO" ] || { echo "ERROR: video file not found: $VIDEO"; exit 1; }
-
-# Resolve to absolute paths
-NVINFER_CONFIG="$(realpath "$NVINFER_CONFIG")"
-KITTI_DIR="$(realpath -m "$KITTI_DIR")"
-VIDEO="$(realpath "$VIDEO")"
-
-mkdir -p "${KITTI_DIR}"
-
-echo "=== DeepStream KITTI Dump ==="
-echo "nvinfer config: $NVINFER_CONFIG"
-echo "KITTI dir:      $KITTI_DIR"
-echo "Max frames:     $NUM_FRAMES"
-echo "Input video:    $VIDEO"
-echo ""
-
-# Generate temporary deepstream-app config
-trap 'rm -f "${TMPCONFIG:-}"' EXIT
-TMPCONFIG=$(mktemp /tmp/ds_kitti_XXXXXX.txt)
-
-cat > "$TMPCONFIG" << EOF
-[application]
-enable-perf-measurement=0
-gie-kitti-output-dir=${KITTI_DIR}
-
-[tiled-display]
-enable=0
-
-[source0]
-enable=1
-type=3
-uri=file://${VIDEO}
-num-sources=1
-gpu-id=0
-
-[sink0]
-enable=1
-type=1
-#1=FakeSink
-sync=0
-
-[osd]
-enable=0
-
-[streammux]
-live-source=0
-batch-size=1
-batched-push-timeout=-1
-width=1280
-height=720
-
-[primary-gie]
-enable=1
-batch-size=1
-gie-unique-id=1
-config-file=${NVINFER_CONFIG}
-
-[tests]
-file-loop=0
-EOF
-
-echo "Temp config: $TMPCONFIG"
-echo "Running deepstream-app..."
-
-# Run deepstream-app (it will process entire video).
-# Temporarily disable pipefail so head -30 closing the pipe early (SIGPIPE to grep)
-# doesn't trigger set -e before we can capture deepstream-app's exit code.
-set +o pipefail
-timeout 120 deepstream-app -c "$TMPCONFIG" 2>&1 | grep -v "^$" | head -30
-DS_EXIT_CODE=${PIPESTATUS[0]}
-set -o pipefail
-
-if [ $DS_EXIT_CODE -eq 124 ]; then
-    echo "Warning: deepstream-app timed out after 120 seconds"
-elif [ $DS_EXIT_CODE -ne 0 ]; then
-    echo "Error: deepstream-app failed with exit code $DS_EXIT_CODE"
-    exit 1
-fi
-
-# Count KITTI files generated
-TOTAL_FILES=$(find "$KITTI_DIR" -maxdepth 1 -type f -name '*.txt' 2>/dev/null | wc -l)
-echo ""
-echo "Total KITTI files generated: ${TOTAL_FILES}"
-
-# Keep only first N frames, remove the rest
-if [ "$TOTAL_FILES" -gt "$NUM_FRAMES" ]; then
-    # Guard against misconfigured KITTI_DIR blowing away something else
-    [ -n "$KITTI_DIR" ] && [ -d "$KITTI_DIR" ] && [ "$KITTI_DIR" != "/" ] \
-        || { echo "ERROR: invalid KITTI_DIR for cleanup: $KITTI_DIR"; exit 1; }
-    TO_REMOVE=$((TOTAL_FILES - NUM_FRAMES))
-    echo "Trimming to first ${NUM_FRAMES} frames (removing ${TO_REMOVE})..."
-    # NUL-delimited read so filenames with spaces/newlines are handled safely.
-    KITTI_FILES=()
-    while IFS= read -r -d '' f; do
-        KITTI_FILES+=("$f")
-    done < <(find "$KITTI_DIR" -maxdepth 1 -type f -name '*.txt' -print0 | sort -z)
-    for ((i = NUM_FRAMES; i < ${#KITTI_FILES[@]}; i++)); do
-        rm -f -- "${KITTI_FILES[i]}"
-    done
-    TOTAL_FILES=$(find "$KITTI_DIR" -maxdepth 1 -type f -name '*.txt' 2>/dev/null | wc -l)
-    echo "Kept ${TOTAL_FILES} KITTI files"
-fi
-
-# Show sample KITTI output
-echo ""
-echo "=== Sample KITTI Output (first 3 files) ==="
-for f in $(ls -1 "${KITTI_DIR}"/*.txt 2>/dev/null | sort | head -3); do
-    echo "--- $(basename "$f") ---"
-    cat "$f"
-done
-
-echo ""
-echo "=== KITTI Dump Complete ==="
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-perf-run.sh b/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-perf-run.sh
deleted file mode 100755
index d4cda080..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-perf-run.sh
+++ /dev/null
@@ -1,154 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-################################################################################
-# Step 7c: DeepStream perf-measurement run via deepstream-app.
-#
-# Replaces the older `gst-launch-1.0 ... ! fpsdisplaysink ...` benchmark, which
-# pulled in `gstreamer1.0-plugins-bad`. `deepstream-app` is part of the NVIDIA
-# DeepStream SDK and emits `**PERF: fps_run0 (fps_avg0)  fps_run1 (fps_avg1)  ...`
-# lines (one pair per active source) that the report-generation phase parses.
-#
-# Usage: ./ds-perf-run.sh <nvinfer_config> <num_streams> <log_path> [input_video]
-# Example:
-#   ./ds-perf-run.sh config_infer_ds_yolox.txt 32 \
-#       models/yolox/benchmarks/ds/ds_s32_run1.log \
-#       /opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4
-#
-# Notes:
-#   - `[primary-gie] batch-size` and `[streammux] batch-size` are both set to N
-#     (matches the skill-wide rule batch_size == num_streams).
-#   - `num-sources=N` fans the single input video out to N pipeline sources;
-#     deepstream-app handles the file-loop / EOS bookkeeping.
-#   - The nvinfer config must already point at the engine, parser, and labels.
-#     This script does NOT mutate the nvinfer config.
-################################################################################
-set -euo pipefail
-
-NVINFER_CONFIG="${1:-}"
-NUM_STREAMS="${2:-}"
-LOG_PATH="${3:-}"
-VIDEO="${4:-/opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4}"
-
-if [ -z "$NVINFER_CONFIG" ] || [ -z "$NUM_STREAMS" ] || [ -z "$LOG_PATH" ]; then
-    echo "Usage: $0 <nvinfer_config> <num_streams> <log_path> [input_video]"
-    exit 1
-fi
-
-[ -f "$NVINFER_CONFIG" ] || { echo "ERROR: nvinfer config not found: $NVINFER_CONFIG"; exit 1; }
-[ -f "$VIDEO" ] || { echo "ERROR: video file not found: $VIDEO"; exit 1; }
-command -v deepstream-app >/dev/null 2>&1 || { echo "ERROR: deepstream-app not on PATH"; exit 1; }
-
-NVINFER_CONFIG="$(realpath "$NVINFER_CONFIG")"
-VIDEO="$(realpath "$VIDEO")"
-LOG_PATH="$(realpath -m "$LOG_PATH")"
-mkdir -p "$(dirname "$LOG_PATH")"
-
-N="$NUM_STREAMS"
-MUXER_W=1280
-MUXER_H=720
-
-echo "=== DeepStream Perf Run ==="
-echo "nvinfer config: $NVINFER_CONFIG"
-echo "Streams (=N):   $N"
-echo "Input video:    $VIDEO"
-echo "Log path:       $LOG_PATH"
-echo ""
-
-trap 'rm -f "${TMPCONFIG:-}"' EXIT
-TMPCONFIG=$(mktemp /tmp/ds_perf_XXXXXX.txt)
-
-cat > "$TMPCONFIG" <<EOF
-[application]
-enable-perf-measurement=1
-perf-measurement-interval-sec=2
-
-[tiled-display]
-enable=0
-
-[source0]
-enable=1
-type=3
-uri=file://${VIDEO}
-num-sources=${N}
-gpu-id=0
-
-[sink0]
-enable=1
-type=1
-sync=0
-
-[osd]
-enable=0
-
-[streammux]
-live-source=0
-batch-size=${N}
-batched-push-timeout=-1
-width=${MUXER_W}
-height=${MUXER_H}
-
-[primary-gie]
-enable=1
-batch-size=${N}
-gie-unique-id=1
-config-file=${NVINFER_CONFIG}
-
-[tests]
-file-loop=1
-EOF
-
-echo "Temp config: $TMPCONFIG"
-echo "Running deepstream-app..."
-
-set +o pipefail
-# file-loop=1 has no built-in stop condition; timeout(1) kills deepstream-app
-# after 60 s and returns exit 124 — treated as success below.
-timeout 60s deepstream-app -c "$TMPCONFIG" 2>&1 | tee "$LOG_PATH"
-DS_EXIT_CODE=${PIPESTATUS[0]}
-set -o pipefail
-
-# exit 124 = timeout fired as expected (file-loop=1, 60 s cap)
-if [ $DS_EXIT_CODE -ne 0 ] && [ $DS_EXIT_CODE -ne 124 ]; then
-    echo "ERROR: deepstream-app exited with code $DS_EXIT_CODE — see $LOG_PATH" >&2
-    exit "$DS_EXIT_CODE"
-fi
-
-# Average stream-0 instantaneous FPS across the last 10 **PERF: lines.
-# Using stream 0 (the \K capture after `**PERF:`) gives exactly 1 value per
-# measurement window so tail -10 always covers 10 windows regardless of N.
-# Multiply by N for total throughput.
-PERF_FPS=$(grep -oP '\*\*PERF:\s*\K[0-9.]+' "$LOG_PATH" | tail -10 | python3 -c "
-import sys
-vals = [float(line) for line in sys.stdin if line.strip()]
-print(round(sum(vals)/len(vals), 2) if vals else 0)
-")
-
-if [ -z "$PERF_FPS" ] || [ "$PERF_FPS" = "0" ]; then
-    echo "ERROR: no **PERF: lines parsed from $LOG_PATH" >&2
-    exit 1
-fi
-
-TOTAL_FPS=$(python3 -c "print(round(float('$PERF_FPS') * $N, 2))")
-
-echo ""
-echo "=== Perf Summary ==="
-echo "Streams:        $N"
-echo "FPS/stream:     $PERF_FPS"
-echo "Total imgs/sec: $TOTAL_FPS"
-echo "Log:            $LOG_PATH"
-echo "===================="
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-single-stream.sh b/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-single-stream.sh
deleted file mode 100755
index f8b5546d..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-single-stream.sh
+++ /dev/null
@@ -1,120 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-################################################################################
-# Step 6: Single-stream DeepStream pipeline -- saves output video with OSD boxes.
-#
-# Usage: ./ds-single-stream.sh <config_file> <output_video> [input_video]
-# Example: ./ds-single-stream.sh config_infer_primary_yolox.txt yolox_output.mp4
-#
-# Encoder policy (MANDATORY):
-#   - Primary path uses nvv4l2h264enc (NVENC) with .mp4 container. nvdsosd
-#     overlays are reliably preserved only with NVENC on the NVMM memory path.
-#   - x264enc and openh264enc are PROHIBITED and must never be used.
-#   - On NVENC-init failure, the script checks theoraenc + oggmux availability
-#     (LGPL elements; both ship in gst-plugins-base):
-#       * Available  → falls back to theoraenc+oggmux → saves <output>.ogv
-#           nvvideoconvert ! "video/x-raw, format=I420" ! theoraenc quality=48 ! oggmux
-#         Emits DS_SINGLE_STREAM_MODE=theoraenc-fallback and DS_SINGLE_STREAM_OUTPUT=<path>
-#       * Unavailable → skips video creation, emits DS_SINGLE_STREAM_MODE=skipped, exit 0
-#     The benchmark report must surface which encoder mode was used.
-################################################################################
-
-set -o pipefail
-
-CONFIG="$1"
-OUTPUT="$2"
-VIDEO="${3:-/opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4}"
-MUXER_W=1280
-MUXER_H=720
-
-if [ -z "$CONFIG" ] || [ -z "$OUTPUT" ]; then
-    echo "Usage: $0 <config_file> <output_video> [input_video]"
-    exit 1
-fi
-
-OUTPUT_DIR="$(dirname "$OUTPUT")"
-LOG_FILE="$(mktemp -t ds-single-stream-XXXXXX.log)"
-trap 'rm -f "$LOG_FILE"' EXIT
-
-mkdir -p "$OUTPUT_DIR"
-
-echo "=== DeepStream Single-Stream Test ==="
-echo "Config: $CONFIG"
-echo "Input:  $VIDEO"
-echo "Output: $OUTPUT (primary: nvv4l2h264enc)"
-echo ""
-
-gst-launch-1.0 \
-    filesrc location="${VIDEO}" ! qtdemux ! queue ! h264parse ! queue ! nvv4l2decoder ! queue ! mux.sink_0 \
-    nvstreammux name=mux batch-size=1 width=${MUXER_W} height=${MUXER_H} batched-push-timeout=-1 ! \
-    nvinfer config-file-path="${CONFIG}" ! \
-    nvvideoconvert ! nvdsosd ! nvvideoconvert ! \
-    "video/x-raw(memory:NVMM), format=NV12" ! nvv4l2h264enc ! h264parse ! mp4mux ! \
-    filesink location="${OUTPUT}" sync=0 \
-    2>&1 | tee "$LOG_FILE"
-STATUS=${PIPESTATUS[0]}
-
-if [ $STATUS -eq 0 ] && [ -s "$OUTPUT" ]; then
-    echo ""
-    echo "Output saved to: ${OUTPUT}"
-    echo "DS_SINGLE_STREAM_MODE=nvenc-primary"
-    echo "DS_SINGLE_STREAM_OUTPUT=${OUTPUT}"
-    exit 0
-fi
-
-# Detect NVENC-init failure -- the only condition under which we use the theoraenc fallback.
-# x264enc and openh264enc are prohibited. Any other failure surfaces as a hard error.
-if grep -qE "v4l2-nvenc.*failed during initialization|Could not open device.*v4l2-nvenc|nvv4l2h264enc.*not-negotiated" "$LOG_FILE"; then
-    echo ""
-    echo "WARNING: nvv4l2h264enc (NVENC) is unavailable on this GPU." >&2
-
-    if ! gst-inspect-1.0 theoraenc > /dev/null 2>&1 || ! gst-inspect-1.0 oggmux > /dev/null 2>&1; then
-        echo "WARNING: theoraenc/oggmux not available. Skipping video creation." >&2
-        echo "DS_SINGLE_STREAM_MODE=skipped"
-        exit 0
-    fi
-
-    echo "         Falling back to theoraenc+oggmux (OGV output)." >&2
-    echo ""
-    OGV_OUTPUT="$(echo "${OUTPUT}" | sed -E 's/\.[Mm][Pp]4$//').ogv"
-    rm -f "$OUTPUT" "$OGV_OUTPUT"
-
-    gst-launch-1.0 \
-        filesrc location="${VIDEO}" ! qtdemux ! queue ! h264parse ! queue ! nvv4l2decoder ! queue ! mux.sink_0 \
-        nvstreammux name=mux batch-size=1 width=${MUXER_W} height=${MUXER_H} batched-push-timeout=-1 ! \
-        nvinfer config-file-path="${CONFIG}" ! \
-        nvvideoconvert ! nvdsosd ! nvvideoconvert ! \
-        "video/x-raw, format=I420" ! theoraenc quality=48 ! oggmux ! \
-        filesink location="${OGV_OUTPUT}" sync=0 \
-        2>&1
-    THEORA_STATUS=$?
-
-    if [ $THEORA_STATUS -eq 0 ] && [ -s "$OGV_OUTPUT" ]; then
-        echo ""
-        echo "theoraenc fallback succeeded. Output saved to: ${OGV_OUTPUT}"
-        echo "DS_SINGLE_STREAM_MODE=theoraenc-fallback"
-        echo "DS_SINGLE_STREAM_OUTPUT=${OGV_OUTPUT}"
-        exit 0
-    fi
-
-    echo "ERROR: theoraenc fallback pipeline failed (exit ${THEORA_STATUS})." >&2
-    exit $(( THEORA_STATUS != 0 ? THEORA_STATUS : 1 ))
-fi
-
-echo "ERROR: pipeline failed or produced no output (exit code $STATUS; not an NVENC-init failure)." >&2
-exit $(( STATUS != 0 ? STATUS : 1 ))
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-sweep.sh b/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-sweep.sh
deleted file mode 100755
index e486d557..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/ds-sweep.sh
+++ /dev/null
@@ -1,278 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-################################################################################
-# DeepStream BS_OPT sweep — smart 2-phase approach.
-#
-# Phase 1: trtexec probe at BS=1,4,8 (~30s total, fast)
-#   - Fits power-law curve: QPS = a × BS^(-alpha)
-#   - Predicts BS where trtexec QPS = FPS_THRESHOLD / DS_EFFICIENCY
-#   - This accounts for DeepStream pipeline overhead vs raw trtexec
-#
-# Phase 2: DeepStream confirmation (1-2 runs)
-#   - Runs DS at BS_pred and BS_pred-step if needed
-#   - Picks highest BS where DS fps/stream >= FPS_THRESHOLD
-#   - Uses dynamic engine (no per-BS engine builds during sweep)
-#
-# Rules of thumb:
-#   - batch_size == num_streams (always equal)
-#   - Dynamic engine: min=1, opt=10, max=max(PROBE_SIZES) e.g. 8
-#     Extended at build time to max=BS_pred+margin once predicted
-#   - BS_OPT drives production engine build (static, timing cache reuse)
-#
-# Usage:
-#   ./ds-sweep.sh <dynamic_engine> <onnx_path> <config_template> \
-#                 <parser_so> <labels> <engines_dir> <configs_dir> [video]
-################################################################################
-set -euo pipefail
-
-DYNAMIC_ENGINE="$1"
-ONNX_PATH="$2"
-CONFIG_TEMPLATE="$3"
-PARSER_SO="$4"
-LABELS="$5"
-ENGINES_DIR="$6"
-CONFIGS_DIR="$7"
-VIDEO="${8:-/opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.mp4}"
-
-# Derive INPUT_NAME, H, W from the ONNX model — mirrors how engine-build.md does it.
-# Env var overrides let callers handle models with dynamic spatial dims (e.g. H=800 W=800 ./ds-sweep.sh ...).
-SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
-INSPECT_SCRIPT="$(realpath "${SCRIPT_DIR}/../../model/inspect-onnx.py")"
-if [ -z "${INPUT_NAME:-}" ] || [ -z "${H:-}" ] || [ -z "${W:-}" ]; then
-    INSPECT_OUT=$(python3 "${INSPECT_SCRIPT}" "${ONNX_PATH}")
-    INPUT_NAME="${INPUT_NAME:-$(echo "${INSPECT_OUT}" | grep -oP 'input_name:\s*\K\S+')}"
-    H="${H:-$(echo "${INSPECT_OUT}" | grep -oP 'height:\s*\K[0-9]+')}"
-    W="${W:-$(echo "${INSPECT_OUT}" | grep -oP 'width:\s*\K[0-9]+')}"
-fi
-[ -z "${INPUT_NAME}" ] && { echo "ERROR: could not parse INPUT_NAME from ONNX — set INPUT_NAME env var"; exit 1; }
-[ -z "${H}" ] && { echo "ERROR: H not detected (dynamic spatial dims) — set H env var, e.g. H=800"; exit 1; }
-[ -z "${W}" ] && { echo "ERROR: W not detected (dynamic spatial dims) — set W env var, e.g. W=800"; exit 1; }
-
-# DS_ERR_LOG: destination for GStreamer/DeepStream stderr output.
-# Override via environment variable to redirect elsewhere (e.g. a file path or /dev/stderr).
-# Defaults to a log file alongside the sweep engine logs so errors are preserved for diagnosis.
-DS_ERR_LOG="${DS_ERR_LOG:-${ENGINES_DIR}/ds_sweep_gst_errors.log}"
-mkdir -p "$(dirname "${DS_ERR_LOG}")"
-# Truncate/create the log at the start of the sweep so it reflects the current run only.
-: > "${DS_ERR_LOG}"
-echo "[ds-sweep] GStreamer stderr → ${DS_ERR_LOG}"
-
-# TIMING_CACHE="${ENGINES_DIR}/timing.cache"  # used by caller (nv-engine-build), not sweep
-NS_PER_SEC=$(( 1000 * 1000 * 1000 ))  # nanoseconds per second (date +%s%N divisor)
-FPS_THRESHOLD=30          # target fps/stream in DeepStream
-DS_EFFICIENCY=0.65        # DS is ~65% of trtexec throughput (GStreamer pipeline overhead
-                          # includes muxer, memory mgmt, custom parser, metadata — measured)
-TRT_QPS_TARGET=$(echo "scale=4; ${FPS_THRESHOLD} / ${DS_EFFICIENCY}" | bc)  # ~46.2 QPS
-PROBE_SIZES=(1 4 8)       # fast trtexec probe batch sizes
-PROBE_DURATION=10         # seconds per trtexec probe run
-# NEVER use filesrc num-buffers as a frame count — num-buffers counts file byte blocks (4096B),
-# not video frames. Leave num-buffers unset so filesrc reads to natural EOS.
-# Detect actual frame count and FPS via mediainfo — consistent with benchmark-ds.sh.
-VIDEO_FPS=$(mediainfo --Inform="Video;%FrameRate%" "${VIDEO}" 2>/dev/null | awk '{printf "%.0f", $1+0}')
-VIDEO_FPS="${VIDEO_FPS:-30}"
-ACTUAL_FRAMES_PER_STREAM=$(mediainfo --Inform="Video;%FrameCount%" "${VIDEO}" 2>/dev/null)
-if ! echo "${ACTUAL_FRAMES_PER_STREAM}" | grep -qE '^[0-9]+$' || [ "${ACTUAL_FRAMES_PER_STREAM:-0}" -eq 0 ]; then
-    ACTUAL_FRAMES_PER_STREAM=1440   # fallback for sample_720p.mp4: ~48s × 30fps
-fi
-echo "  Video frames/stream: ${ACTUAL_FRAMES_PER_STREAM} (${VIDEO_FPS}fps detected)"
-MUXER_W=1280
-MUXER_H=720
-
-mkdir -p "${CONFIGS_DIR}"
-
-echo "======================================================"
-echo "DS BS_OPT Sweep — 2-Phase Smart Search"
-echo "  FPS threshold : ${FPS_THRESHOLD} fps/stream"
-echo "  DS efficiency : ${DS_EFFICIENCY} (trtexec QPS target: ${TRT_QPS_TARGET})"
-echo "  Probe sizes   : ${PROBE_SIZES[*]}"
-echo "  Input tensor  : ${INPUT_NAME} (${H}x${W})"
-echo "======================================================"
-
-# ── PHASE 1: trtexec probe at BS=1,4,8 ──────────────────
-echo ""
-echo "PHASE 1: trtexec probe (BS=${PROBE_SIZES[*]})"
-
-declare -a PROBE_BS_ARR PROBE_QPS_ARR
-
-for BS in "${PROBE_SIZES[@]}"; do
-    echo "  trtexec BS=${BS}..."
-    LOG="${ENGINES_DIR}/probe_bs${BS}.log"
-    trtexec \
-        --loadEngine="${DYNAMIC_ENGINE}" \
-        --fp16 \
-        --shapes=${INPUT_NAME}:${BS}x3x${H}x${W} \
-        --duration=${PROBE_DURATION} \
-        --warmUp=2000 \
-        > "${LOG}" 2>&1
-    QPS=$(grep "Throughput:" "${LOG}" | grep -oP 'Throughput: \K[0-9.]+' | head -1)
-    echo "    BS=${BS}: ${QPS} QPS"
-    PROBE_BS_ARR+=("${BS}")
-    PROBE_QPS_ARR+=("${QPS}")
-done
-
-# ── Power-law fit: QPS = a × BS^(-alpha) ────────────────
-# Use BS=4 and BS=8 points to fit alpha (most stable region)
-# alpha = log(QPS4/QPS8) / log(8/4)
-QPS4="${PROBE_QPS_ARR[1]}"
-QPS8="${PROBE_QPS_ARR[2]}"
-
-ALPHA=$(python3 -c "
-import math
-qps4, qps8 = float('${QPS4}'), float('${QPS8}')
-alpha = math.log(qps4 / qps8) / math.log(8.0 / 4.0)
-print(f'{alpha:.4f}')
-")
-A_COEFF=$(python3 -c "
-import math
-qps8, alpha = float('${QPS8}'), float('${ALPHA}')
-a = qps8 * (8.0 ** alpha)
-print(f'{a:.4f}')
-")
-
-echo ""
-echo "  Curve fit: QPS = ${A_COEFF} × BS^(-${ALPHA})"
-
-# Solve for BS where QPS = TRT_QPS_TARGET
-# BS_pred = (a / QPS_target)^(1/alpha)
-# Guard: if alpha ~ 0 (flat curve — memory-bandwidth-bound or very small model),
-# 1/alpha diverges. Use the cap directly and let Phase 2 DS runs confirm.
-BS_PRED=$(python3 -c "
-import math
-a, alpha = float('${A_COEFF}'), float('${ALPHA}')
-target = float('${TRT_QPS_TARGET}')
-if abs(alpha) < 1e-3:
-    bs_pred = 128
-else:
-    bs_pred = (a / target) ** (1.0 / alpha)
-print(int(bs_pred))
-")
-
-echo "  Predicted BS_pred = ${BS_PRED} (trtexec QPS ≈ ${TRT_QPS_TARGET} at this batch)"
-echo ""
-
-# Clamp BS_pred to reasonable range [8, 128]
-BS_PRED=$(python3 -c "print(max(8, min(128, int('${BS_PRED}'))))")
-
-# ── PHASE 2: DeepStream confirmation ────────────────────
-echo "PHASE 2: DeepStream confirmation around BS_pred=${BS_PRED}"
-
-# Test BS_pred first; if it fails, fall back to BS_pred minus a small step.
-# The step is ~10% of BS_pred, with a minimum of 1.
-BS_STEP=$(python3 -c "
-bs = int('${BS_PRED}')
-# step = ~10% of BS_pred, minimum 1
-step = max(1, round(bs * 0.1))
-print(step)
-")
-
-CANDIDATES=("${BS_PRED}")
-BS_LOWER=$(( BS_PRED - BS_STEP ))
-[ "${BS_LOWER}" -ge 1 ] && CANDIDATES+=("${BS_LOWER}")
-
-best_bs=1
-best_fps_stream=0
-best_ips=0
-
-for BS in "${CANDIDATES[@]}"; do
-    echo ""
-    echo "=== DS Confirmation BS=${BS} (${BS} streams) ==="
-
-    # Write nvinfer config pointing to dynamic engine at this batch size
-    BS_CONFIG="${CONFIGS_DIR}/config_infer_sweep_b${BS}.txt"
-    sed \
-        -e "s|model-engine-file=.*|model-engine-file=${DYNAMIC_ENGINE}|" \
-        -e "s|batch-size=.*|batch-size=${BS}|" \
-        -e "s|custom-lib-path=.*|custom-lib-path=${PARSER_SO}|" \
-        -e "s|labelfile-path=.*|labelfile-path=${LABELS}|" \
-        "${CONFIG_TEMPLATE}" > "${BS_CONFIG}"
-
-    # actual frames = ACTUAL_FRAMES_PER_STREAM × BS (no num-buffers limit on filesrc —
-    # let each source read to natural EOS so we always process the full video)
-    TOTAL_FRAMES=$((ACTUAL_FRAMES_PER_STREAM * BS))
-    SOURCES=""
-    for i in $(seq 0 $((BS - 1))); do
-        SOURCES+="filesrc location=${VIDEO} ! qtdemux ! queue ! h264parse ! queue ! nvv4l2decoder ! queue ! mux.sink_${i} "
-    done
-
-    START_TIME=$(date +%s%N)
-    GST_DEBUG=0 gst-launch-1.0 -e \
-        ${SOURCES} \
-        nvstreammux name=mux batch-size=${BS} width=${MUXER_W} height=${MUXER_H} batched-push-timeout=40000 ! \
-        queue ! \
-        nvinfer config-file-path="${BS_CONFIG}" ! \
-        queue ! \
-        fakesink sync=0 2>>"${DS_ERR_LOG}" || true
-    END_TIME=$(date +%s%N)
-
-    # Warn if the pipeline wrote anything to stderr — likely a plugin/config error
-    if [ -s "${DS_ERR_LOG}" ]; then
-        echo "  [warn] GStreamer stderr output captured — see ${DS_ERR_LOG} for details" >&2
-    fi
-
-    ELAPSED_SEC=$(echo "scale=2; $(( END_TIME - START_TIME )) / $NS_PER_SEC" | bc)
-    DS_IPS=$(echo "scale=1; ${TOTAL_FRAMES} / ${ELAPSED_SEC}" | bc)
-    DS_FPS_STREAM=$(echo "scale=1; ${DS_IPS} / ${BS}" | bc)
-    DS_REALTIME=$(echo "scale=2; ${DS_FPS_STREAM} / ${FPS_THRESHOLD}" | bc)
-    DS_FPS_INT=$(echo "${DS_FPS_STREAM}" | cut -d. -f1)
-    DS_IPS_INT=$(echo "${DS_IPS}" | cut -d. -f1)
-
-    echo "  BS=${BS}: wall=${ELAPSED_SEC}s imgs/s=${DS_IPS} fps/stream=${DS_FPS_STREAM} realtime=${DS_REALTIME}x"
-
-    if [ "${DS_FPS_INT}" -ge "${FPS_THRESHOLD}" ]; then
-        best_bs="${BS}"
-        best_fps_stream="${DS_FPS_STREAM}"
-        best_ips="${DS_IPS_INT}"
-        echo "  -> PASS (>=${FPS_THRESHOLD} fps/stream)"
-        break   # highest candidate that passes is BS_OPT
-    else
-        echo "  -> FAIL (<${FPS_THRESHOLD} fps/stream), trying lower..."
-    fi
-done
-
-# Write results
-echo ""
-echo "======================================================"
-echo "SWEEP COMPLETE"
-echo "  BS_OPT         = ${best_bs}"
-echo "  DS fps/stream  = ${best_fps_stream} (threshold: ${FPS_THRESHOLD})"
-echo "  DS imgs/sec    = ${best_ips}"
-echo "  trtexec alpha  = ${ALPHA} (curve steepness)"
-echo "  BS_pred was    = ${BS_PRED}"
-echo "======================================================"
-
-cat > "${ENGINES_DIR}/bs_opt.txt" << EOF
-BS_OPT=${best_bs}
-DS_FPS_PER_STREAM=${best_fps_stream}
-DS_IPS=${best_ips}
-TRT_ALPHA=${ALPHA}
-TRT_A_COEFF=${A_COEFF}
-BS_PRED=${BS_PRED}
-EOF
-
-# Print probe summary
-echo ""
-echo "Phase 1 trtexec probe summary:"
-echo "batch,qps,imgs_per_sec"
-for i in "${!PROBE_BS_ARR[@]}"; do
-    BS="${PROBE_BS_ARR[$i]}"
-    QPS="${PROBE_QPS_ARR[$i]}"
-    IPS=$(echo "scale=0; ${QPS} * ${BS}" | bc)
-    echo "${BS},${QPS},${IPS}"
-done
-
-echo "${best_bs}"
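Reviewer note on the removed ds-sweep.sh: the Phase 1 fit is spread across three inline `python3 -c` calls, so a compact restatement may help when checking the math. The sketch below is an illustration, not the script's code. The function names `fit_power_law` and `predict_batch_size` and the probe numbers in the usage lines are hypothetical; the formulas, the near-zero-alpha guard, and the [8, 128] clamp follow the script.

```python
import math


def fit_power_law(bs_lo, qps_lo, bs_hi, qps_hi):
    """Fit QPS = a * BS**(-alpha) through two (batch size, QPS) points."""
    # Dividing the two equations eliminates a:
    #   qps_lo / qps_hi = (bs_hi / bs_lo) ** alpha
    alpha = math.log(qps_lo / qps_hi) / math.log(bs_hi / bs_lo)
    # Back-substitute the higher-batch point to recover a.
    a = qps_hi * bs_hi ** alpha
    return a, alpha


def predict_batch_size(a, alpha, qps_target, lo=8, hi=128):
    """Solve a * BS**(-alpha) = qps_target for BS, clamped to [lo, hi]."""
    if abs(alpha) < 1e-3:
        # Flat curve: 1/alpha diverges, so start from the cap and let the
        # DeepStream confirmation runs decide.
        bs = hi
    else:
        bs = (a / qps_target) ** (1.0 / alpha)
    return max(lo, min(hi, int(bs)))


if __name__ == "__main__":
    # Hypothetical probe results: 400 QPS at BS=4 and 200 QPS at BS=8.
    a, alpha = fit_power_law(4, 400.0, 8, 200.0)
    # trtexec target = FPS_THRESHOLD / DS_EFFICIENCY, as in the script.
    target = 30 / 0.65
    print(f"a={a:.4f} alpha={alpha:.4f} BS_pred={predict_batch_size(a, alpha, target)}")
```

Keeping the fit and the solve in two small functions makes the guard and the clamp testable without running trtexec. Note that a very small but non-zero alpha (just above the 1e-3 guard) can still produce a huge exponent, so a production version may want to catch `OverflowError` as well.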
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/extract-frame.sh b/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/extract-frame.sh
deleted file mode 100755
index 4eba5919..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/deepstream/extract-frame.sh
+++ /dev/null
@@ -1,53 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-set -o pipefail
-################################################################################
-# Step 6: Extract first frame from output video as PNG for visual inspection.
-#
-# Usage: ./extract-frame.sh <input_video> <output_png>
-# Example: ./extract-frame.sh yolox_output.mp4 yolox_frame_sample.png
-################################################################################
-
-INPUT="$1"
-OUTPUT="$2"
-
-if [ -z "$INPUT" ] || [ -z "$OUTPUT" ]; then
-    echo "Usage: $0 <input_video> <output_png>"
-    exit 1
-fi
-
-if [[ "$INPUT" == *.ogv ]]; then
-    gst-launch-1.0 \
-        filesrc location="${INPUT}" ! oggdemux ! theoradec ! videoconvert ! "video/x-raw,format=RGB" ! \
-        pngenc snapshot=true ! filesink location="${OUTPUT}" \
-        2>&1 | grep -v "^$"
-else
-    gst-launch-1.0 \
-        filesrc location="${INPUT}" ! qtdemux ! queue ! h264parse ! queue ! nvv4l2decoder ! queue ! \
-        nvvideoconvert ! "video/x-raw,format=RGB" ! videoconvert ! \
-        pngenc snapshot=true ! filesink location="${OUTPUT}" \
-        2>&1 | grep -v "^$"
-fi
-STATUS=$?
-
-if [ $STATUS -eq 0 ] && [ -f "$OUTPUT" ]; then
-    echo "Frame extracted: ${OUTPUT} ($(ls -lh "$OUTPUT" | awk '{print $5}'))"
-else
-    echo "ERROR: Pipeline failed or produced no output (exit code $STATUS)" >&2
-    exit $(( STATUS != 0 ? STATUS : 1 ))
-fi
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/engine/benchmark-trtexec.sh b/skills/deepstream/deepstream-import-vision-model/scripts/engine/benchmark-trtexec.sh
deleted file mode 100755
index 50bde6e1..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/engine/benchmark-trtexec.sh
+++ /dev/null
@@ -1,75 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-################################################################################
-# Step 8a: TensorRT benchmark using trtexec for arbitrary batch sizes.
-# Runs 10-second benchmarks and reports GPU compute time + throughput.
-#
-# Usage: ./benchmark-trtexec.sh <bs:engine> [<bs:engine> ...] [duration_sec]
-# Example: ./benchmark-trtexec.sh 1:yolox_nano_b1.engine 64:yolox_nano_b64.engine
-#          ./benchmark-trtexec.sh 1:b1.engine 64:b64.engine 20
-################################################################################
-
-# Any plain-integer arg is treated as the duration (the last one wins); all others are bs:engine pairs.
-DURATION=10
-ENGINE_PAIRS=()
-for arg in "$@"; do
-    if [[ "$arg" =~ ^[0-9]+$ ]]; then
-        DURATION="$arg"
-    else
-        ENGINE_PAIRS+=("$arg")
-    fi
-done
-
-if [ ${#ENGINE_PAIRS[@]} -eq 0 ]; then
-    echo "Usage: $0 <bs:engine> [<bs:engine> ...] [duration_sec]"
-    echo "  e.g. $0 1:model_b1.engine 64:model_b64.engine"
-    exit 1
-fi
-
-TRTEXEC="/usr/src/tensorrt/bin/trtexec"
-
-echo "=== TensorRT Benchmark ==="
-echo "Duration: ${DURATION}s per engine"
-echo ""
-
-for ENGINE_INFO in "${ENGINE_PAIRS[@]}"; do
-    BATCH="${ENGINE_INFO%%:*}"
-    ENGINE="${ENGINE_INFO#*:}"
-
-    if [ ! -f "$ENGINE" ]; then
-        echo "SKIP Batch ${BATCH}: ${ENGINE} not found"
-        echo ""
-        continue
-    fi
-
-    echo "--- Batch ${BATCH}: ${ENGINE} ---"
-    OUTPUT=$($TRTEXEC --loadEngine="$ENGINE" --fp16 --duration="$DURATION" 2>&1)
-
-    THROUGHPUT=$(echo "$OUTPUT" | grep "\[I\] Throughput:" | grep -oP 'Throughput: \K[0-9.]+')
-    GPU_MEAN=$(echo "$OUTPUT" | grep "GPU Compute Time:" | grep -oP 'mean = \K[0-9.]+')
-    GPU_MIN=$(echo "$OUTPUT" | grep "GPU Compute Time:" | grep -oP 'min = \K[0-9.]+')
-    GPU_MAX=$(echo "$OUTPUT" | grep "GPU Compute Time:" | grep -oP 'max = \K[0-9.]+')
-    IMGS_PER_SEC=$(echo "scale=0; $THROUGHPUT * $BATCH" | bc 2>/dev/null)
-
-    echo "  GPU Compute: ${GPU_MEAN} ms (min=${GPU_MIN}, max=${GPU_MAX})"
-    echo "  Throughput:  ${THROUGHPUT} qps"
-    echo "  Images/sec:  ${IMGS_PER_SEC}"
-    echo ""
-done
-
-echo "=== Benchmark Complete ==="
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/cleanup.sh b/skills/deepstream/deepstream-import-vision-model/scripts/model/cleanup.sh
deleted file mode 100755
index 1ada608d..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/cleanup.sh
+++ /dev/null
@@ -1,94 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-# cleanup.sh — Remove build/models artifacts for a given model name.
-# Validated replacement for ad-hoc directory removal after ONNX export.
-#
-# Only removes paths that:
-#   - are non-empty
-#   - exist
-#   - resolve under ./build/ or ./models/
-#   - match the given MODEL_NAME (regex-validated)
-#
-# Usage:
-#   bash cleanup.sh <MODEL_NAME> [--dry-run]
-#
-# Example:
-#   bash cleanup.sh yolov8n
-#   bash cleanup.sh yolov8n --dry-run
-set -euo pipefail
-
-MODEL_NAME="${1:-}"
-DRY_RUN=false
-if [[ "${2:-}" == "--dry-run" ]]; then
-    DRY_RUN=true
-fi
-
-if [[ -z "$MODEL_NAME" ]]; then
-    echo "Usage: $0 <MODEL_NAME> [--dry-run]" >&2
-    exit 1
-fi
-
-if ! [[ "$MODEL_NAME" =~ ^[A-Za-z0-9._-]+$ ]]; then
-    echo "ERROR: MODEL_NAME must match ^[A-Za-z0-9._-]+$ (got: $MODEL_NAME)" >&2
-    exit 1
-fi
-
-# The regex above accepts "." and ".." — reject them explicitly since those
-# would make the candidate paths (build/.venv_$MODEL_NAME, models/$MODEL_NAME/*)
-# point at directories we don't own.
-if [[ "$MODEL_NAME" == "." || "$MODEL_NAME" == ".." ]]; then
-    echo "ERROR: MODEL_NAME cannot be '.' or '..' (got: $MODEL_NAME)" >&2
-    exit 1
-fi
-
-CWD="$(pwd -P)"
-
-# Paths eligible for removal — all are scoped under CWD's build/ or models/
-CANDIDATES=(
-    "build/.venv_${MODEL_NAME}"
-    "models/${MODEL_NAME}/hf_model"
-    "models/${MODEL_NAME}/onnx_export"
-)
-
-echo "=== cleanup.sh — MODEL_NAME=$MODEL_NAME dry-run=$DRY_RUN ==="
-for rel in "${CANDIDATES[@]}"; do
-    abs="$CWD/$rel"
-    if [[ ! -e "$abs" ]]; then
-        echo "  skip (not present): $rel"
-        continue
-    fi
-
-    # Assert the resolved path is still under CWD's build/ or models/
-    resolved="$(cd "$(dirname "$abs")" && pwd -P)/$(basename "$abs")"
-    case "$resolved" in
-        "$CWD"/build/*|"$CWD"/models/*) ;;
-        *)
-            echo "  SKIP (outside build/ or models/): $resolved"
-            continue
-            ;;
-    esac
-
-    if $DRY_RUN; then
-        echo "  [dry-run] rm -rf $resolved"
-    else
-        echo "  removing: $resolved"
-        rm -rf -- "$resolved"
-    fi
-done
-
-echo "Done."
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/hf-download-config.sh b/skills/deepstream/deepstream-import-vision-model/scripts/model/hf-download-config.sh
deleted file mode 100755
index 96e3dae6..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/hf-download-config.sh
+++ /dev/null
@@ -1,74 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-# hf-download-config.sh — Download config.json from a HuggingFace repo.
-# Safer replacement for the inline `curl -fsSL ... -o ...` snippet.
-#
-# Usage:
-#   bash hf-download-config.sh <HF_ORG> <MODEL_NAME> <DEST_PATH>
-#
-# Example:
-#   bash hf-download-config.sh onnx-community yolov8n models/yolov8n/config/config.json
-#
-# Honors $HF_TOKEN if set.
-set -euo pipefail
-
-HF_ORG="${1:-}"
-MODEL_NAME="${2:-}"
-DEST="${3:-}"
-
-if [[ -z "$HF_ORG" || -z "$MODEL_NAME" || -z "$DEST" ]]; then
-    echo "Usage: $0 <HF_ORG> <MODEL_NAME> <DEST_PATH>" >&2
-    exit 1
-fi
-
-for arg_name in HF_ORG MODEL_NAME; do
-    val="${!arg_name}"
-    if ! [[ "$val" =~ ^[A-Za-z0-9._/-]+$ ]]; then
-        echo "ERROR: $arg_name contains invalid characters: $val" >&2
-        exit 1
-    fi
-done
-
-# DEST must be a relative path and must not contain .. segments
-# (prevents writes outside the project tree)
-case "$DEST" in
-    /*)
-        echo "ERROR: DEST_PATH must be relative (absolute paths are rejected): $DEST" >&2
-        exit 1
-        ;;
-    *..*)
-        echo "ERROR: DEST_PATH contains '..' — refusing: $DEST" >&2
-        exit 1
-        ;;
-esac
-
-URL="https://huggingface.co/${HF_ORG}/${MODEL_NAME}/resolve/main/config.json"
-
-CURL_OPTS=(-fsSL --proto '=https' --tlsv1.2 --max-time 60 -o "$DEST")
-if [[ -n "${HF_TOKEN:-}" ]]; then
-    CURL_OPTS+=(-H "Authorization: Bearer ${HF_TOKEN}")
-fi
-
-mkdir -p "$(dirname "$DEST")"
-
-if ! curl "${CURL_OPTS[@]}" "$URL"; then
-    echo "ERROR: failed to download config.json from ${HF_ORG}/${MODEL_NAME} (missing file, auth, or network error); cannot extract labels" >&2
-    exit 1
-fi
-
-echo "Downloaded: $DEST"
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/hf-list-files.sh b/skills/deepstream/deepstream-import-vision-model/scripts/model/hf-list-files.sh
deleted file mode 100755
index aaa75c1a..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/hf-list-files.sh
+++ /dev/null
@@ -1,126 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-# hf-list-files.sh — List model files in a HuggingFace repo.
-# Uses the HF tree API with validated inputs, HTTPS+TLSv1.2, and a bounded
-# timeout. Parses the JSON response via the stdlib json module (no shell pipe).
-#
-# Usage:
-#   bash hf-list-files.sh <HF_ORG> <MODEL_NAME> [subpath]
-#
-# Examples:
-#   bash hf-list-files.sh onnx-community yolov8n
-#   bash hf-list-files.sh onnx-community yolov8n onnx        # check /onnx subdir
-#
-# Honors $HF_TOKEN if set (passed as Authorization: Bearer header).
-set -euo pipefail
-
-HF_ORG="${1:-}"
-MODEL_NAME="${2:-}"
-SUBPATH="${3:-}"
-
-if [[ -z "$HF_ORG" || -z "$MODEL_NAME" ]]; then
-    echo "Usage: $0 <HF_ORG> <MODEL_NAME> [subpath]" >&2
-    exit 1
-fi
-
-# Input validation — reject anything that could escape the URL
-for arg_name in HF_ORG MODEL_NAME SUBPATH; do
-    val="${!arg_name:-}"
-    if [[ -n "$val" ]] && ! [[ "$val" =~ ^[A-Za-z0-9._/-]+$ ]]; then
-        echo "ERROR: $arg_name contains invalid characters (must match ^[A-Za-z0-9._/-]+\$): $val" >&2
-        exit 1
-    fi
-done
-
-URL="https://huggingface.co/api/models/${HF_ORG}/${MODEL_NAME}/tree/main"
-[[ -n "$SUBPATH" ]] && URL="${URL}/${SUBPATH}"
-
-# -sS: silent progress but still surface errors on stderr
-# -w "%{http_code}": append HTTP status as the last 3 chars of the response body
-# Drop -f so curl doesn't exit non-zero on 4xx — we inspect the status ourselves
-# so 404 (missing subpath) can be distinguished from network/auth failures.
-CURL_OPTS=(-sS --proto '=https' --tlsv1.2 --max-time 30 -w '%{http_code}')
-if [[ -n "${HF_TOKEN:-}" ]]; then
-    CURL_OPTS+=(-H "Authorization: Bearer ${HF_TOKEN}")
-fi
-
-# Capture curl's exit code without tripping `set -e`, so failures can be diagnosed precisely.
-CURL_RC=0
-RESPONSE="$(curl "${CURL_OPTS[@]}" "$URL")" || CURL_RC=$?
-
-if [[ $CURL_RC -ne 0 ]]; then
-    echo "ERROR: curl failed (exit $CURL_RC) while fetching $URL" >&2
-    exit 1
-fi
-
-# -w appends the 3-digit status to the body; split them back apart.
-HTTP_CODE="${RESPONSE: -3}"
-JSON="${RESPONSE:0:${#RESPONSE}-3}"
-
-case "$HTTP_CODE" in
-    200) ;;  # fall through to parsing
-    404)
-        # Acceptable: the requested subpath (e.g. /onnx) doesn't exist.
-        exit 0
-        ;;
-    401|403)
-        echo "ERROR: HTTP $HTTP_CODE from HuggingFace for $URL (auth/permission)" >&2
-        exit 1
-        ;;
-    *)
-        echo "ERROR: HTTP $HTTP_CODE from HuggingFace for $URL" >&2
-        exit 1
-        ;;
-esac
-
-# 200 but empty body is unexpected — surface it rather than silently swallow.
-if [[ -z "$JSON" ]]; then
-    echo "ERROR: HTTP 200 but empty body from $URL" >&2
-    exit 1
-fi
-
-# Parse via python3 (json module is stdlib). Each line: <path>
-python3 - "$JSON" <<'PYEOF'
-import json, sys
-data = sys.argv[1]
-try:
-    entries = json.loads(data)
-except json.JSONDecodeError as e:
-    # Surface the decode error so callers can distinguish "empty repo" from
-    # "HF returned something we can't parse" (upstream format change, captive
-    # portal HTML, etc.). Truncate the raw data so we don't dump a multi-MB
-    # response into logs.
-    preview = data[:500] + ("... [truncated]" if len(data) > 500 else "")
-    print(f"ERROR: failed to parse JSON from HuggingFace API: {e}", file=sys.stderr)
-    print(f"  raw response: {preview!r}", file=sys.stderr)
-    sys.exit(1)
-if not isinstance(entries, list):
-    preview = repr(entries)[:500]
-    print(
-        f"ERROR: unexpected response type from HuggingFace API: "
-        f"{type(entries).__name__} (expected list)",
-        file=sys.stderr,
-    )
-    print(f"  contents: {preview}", file=sys.stderr)
-    sys.exit(1)
-# Empty list is valid (directory exists but has no files) — exit 0 silently.
-for e in entries:
-    p = e.get("path") if isinstance(e, dict) else None
-    if p:
-        print(p)
-PYEOF
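
The status handling in the removed script relies on `curl -w '%{http_code}'` appending exactly three status digits to the response body. A minimal Python sketch of the same split and dispatch is given below. The names `split_status` and `classify` are hypothetical and used only for illustration; they are not part of the script.

```python
from typing import Tuple


def split_status(response: str) -> Tuple[int, str]:
    """Split a curl response whose last 3 characters are the HTTP status."""
    if len(response) < 3 or not response[-3:].isdigit():
        raise ValueError("response does not end in a 3-digit HTTP status")
    return int(response[-3:]), response[:-3]


def classify(status: int) -> str:
    """Mirror the shell case statement: parse, skip, or fail."""
    if status == 200:
        return "parse"
    if status == 404:
        return "skip"  # a missing subpath is treated as acceptable
    if status in (401, 403):
        return "auth-error"
    return "error"
```

Validating that the suffix is numeric guards against a truncated response, a case the shell version would silently misread as a status code.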
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/inspect-onnx.py b/skills/deepstream/deepstream-import-vision-model/scripts/model/inspect-onnx.py
deleted file mode 100644
index 0678d0bc..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/inspect-onnx.py
+++ /dev/null
@@ -1,99 +0,0 @@
-#!/usr/bin/env python3
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-"""
-Step 1: Inspect an ONNX model — inputs, outputs, opset, operators, validity.
-Usage: python3 inspect-onnx.py <onnx_file>
-"""
-import sys
-import onnx
-
-if len(sys.argv) != 2:
-    print(f"Usage: {sys.argv[0]} <onnx_file>")
-    sys.exit(1)
-
-try:
-    model = onnx.load(sys.argv[1])
-except FileNotFoundError:
-    print(f"Error: File '{sys.argv[1]}' not found")
-    sys.exit(1)
-except Exception as e:
-    print(f"Error loading ONNX model: {e}")
-    sys.exit(1)
-
-print("=== ONNX Model Info ===")
-print(f"File:     {sys.argv[1]}")
-opset_ver = model.opset_import[0].version if model.opset_import else "N/A"
-print(f"Opset:    {opset_ver}")
-print(f"IR ver:   {model.ir_version}")
-print(f"Producer: {model.producer_name} {model.producer_version}")
-
-graph = getattr(model, "graph", None)
-if graph is None:
-    print("Error: ONNX model has no graph")
-    sys.exit(1)
-
-print(f"Nodes:    {len(graph.node)}")
-
-dtype_map = {1: "float32", 10: "float16", 7: "int64", 6: "int32", 9: "bool"}
-
-print("\n=== INPUTS ===")
-for inp in graph.input:
-    shape = [d.dim_value if d.dim_value else d.dim_param for d in inp.type.tensor_type.shape.dim]
-    dtype = dtype_map.get(inp.type.tensor_type.elem_type, inp.type.tensor_type.elem_type)
-    print(f"  {inp.name}: shape={shape}, dtype={dtype}")
-
-print("\n=== OUTPUTS ===")
-for out in graph.output:
-    shape = [d.dim_value if d.dim_value else d.dim_param for d in out.type.tensor_type.shape.dim]
-    dtype = dtype_map.get(out.type.tensor_type.elem_type, out.type.tensor_type.elem_type)
-    print(f"  {out.name}: shape={shape}, dtype={dtype}")
-
-print("\n=== Operators ===")
-op_types = sorted(set(n.op_type for n in graph.node))
-print(f"  {', '.join(op_types)}")
-print(f"  Total unique ops: {len(op_types)}")
-
-try:
-    onnx.checker.check_model(model)
-    print("\n✓ ONNX model is valid")
-except Exception as e:
-    print(f"\n✗ ONNX validation error: {e}")
-
-# --- Machine-parseable summary (consumed by nv-engine-build and ds-run-pipeline) ---
-# grep patterns expect lines: "input_name: <name>", "height: <int>", "width: <int>"
-print("\n=== Machine-Parseable Summary ===")
-if graph and graph.input:
-    inp = graph.input[0]
-    dims = inp.type.tensor_type.shape.dim
-    print(f"input_name: {inp.name}")
-    if len(dims) >= 4:
-        # Assume NCHW: dim[0]=batch, dim[1]=channels, dim[2]=H, dim[3]=W
-        h_val = dims[2].dim_value  # 0 means dynamic
-        w_val = dims[3].dim_value
-        if h_val > 0 and w_val > 0:
-            print(f"height: {h_val}")
-            print(f"width: {w_val}")
-        else:
-            # Dynamic spatial dims — print symbol so callers can detect failure
-            print(f"height: DYNAMIC (symbol={dims[2].dim_param or 'unknown'})")
-            print(f"width: DYNAMIC (symbol={dims[3].dim_param or 'unknown'})")
-            print("WARNING: Dynamic H/W — set height and width manually in trtexec flags")
-    else:
-        print(f"WARNING: Input has {len(dims)} dims — expected 4 (NCHW); cannot auto-detect H/W")
-else:
-    print("WARNING: No inputs found in ONNX graph")
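
The machine-parseable summary printed by the removed `inspect-onnx.py` is described as being consumed with grep patterns. As a rough sketch of how a caller might read the same three lines in Python, assuming only the `input_name:`, `height:`, and `width:` line formats shown in the script, consider the following. The function `parse_summary` is hypothetical and not part of the repository.

```python
import re
from typing import Dict, Optional, Union

Value = Optional[Union[str, int]]


def parse_summary(text: str) -> Dict[str, Value]:
    """Extract input_name/height/width; dynamic dims are left as None."""
    result: Dict[str, Value] = {"input_name": None, "height": None, "width": None}
    for line in text.splitlines():
        match = re.match(r"^(input_name|height|width):\s*(.+)$", line)
        if not match:
            continue
        key, value = match.group(1), match.group(2).strip()
        if key == "input_name":
            result[key] = value
        elif value.isdigit():
            result[key] = int(value)
        # A value such as "DYNAMIC (symbol=h)" is not numeric, so the
        # corresponding entry stays None and the caller can detect it.
    return result
```

Returning `None` for dynamic dimensions lets a caller fall back to manually supplied sizes, which is what the script's own warning suggests.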
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/make-static-batch-onnx.py b/skills/deepstream/deepstream-import-vision-model/scripts/model/make-static-batch-onnx.py
deleted file mode 100644
index a4f2b6fc..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/make-static-batch-onnx.py
+++ /dev/null
@@ -1,82 +0,0 @@
-#!/usr/bin/env python3
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-"""
-Step 7: Create static-batch ONNX files from a batch-1 ONNX.
-Patches input/output batch dims and internal Reshape nodes.
-
-Usage: python3 make-static-batch-onnx.py <src_onnx> <dst_onnx> <batch_size>
-Example: python3 make-static-batch-onnx.py yolox_nano.onnx b16/yolox_nano_b16.onnx 16
-"""
-import sys
-import onnx
-import numpy as np
-from onnx import numpy_helper
-
-if len(sys.argv) != 4:
-    print(f"Usage: {sys.argv[0]} <src_onnx> <dst_onnx> <batch_size>")
-    sys.exit(1)
-
-src_path = sys.argv[1]
-dst_path = sys.argv[2]
-try:
-    batch_size = int(sys.argv[3])
-    if batch_size <= 0:
-        raise ValueError("Batch size must be positive")
-except ValueError as e:
-    print(f"Error: Invalid batch size '{sys.argv[3]}': {e}")
-    sys.exit(1)
-
-try:
-    model = onnx.load(src_path)
-except FileNotFoundError:
-    print(f"Error: File '{src_path}' not found")
-    sys.exit(1)
-except Exception as e:
-    print(f"Error loading ONNX model: {e}")
-    sys.exit(1)
-
-graph = getattr(model, "graph", None)
-if graph is None:
-    print("Error: ONNX model has no graph")
-    sys.exit(1)
-
-# Set static batch on inputs
-for inp in graph.input:
-    if len(inp.type.tensor_type.shape.dim) > 0:
-        inp.type.tensor_type.shape.dim[0].dim_param = ""
-        inp.type.tensor_type.shape.dim[0].dim_value = batch_size
-
-# Set static batch on outputs
-for out in graph.output:
-    if len(out.type.tensor_type.shape.dim) > 0:
-        out.type.tensor_type.shape.dim[0].dim_param = ""
-        out.type.tensor_type.shape.dim[0].dim_value = batch_size
-
-# Fix Reshape nodes that reference batch=1
-for node in graph.node:
-    if node.op_type == "Reshape":
-        shape_input = node.input[1]
-        for init in graph.initializer:
-            if init.name == shape_input:
-                shape_data = numpy_helper.to_array(init).copy()
-                if shape_data.size > 0 and shape_data[0] == 1:
-                    shape_data[0] = batch_size
-                    init.CopyFrom(numpy_helper.from_array(shape_data, name=init.name))
-
-onnx.save(model, dst_path)
-print(f"Saved {dst_path} with batch={batch_size}")
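
The Reshape patch in the removed `make-static-batch-onnx.py` rewrites any constant target shape whose first element is 1. The rule itself can be sketched on plain Python lists, without `onnx` or `numpy`. The helper `patch_batch_dim` is hypothetical and only illustrates the rule.

```python
from typing import List


def patch_batch_dim(shape: List[int], batch_size: int) -> List[int]:
    """Return a copy of `shape` with a leading 1 replaced by `batch_size`."""
    if batch_size <= 0:
        raise ValueError("batch size must be positive")
    patched = list(shape)
    if patched and patched[0] == 1:
        patched[0] = batch_size
    return patched
```

Two properties follow from the rule as written. Shapes that start with `-1` or `0` are left untouched, and a leading 1 that does not represent the batch dimension is rewritten as well, so the output of such a script is worth re-checking with a model inspector.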
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/ngc-download.sh b/skills/deepstream/deepstream-import-vision-model/scripts/model/ngc-download.sh
deleted file mode 100755
index 6ac98333..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/ngc-download.sh
+++ /dev/null
@@ -1,114 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-# ngc-download.sh — Download all files from a public NGC model version.
-# Prefers the official `ngc` CLI. Falls back to unauthenticated HTTPS via curl
-# only when the CLI is not installed; the fallback prints an explicit warning.
-#
-# Usage:
-#   bash ngc-download.sh <NGC_ORG> <NGC_TEAM> <MODEL_NAME> <NGC_VERSION> <DEST_DIR>
-#
-# Example:
-#   bash ngc-download.sh nvidia tao peoplenet trainable_v2.6 models/peoplenet/ngc_download
-set -euo pipefail
-
-NGC_ORG="${1:-}"
-NGC_TEAM="${2:-}"
-MODEL_NAME="${3:-}"
-NGC_VERSION="${4:-}"
-DEST_DIR="${5:-}"
-
-if [[ -z "$NGC_ORG" || -z "$MODEL_NAME" || -z "$NGC_VERSION" || -z "$DEST_DIR" ]]; then
-    echo "Usage: $0 <NGC_ORG> <NGC_TEAM> <MODEL_NAME> <NGC_VERSION> <DEST_DIR>" >&2
-    echo "  NGC_TEAM may be empty-string if the model has no team segment." >&2
-    exit 1
-fi
-
-for var in NGC_ORG MODEL_NAME NGC_VERSION; do
-    val="${!var}"
-    if ! [[ "$val" =~ ^[A-Za-z0-9._-]+$ ]]; then
-        echo "ERROR: $var contains invalid characters: $val" >&2
-        exit 1
-    fi
-done
-if [[ -n "$NGC_TEAM" ]] && ! [[ "$NGC_TEAM" =~ ^[A-Za-z0-9._-]+$ ]]; then
-    echo "ERROR: NGC_TEAM contains invalid characters: $NGC_TEAM" >&2
-    exit 1
-fi
-
-case "$DEST_DIR" in
-    ""|"/"|*..*)
-        echo "ERROR: invalid DEST_DIR: $DEST_DIR" >&2
-        exit 1
-        ;;
-esac
-
-mkdir -p "$DEST_DIR"
-
-# Preferred: ngc CLI (authenticated, verified)
-if command -v ngc >/dev/null 2>&1 && ngc --version >/dev/null 2>&1; then
-    if [[ -n "$NGC_TEAM" ]]; then
-        SPEC="${NGC_ORG}/${NGC_TEAM}/${MODEL_NAME}:${NGC_VERSION}"
-    else
-        SPEC="${NGC_ORG}/${MODEL_NAME}:${NGC_VERSION}"
-    fi
-    echo "Using ngc CLI to download $SPEC -> $DEST_DIR"
-    ngc registry model download-version "$SPEC" --dest "$DEST_DIR"
-    exit 0
-fi
-
-# Fallback: HTTPS via curl, public NGC catalog API only
-echo "WARNING: ngc CLI not available — falling back to unauthenticated HTTPS for public files." >&2
-echo "  For gated/private models, install the ngc CLI: https://ngc.nvidia.com/setup/installers/cli" >&2
-
-if [[ -n "$NGC_TEAM" ]]; then
-    NGC_BASE="https://api.ngc.nvidia.com/v2/models/${NGC_ORG}/${NGC_TEAM}/${MODEL_NAME}/versions/${NGC_VERSION}/files"
-else
-    NGC_BASE="https://api.ngc.nvidia.com/v2/models/${NGC_ORG}/${MODEL_NAME}/versions/${NGC_VERSION}/files"
-fi
-
-SCRIPT_DIR="$(cd "$(dirname "$0")" && pwd)"
-FILES="$("$SCRIPT_DIR/ngc-list-files.sh" "$NGC_ORG" "$NGC_TEAM" "$MODEL_NAME" "$NGC_VERSION")"
-
-if [[ -z "$FILES" ]]; then
-    echo "ERROR: No files returned from NGC catalog" >&2
-    exit 1
-fi
-
-echo "NGC files available:"
-echo "$FILES"
-
-while IFS= read -r FNAME; do
-    [[ -z "$FNAME" ]] && continue
-    # Skip anything with traversal characters
-    case "$FNAME" in
-        */..*|..*|*..|/*)
-            echo "  skipping suspicious filename: $FNAME"
-            continue
-            ;;
-    esac
-    DEST_PATH="$DEST_DIR/$FNAME"
-    mkdir -p "$(dirname "$DEST_PATH")"
-    echo "Downloading: $FNAME"
-    if ! curl -fsSL --proto '=https' --tlsv1.2 --max-time 600 \
-             -o "$DEST_PATH" "${NGC_BASE}/${FNAME}"; then
-        echo "  WARNING: failed to download $FNAME — skipping"
-    fi
-done <<< "$FILES"
-
-echo "Done. Files in $DEST_DIR:"
-ls -lh "$DEST_DIR" 2>/dev/null || true
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/ngc-list-files.sh b/skills/deepstream/deepstream-import-vision-model/scripts/model/ngc-list-files.sh
deleted file mode 100755
index b348ae65..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/ngc-list-files.sh
+++ /dev/null
@@ -1,82 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-# ngc-list-files.sh — List files in a public NGC model version.
-# Safer replacement for the inline curl+python snippet.
-#
-# Usage:
-#   bash ngc-list-files.sh <NGC_ORG> <NGC_TEAM> <MODEL_NAME> <NGC_VERSION>
-#
-# Example:
-#   bash ngc-list-files.sh nvidia tao peoplenet trainable_v2.6
-#
-# Output: one filename per line.
-set -euo pipefail
-
-NGC_ORG="${1:-}"
-NGC_TEAM="${2:-}"
-MODEL_NAME="${3:-}"
-NGC_VERSION="${4:-}"
-
-if [[ -z "$NGC_ORG" || -z "$MODEL_NAME" || -z "$NGC_VERSION" ]]; then
-    echo "Usage: $0 <NGC_ORG> <NGC_TEAM> <MODEL_NAME> <NGC_VERSION>" >&2
-    echo "  NGC_TEAM may be empty-string if the model has no team segment." >&2
-    exit 1
-fi
-
-for var in NGC_ORG MODEL_NAME NGC_VERSION; do
-    val="${!var}"
-    if ! [[ "$val" =~ ^[A-Za-z0-9._-]+$ ]]; then
-        echo "ERROR: $var contains invalid characters: $val" >&2
-        exit 1
-    fi
-done
-if [[ -n "$NGC_TEAM" ]] && ! [[ "$NGC_TEAM" =~ ^[A-Za-z0-9._-]+$ ]]; then
-    echo "ERROR: NGC_TEAM contains invalid characters: $NGC_TEAM" >&2
-    exit 1
-fi
-
-if [[ -n "$NGC_TEAM" ]]; then
-    NGC_BASE="https://api.ngc.nvidia.com/v2/models/${NGC_ORG}/${NGC_TEAM}/${MODEL_NAME}/versions/${NGC_VERSION}/files"
-else
-    NGC_BASE="https://api.ngc.nvidia.com/v2/models/${NGC_ORG}/${MODEL_NAME}/versions/${NGC_VERSION}/files"
-fi
-
-JSON="$(curl -fsSL --proto '=https' --tlsv1.2 --max-time 30 "${NGC_BASE}/" 2>/dev/null || true)"
-
-if [[ -z "$JSON" ]]; then
-    echo "ERROR: Could not retrieve file list from NGC API" >&2
-    echo "URL: ${NGC_BASE}/" >&2
-    exit 1
-fi
-
-python3 - "$JSON" <<'PYEOF'
-import json, sys
-data = sys.argv[1]
-try:
-    files = json.loads(data)
-except json.JSONDecodeError as e:
-    print(f"ERROR parsing NGC file list: {e}", file=sys.stderr)
-    sys.exit(1)
-if isinstance(files, dict):
-    files = files.get("modelFiles", [])
-# Anything that is not a list of objects (null, string, number) yields no names.
-names = [f.get("name", "") for f in files if isinstance(f, dict)] if isinstance(files, list) else []
-for n in names:
-    if n:
-        print(n)
-PYEOF
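
The embedded parser in the removed `ngc-list-files.sh` accepts two response shapes: a bare JSON list of file objects, or an object with a `modelFiles` list. A self-contained sketch of that normalization is given below. Whether the NGC API actually returns either shape is not verified here, and `extract_names` is a hypothetical helper.

```python
import json
from typing import List


def extract_names(payload: str) -> List[str]:
    """Return non-empty "name" values from either accepted response shape."""
    data = json.loads(payload)
    if isinstance(data, dict):
        data = data.get("modelFiles", [])
    if not isinstance(data, list):
        return []  # null, string, or number: nothing to list
    return [f["name"] for f in data if isinstance(f, dict) and f.get("name")]
```

Returning an empty list for unexpected types keeps the caller's existing check for an empty file list as the single failure path.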
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/model/safetensors-to-onnx.sh b/skills/deepstream/deepstream-import-vision-model/scripts/model/safetensors-to-onnx.sh
deleted file mode 100755
index 9be30298..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/model/safetensors-to-onnx.sh
+++ /dev/null
@@ -1,90 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-################################################################################
-# Step 1 (alternate): Convert SafeTensors model to ONNX using optimum-cli.
-# Uses an isolated Python venv with optimum, transformers, torch.
-# If a venv already exists at the target location, it reuses it.
-#
-# Usage: ./safetensors-to-onnx.sh <hf_model_id_or_path> <output_dir> [--opset 17] [--dtype fp16]
-# Examples:
-#   ./safetensors-to-onnx.sh facebook/detr-resnet-50 ./onnx_export
-#   ./safetensors-to-onnx.sh facebook/detr-resnet-50 ./onnx_export --opset 17 --dtype fp16
-#   ./safetensors-to-onnx.sh ./local_model_dir ./onnx_export
-################################################################################
-set -euo pipefail
-
-MODEL="${1:-}"
-OUTPUT_DIR="${2:-}"
-
-if [ -z "$MODEL" ] || [ -z "$OUTPUT_DIR" ]; then
-    echo "Usage: $0 <hf_model_id_or_path> <output_dir> [extra optimum-cli args]"
-    echo ""
-    echo "Examples:"
-    echo "  $0 facebook/detr-resnet-50 ./onnx_export"
-    echo "  $0 facebook/detr-resnet-50 ./onnx_export --opset 17 --dtype fp16"
-    exit 1
-fi
-shift 2
-EXTRA_ARGS=("$@")
-
-SCRIPT_DIR="$(cd "$(dirname "$0")" && pwd)"
-REPO_ROOT="$(cd "$SCRIPT_DIR/../../.." && pwd)"
-mkdir -p "$REPO_ROOT/build"
-VENV_DIR="$REPO_ROOT/build/.venv_optimum"
-
-echo "=== SafeTensors → ONNX Export ==="
-echo "Model:      $MODEL"
-echo "Output dir: $OUTPUT_DIR"
-echo "Extra args: ${EXTRA_ARGS[*]}"
-echo "Venv:       $VENV_DIR"
-echo ""
-
-# Create venv if it doesn't exist
-if [ ! -f "$VENV_DIR/bin/optimum-cli" ]; then
-    echo "Creating Python venv with optimum..."
-    python3 -m venv "$VENV_DIR" || { echo "Failed to create venv at $VENV_DIR"; exit 1; }
-    source "$VENV_DIR/bin/activate" || { echo "Failed to activate venv"; exit 1; }
-    pip install --upgrade pip -q || { echo "Failed to upgrade pip"; exit 1; }
-    pip install "optimum[exporters]>=1.20,<2.0" "torch<2.12" transformers onnxruntime matplotlib numpy markdown -q || { echo "Failed to install packages"; exit 1; }
-    echo "Venv created and packages installed."
-    echo ""
-else
-    source "$VENV_DIR/bin/activate"
-    echo "Reusing existing venv."
-    echo ""
-fi
-
-# Run export
-echo "Running: optimum-cli export onnx -m $MODEL ${EXTRA_ARGS[*]} $OUTPUT_DIR"
-echo ""
-EXIT_CODE=0
-optimum-cli export onnx -m "$MODEL" "${EXTRA_ARGS[@]}" "$OUTPUT_DIR" || EXIT_CODE=$?
-
-deactivate 2>/dev/null
-
-if [ $EXIT_CODE -eq 0 ]; then
-    echo ""
-    echo "=== Export Complete ==="
-    echo "ONNX files:"
-    ls -lh "$OUTPUT_DIR"/*.onnx 2>/dev/null || true
-else
-    echo ""
-    echo "=== Export FAILED (exit code: $EXIT_CODE) ==="
-fi
-
-exit $EXIT_CODE
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/generate-benchmark-charts.py b/skills/deepstream/deepstream-import-vision-model/scripts/report/generate-benchmark-charts.py
deleted file mode 100644
index f7b22d5f..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/generate-benchmark-charts.py
+++ /dev/null
@@ -1,275 +0,0 @@
-#!/usr/bin/env python3
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-"""
-Step 8: Generate exactly 5 benchmark charts as PNG images for the report.
-
-Usage: python3 generate-benchmark-charts.py <output_dir> <json_data_file>
-
-Expected JSON format (benchmark_data.json written by nv-import-vision-model-report skill pre-flight):
-{
-  "model_name": "yolo26_nano",
-  "engine": "models/yolo26_nano/benchmarks/engines/yolo26n_dynamic_b256.engine",
-  "max_bs": 256,
-  "trtexec": {
-    "bs1":   {"qps": 220.5, "gpu_mean_ms": 4.53},
-    "bsmax": {"qps": 39.2,  "gpu_mean_ms": 103.7, "p99_ms": 105.1, "imgs_per_sec": 10035.2}
-  },
-  "peak_gpu_streams": 334,
-  "deepstream": {
-    "run1": {"streams": 334, "total_fps": 7850.0, "fps_per_stream": 23.5},
-    "run2": {"streams": 238, "total_fps": 7378.4, "fps_per_stream": 31.0}
-  }
-}
-
-Outputs (fixed names — do not rename):
-  chart_trtexec_bs1_vs_bsmax.png   — grouped bar: QPS BS=1 vs BS=MAX_BS
-  chart_trtexec_throughput.png     — bar: imgs/sec at MAX_BS + PEAK_GPU_STREAMS annotation
-  chart_ds_streams_vs_fps.png      — line: stream count vs fps/stream, 30fps threshold
-  chart_trt_vs_ds.png              — grouped bar: trtexec vs DS Run1 vs DS Run2 total imgs/s
-  chart_efficiency.png             — bar: DS Run1 and Run2 pipeline efficiency %
-"""
-import sys
-import json
-import os
-import matplotlib
-matplotlib.use('Agg')
-import matplotlib.pyplot as plt
-
-plt.rcParams.update({
-    'figure.facecolor': 'white',
-    'axes.facecolor': '#FAFAFA',
-    'axes.grid': True,
-    'grid.alpha': 0.3,
-    'font.size': 11,
-    'axes.titlesize': 13,
-    'axes.titleweight': 'bold',
-})
-
-COLORS = {
-    'blue':   '#2196F3',
-    'green':  '#4CAF50',
-    'orange': '#FF9800',
-    'pink':   '#E91E63',
-    'purple': '#9C27B0',
-    'teal':   '#00BCD4',
-    'red':    '#FF5722',
-}
-
-
-def two_line_title(model_name, subtitle):
-    """Two-line title: model name (line 1) + subtitle (line 2)."""
-    return f'{model_name}\n{subtitle}'
-
-
-def chart_trtexec_bs1_vs_bsmax(data, output_dir):
-    """Grouped bar chart: QPS at BS=1 vs BS=MAX_BS side by side."""
-    max_bs = data['max_bs']
-    qps_bs1   = data['trtexec']['bs1']['qps']
-    qps_bsmax = data['trtexec']['bsmax']['qps']
-
-    labels = ['BS=1', f'BS={max_bs}']
-    values = [qps_bs1, qps_bsmax]
-    colors = [COLORS['blue'], COLORS['green']]
-
-    fig, ax = plt.subplots(figsize=(10, 6))
-    bars = ax.bar(labels, values, color=colors, width=0.5, edgecolor='white', linewidth=1.5)
-    for bar, val in zip(bars, values):
-        ax.text(bar.get_x() + bar.get_width() / 2, bar.get_height() + max(values) * 0.01,
-                f'{val:.1f}', ha='center', va='bottom', fontweight='bold', fontsize=13)
-    ax.set_ylabel('QPS (queries/sec)', fontsize=13)
-    ax.set_ylim(0, max(values) * 1.18)
-    ax.grid(axis='y', alpha=0.3)
-    ax.set_title(two_line_title(data['model_name'], f'trtexec QPS: BS=1 vs BS={max_bs}'))
-    plt.tight_layout()
-    out = os.path.join(output_dir, 'chart_trtexec_bs1_vs_bsmax.png')
-    fig.savefig(out, dpi=150)
-    plt.close(fig)
-    print(f'  chart_trtexec_bs1_vs_bsmax.png')
-
-
-def chart_trtexec_throughput(data, output_dir):
-    """Single bar: GPU-only imgs/sec at MAX_BS with PEAK_GPU_STREAMS annotation."""
-    max_bs          = data['max_bs']
-    imgs_per_sec    = data['trtexec']['bsmax']['imgs_per_sec']
-    peak_streams    = data['peak_gpu_streams']
-    realtime_imgs   = peak_streams * 30  # the throughput that satisfies peak_streams at 30fps
-
-    fig, ax = plt.subplots(figsize=(10, 6))
-    bar = ax.bar([f'BS={max_bs}'], [imgs_per_sec], color=COLORS['blue'], width=0.4,
-                 edgecolor='white', linewidth=1.5)
-    ax.text(bar[0].get_x() + bar[0].get_width() / 2, imgs_per_sec + imgs_per_sec * 0.01,
-            f'{imgs_per_sec:.0f}', ha='center', va='bottom', fontweight='bold', fontsize=13)
-
-    # Annotation line at PEAK_GPU_STREAMS × 30fps threshold
-    ax.axhline(y=realtime_imgs, color=COLORS['red'], linestyle='--', linewidth=2,
-               label=f'PEAK_GPU_STREAMS={peak_streams} × 30fps = {realtime_imgs:.0f} imgs/s')
-    ax.text(0.98, realtime_imgs + imgs_per_sec * 0.01,
-            f'PEAK={peak_streams} streams',
-            ha='right', va='bottom', color=COLORS['red'], fontsize=10, fontweight='bold',
-            transform=ax.get_yaxis_transform())
-
-    ax.set_ylabel('Images / sec', fontsize=13)
-    ax.set_ylim(0, imgs_per_sec * 1.25)
-    ax.grid(axis='y', alpha=0.3)
-    ax.legend(loc='upper left', fontsize=10)
-    ax.set_title(two_line_title(data['model_name'],
-                                f'GPU Throughput at BS={max_bs} (PEAK_GPU_STREAMS={peak_streams})'))
-    plt.tight_layout()
-    out = os.path.join(output_dir, 'chart_trtexec_throughput.png')
-    fig.savefig(out, dpi=150)
-    plt.close(fig)
-    print(f'  chart_trtexec_throughput.png')
-
-
-def chart_ds_streams_vs_fps(data, output_dir):
-    """Line chart: X=stream count, Y=fps/stream. Red dashed line at 30fps."""
-    run1 = data['deepstream']['run1']
-    run2 = data['deepstream']['run2']
-
-    stream_counts = [run1['streams'], run2['streams']]
-    fps_vals      = [run1['fps_per_stream'], run2['fps_per_stream']]
-
-    # Sort by stream count ascending
-    pairs = sorted(zip(stream_counts, fps_vals))
-    stream_counts = [p[0] for p in pairs]
-    fps_vals      = [p[1] for p in pairs]
-
-    fig, ax = plt.subplots(figsize=(10, 6))
-    ax.plot(stream_counts, fps_vals, color=COLORS['blue'], linewidth=2.5,
-            marker='o', markersize=10, zorder=4)
-    for sc, fp in zip(stream_counts, fps_vals):
-        ax.text(sc, fp + max(fps_vals) * 0.025,
-                f'{fp:.1f} fps', ha='center', va='bottom', fontweight='bold', fontsize=12)
-
-    ax.axhline(y=30, color=COLORS['red'], linestyle='--', linewidth=2,
-               label='30 fps/stream real-time threshold')
-
-    ax.set_xlabel('Stream Count', fontsize=13)
-    ax.set_ylabel('FPS / Stream', fontsize=13)
-    lower = -max(fps_vals) * 0.15
-    ax.set_ylim(lower, max(fps_vals) * 1.3)
-    ax.set_xticks(stream_counts)
-    ax.grid(axis='y', alpha=0.3)
-    ax.legend(loc='upper right', fontsize=10)
-
-    # Label each point
-    run_labels = {run1['streams']: 'Run 1\n(PEAK_GPU_STREAMS)',
-                  run2['streams']: 'Run 2\n(RT_STREAMS)'}
-    for sc in stream_counts:
-        ax.annotate(run_labels.get(sc, ''), xy=(sc, 0), xytext=(sc, -max(fps_vals) * 0.12),
-                    ha='center', fontsize=9, color='#555555')
-
-    ax.set_title(two_line_title(data['model_name'], 'DeepStream: FPS/Stream vs Stream Count'))
-    plt.tight_layout()
-    out = os.path.join(output_dir, 'chart_ds_streams_vs_fps.png')
-    fig.savefig(out, dpi=150)
-    plt.close(fig)
-    print(f'  chart_ds_streams_vs_fps.png')
-
-
-def chart_trt_vs_ds(data, output_dir):
-    """Grouped bars: trtexec total imgs/s | DS Run 1 total imgs/s | DS Run 2 total imgs/s."""
-    max_bs       = data['max_bs']
-    trt_imgs     = data['trtexec']['bsmax']['imgs_per_sec']
-    ds1_imgs     = data['deepstream']['run1']['total_fps']
-    ds2_imgs     = data['deepstream']['run2']['total_fps']
-    n1           = data['deepstream']['run1']['streams']
-    n2           = data['deepstream']['run2']['streams']
-
-    labels = [f'trtexec\nBS={max_bs}', f'DS Run 1\n({n1} streams)', f'DS Run 2\n({n2} streams)']
-    values = [trt_imgs, ds1_imgs, ds2_imgs]
-    colors = [COLORS['pink'], COLORS['blue'], COLORS['green']]
-
-    fig, ax = plt.subplots(figsize=(10, 6))
-    bars = ax.bar(labels, values, color=colors, width=0.5, edgecolor='white', linewidth=1.5)
-    for bar, val in zip(bars, values):
-        ax.text(bar.get_x() + bar.get_width() / 2, bar.get_height() + max(values) * 0.01,
-                f'{val:.0f}', ha='center', va='bottom', fontweight='bold', fontsize=13)
-    ax.set_ylabel('Total Images / sec', fontsize=13)
-    ax.set_ylim(0, max(values) * 1.18)
-    ax.grid(axis='y', alpha=0.3)
-    ax.set_title(two_line_title(data['model_name'], 'trtexec vs DeepStream: Total Throughput'))
-    plt.tight_layout()
-    out = os.path.join(output_dir, 'chart_trt_vs_ds.png')
-    fig.savefig(out, dpi=150)
-    plt.close(fig)
-    print(f'  chart_trt_vs_ds.png')
-
-
-def chart_efficiency(data, output_dir):
-    """Bar chart: DS Run 1 and Run 2 pipeline efficiency %, dashed line at 100%."""
-    trt_imgs  = data['trtexec']['bsmax']['imgs_per_sec']
-    ds1_imgs  = data['deepstream']['run1']['total_fps']
-    ds2_imgs  = data['deepstream']['run2']['total_fps']
-    n1        = data['deepstream']['run1']['streams']
-    n2        = data['deepstream']['run2']['streams']
-
-    if trt_imgs <= 0:
-        print("ERROR: trtexec imgs_per_sec is zero or negative — cannot compute efficiency", file=sys.stderr)
-        sys.exit(1)
-    eff1 = round(ds1_imgs / trt_imgs * 100, 1)
-    eff2 = round(ds2_imgs / trt_imgs * 100, 1)
-
-    labels = [f'DS Run 1\n({n1} streams)', f'DS Run 2\n({n2} streams)']
-    values = [eff1, eff2]
-    colors = [COLORS['purple'], COLORS['teal']]
-
-    fig, ax = plt.subplots(figsize=(10, 6))
-    bars = ax.bar(labels, values, color=colors, width=0.4, edgecolor='white', linewidth=1.5)
-    ax.axhline(y=100, color='#333333', linestyle='--', linewidth=1.5, alpha=0.6,
-               label='100% efficiency')
-    for bar, val in zip(bars, values):
-        ax.text(bar.get_x() + bar.get_width() / 2, bar.get_height() + 0.5,
-                f'{val}%', ha='center', va='bottom', fontweight='bold', fontsize=13)
-    ax.set_ylabel('DS Efficiency (%)', fontsize=13)
-    ax.set_ylim(0, max(max(values), 100) * 1.2)  # keep the 100% reference line in view
-    ax.grid(axis='y', alpha=0.3)
-    ax.legend(loc='upper right', fontsize=10)
-    ax.set_title(two_line_title(data['model_name'], 'DeepStream Pipeline Efficiency vs trtexec'))
-    plt.tight_layout()
-    out = os.path.join(output_dir, 'chart_efficiency.png')
-    fig.savefig(out, dpi=150)
-    plt.close(fig)
-    print(f'  chart_efficiency.png')
-
-
-def main():
-    if len(sys.argv) != 3:
-        print(f"Usage: {sys.argv[0]} <output_dir> <json_data_file>")
-        sys.exit(1)
-
-    output_dir = sys.argv[1]
-    json_file  = sys.argv[2]
-
-    os.makedirs(output_dir, exist_ok=True)
-
-    with open(json_file) as f:
-        data = json.load(f)
-
-    model = data.get('model_name', 'unknown')
-    print(f"Generating 5 charts for {model} -> {output_dir}/")
-    chart_trtexec_bs1_vs_bsmax(data, output_dir)
-    chart_trtexec_throughput(data, output_dir)
-    chart_ds_streams_vs_fps(data, output_dir)
-    chart_trt_vs_ds(data, output_dir)
-    chart_efficiency(data, output_dir)
-    print("Done — 5 charts written.")
-
-
-if __name__ == "__main__":
-    main()
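
The numbers plotted by the removed chart script are related by simple arithmetic. The efficiency formula below is the one used in `chart_efficiency`. The other two relations (images/sec as QPS times batch size, and peak streams as the floor of images/sec divided by 30) are assumptions inferred from the sample JSON in the docstring, not stated by the script. All function names are hypothetical.

```python
def imgs_per_sec(qps: float, batch_size: int) -> float:
    """Assumed relation: each query processes one full batch."""
    return qps * batch_size


def realtime_streams(throughput: float, fps: float = 30.0) -> int:
    """Assumed relation: streams sustainable at `fps` frames per second."""
    return int(throughput // fps)


def efficiency_pct(ds_total_fps: float, trt_imgs_per_sec: float) -> float:
    """DeepStream throughput as a percentage of trtexec throughput."""
    if trt_imgs_per_sec <= 0:
        raise ValueError("trtexec imgs_per_sec must be positive")
    return round(ds_total_fps / trt_imgs_per_sec * 100, 1)
```

With the docstring's sample values, 39.2 QPS at batch 256 gives 10035.2 images/sec, which supports 334 streams at 30 fps, and the two DeepStream runs come out at 78.2% and 73.5% efficiency.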
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/latex-pdf-wrap.tex b/skills/deepstream/deepstream-import-vision-model/scripts/report/latex-pdf-wrap.tex
deleted file mode 100644
index b896efce..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/latex-pdf-wrap.tex
+++ /dev/null
@@ -1,53 +0,0 @@
-% SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-% SPDX-License-Identifier: Apache-2.0
-%
-% Licensed under the Apache License, Version 2.0 (the "License");
-% you may not use this file except in compliance with the License.
-% You may obtain a copy of the License at
-%
-% http://www.apache.org/licenses/LICENSE-2.0
-%
-% Unless required by applicable law or agreed to in writing, software
-% distributed under the License is distributed on an "AS IS" BASIS,
-% WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-% See the License for the specific language governing permissions and
-% limitations under the License.
-
-% PDF layout fixes for pandoc -> pdflatex (agent-design-document).
-% Code: listings wraps long lines. Pandoc's default syntax-highlighted Verbatim does NOT
-% break inside a single \NormalTok{...} token.
-
-\usepackage{listings}
-\usepackage{xcolor}
-
-\definecolor{codebg}{gray}{0.94}
-\lstset{
-  backgroundcolor=\color{codebg},
-  basicstyle=\ttfamily\footnotesize,
-  breaklines=true,
-  breakatwhitespace=false,
-  columns=fullflexible,
-  keepspaces=true,
-  tabsize=2,
-  showstringspaces=false,
-  xleftmargin=0.4em,
-  framexleftmargin=0.4em,
-  frame=single,
-  framerule=0.4pt,
-  rulecolor=\color{black!25}
-}
-
-% Ragged-right body text: avoids huge inter-word spaces and overfull lines from justification
-\usepackage{ragged2e}
-\AtBeginDocument{\RaggedRight}
-\emergencystretch=12em
-\sloppy
-
-% Extra break points for \url/\path if hyperref/xurl loaded by pandoc (after preamble)
-\makeatletter
-\AtBeginDocument{%
-  \@ifundefined{UrlBreaks}{}{%
-    \g@addto@macro\UrlBreaks{\do\/\do\-\do\_\do\.\do\:}%
-  }%
-}
-\makeatother
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/md-to-html-pdf.py b/skills/deepstream/deepstream-import-vision-model/scripts/report/md-to-html-pdf.py
deleted file mode 100644
index b6f6a6da..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/md-to-html-pdf.py
+++ /dev/null
@@ -1,162 +0,0 @@
-#!/usr/bin/env python3
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-"""
-Convert a GFM-style Markdown benchmark report to a styled HTML file and then
-to PDF via wkhtmltopdf.
-
-Images referenced as ![alt](file.png) are resolved relative to the markdown
-file's directory and embedded as base64 data URIs so the HTML is self-contained.
-
-Usage:
-    python3 md-to-html-pdf.py <report.md> <style.css> <output_dir> [model_name]
-
-    model_name (optional): if provided, PDF is named benchmark_report_{model_name}.pdf
-                           if omitted, derived from output_dir parent folder name
-
-Produces:
-    <output_dir>/benchmark_report.html
-    <output_dir>/benchmark_report_{model_name}.pdf
-"""
-import sys
-import os
-import re
-import base64
-import subprocess
-import markdown
-
-def embed_images(html: str, base_dir: str) -> str:
-    """Replace <img src="file.png"> with base64-embedded data URIs."""
-    def replacer(match):
-        prefix = match.group(1)
-        src = match.group(2)
-        suffix = match.group(3)
-        # Skip URLs, data: URIs (which have no "//"), and absolute paths
-        if re.match(r'^(?:https?://|ftp://|data:)', src) or os.path.isabs(src):
-            return match.group(0)
-        img_path = os.path.realpath(os.path.join(base_dir, src))
-        base_real = os.path.realpath(base_dir)
-        # Reject path traversal outside base_dir
-        if not img_path.startswith(base_real + os.sep) and img_path != base_real:
-            return match.group(0)
-        if os.path.isfile(img_path):
-            ext = os.path.splitext(src)[1].lstrip('.').lower()
-            mime = {'png': 'image/png', 'jpg': 'image/jpeg',
-                    'jpeg': 'image/jpeg', 'svg': 'image/svg+xml',
-                    'gif': 'image/gif'}.get(ext, 'image/png')
-            with open(img_path, 'rb') as f:
-                b64 = base64.b64encode(f.read()).decode()
-            return f'{prefix}data:{mime};base64,{b64}{suffix}'
-        return match.group(0)
-    return re.sub(r'(<img\s[^>]*src=["\'])([^"\']+)(["\'])', replacer, html)
-
-def main():
-    if len(sys.argv) not in (4, 5):
-        print(f"Usage: {sys.argv[0]} <report.md> <style.css> <output_dir> [model_name]")
-        sys.exit(1)
-
-    md_path = sys.argv[1]
-    css_path = sys.argv[2]
-    out_dir = sys.argv[3]
-    os.makedirs(out_dir, exist_ok=True)
-
-    # Derive model name: explicit arg > parent-of-output_dir > "model"
-    if len(sys.argv) == 5:
-        model_name = sys.argv[4]
-    else:
-        # output_dir is typically models/{model_name}/reports/, so its parent directory name is the model name
-        abs_out = os.path.abspath(out_dir)
-        model_name = os.path.basename(os.path.dirname(abs_out)) or "model"
-
-    base_dir = os.path.dirname(os.path.abspath(md_path))
-
-    with open(md_path, encoding='utf-8') as f:
-        md_text = f.read()
-
-    # Strip YAML frontmatter
-    md_text = re.sub(r'^---\n.*?\n---\n', '', md_text, count=1, flags=re.DOTALL)
-
-    with open(css_path) as f:
-        css = f.read()
-
-    # Convert markdown to HTML
-    html_body = markdown.markdown(md_text, extensions=['tables', 'fenced_code'])
-
-    # Wrap in full HTML document
-    html = f"""<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<title>DeepStream Benchmark Report — {model_name}</title>
-<style>
-{css}
-@media print {{
-  body {{ max-width: 100%; padding: 10px; }}
-  img {{ max-width: 100%; page-break-inside: avoid; }}
-  table {{ page-break-inside: avoid; }}
-  h2 {{ page-break-after: avoid; }}
-}}
-</style>
-</head>
-<body>
-{html_body}
-</body>
-</html>"""
-
-    # Embed images as base64
-    html = embed_images(html, base_dir)
-
-    html_out = os.path.join(out_dir, 'benchmark_report.html')
-    pdf_out = os.path.join(out_dir, f'benchmark_report_{model_name}.pdf')
-
-    with open(html_out, 'w', encoding='utf-8') as f:
-        f.write(html)
-    print(f"  HTML: {html_out}")
-
-    # Convert to PDF.
-    # Intentionally NOT passing --enable-local-file-access: all images have already
-    # been converted to base64 data: URIs by embed_images(), and the CSS is inlined
-    # in <style>...</style>, so no file:// fetching is needed. Keeping it disabled
-    # blocks a CSS/HTML-injection exfil vector if the upstream Markdown ever carries
-    # untrusted content (e.g. an HF model card).
-    result = subprocess.run(
-        [
-            'wkhtmltopdf',
-            '--page-size', 'A4',
-            '--margin-top', '15mm',
-            '--margin-bottom', '15mm',
-            '--margin-left', '15mm',
-            '--margin-right', '15mm',
-            '--image-quality', '100',
-            '--no-outline',
-            html_out, pdf_out,
-        ],
-        stdout=subprocess.PIPE,
-        stderr=subprocess.PIPE,
-        text=True,
-        shell=False,
-        timeout=300,
-    )
-
-    if result.returncode == 0:
-        print(f"  PDF:  {pdf_out}")
-    else:
-        print(f"  PDF generation failed: {result.stderr[:500]}", file=sys.stderr)
-        sys.exit(1)
-
-if __name__ == '__main__':
-    main()
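Review note on `embed_images()`: the traversal guard compares resolved paths, so it can be exercised on its own. The following is a minimal sketch, assuming a hypothetical helper name `is_inside` that is not part of the deleted script; it mirrors the same `os.path.realpath` prefix comparison and nothing else.

```python
import os
import tempfile


def is_inside(base_dir: str, src: str) -> bool:
    """Return True if src, resolved against base_dir, stays within base_dir.

    Hypothetical helper for illustration; it reproduces the realpath
    prefix check used by embed_images() in the deleted script.
    """
    base_real = os.path.realpath(base_dir)
    target = os.path.realpath(os.path.join(base_dir, src))
    # Appending os.sep prevents a sibling such as "<base>x" from matching.
    return target == base_real or target.startswith(base_real + os.sep)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as base:
        print(is_inside(base, "charts/chart_efficiency.png"))
        print(is_inside(base, "../outside.png"))
```

The paths do not need to exist for the check to work, because `realpath` normalizes `..` components either way.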
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/md-to-pdf.sh b/skills/deepstream/deepstream-import-vision-model/scripts/report/md-to-pdf.sh
deleted file mode 100755
index f3b15001..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/md-to-pdf.sh
+++ /dev/null
@@ -1,70 +0,0 @@
-#!/usr/bin/env bash
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-# Convert GitHub-Flavored Markdown (with optional Mermaid diagrams) to PDF with
-# correct wrapping: listings for code, Lua filter for tables/inline paths, LaTeX header.
-#
-# Usage:
-#   ./md-to-pdf.sh <source.md> [output.pdf]
-# If output.pdf is omitted, writes <source>.pdf next to the source file.
-#
-# Requires: mmdc (Mermaid CLI), pandoc, pdflatex, packages: listings, xcolor, ragged2e.
-#
-# Do NOT replace this with plain "pandoc --highlight-style=..." — highlighted Verbatim
-# boxes do not wrap long lines; --listings + latex-pdf-wrap.tex + pandoc-wrap-tables.lua are required.
-set -euo pipefail
-SCRIPT_DIR="$(cd "$(dirname "$0")" && pwd)"
-
-SRC_INPUT="${1:?Usage: $0 <markdown.md> [output.pdf]}"
-if [[ "$SRC_INPUT" != /* ]]; then
-  SRC="$(cd "$(dirname "$SRC_INPUT")" && pwd)/$(basename "$SRC_INPUT")"
-else
-  SRC="$SRC_INPUT"
-fi
-[[ -f "$SRC" ]] || { echo "error: file not found: $SRC" >&2; exit 1; }
-
-SRC_DIR="$(dirname "$SRC")"
-
-if [[ -n "${2-}" ]]; then
-  OUT="$2"
-  if [[ "$OUT" != /* ]]; then
-    OUT="$(pwd)/$OUT"
-  fi
-else
-  OUT="${SRC%.md}.pdf"
-fi
-
-STEM="$(basename "$SRC" .md)"
-INTERMEDIATE="${SRC_DIR}/${STEM}._pdf.md"
-IMG_DIR="${SRC_DIR}/mermaid_pdf/${STEM}"
-
-python3 "$SCRIPT_DIR/render-mermaid-for-pdf.py" \
-  --img-dir "$IMG_DIR" \
-  "$SRC" \
-  "$INTERMEDIATE"
-
-pandoc "$INTERMEDIATE" \
-  --from=gfm \
-  --lua-filter="$SCRIPT_DIR/pandoc-wrap-tables.lua" \
-  --include-in-header="$SCRIPT_DIR/latex-pdf-wrap.tex" \
-  --pdf-engine=pdflatex \
-  -V geometry:margin=1in \
-  --listings \
-  --resource-path="$SRC_DIR:$SCRIPT_DIR" \
-  -o "$OUT"
-
-echo "Wrote $OUT"
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/mermaid-puppeteer-root.json b/skills/deepstream/deepstream-import-vision-model/scripts/report/mermaid-puppeteer-root.json
deleted file mode 100644
index 251b509e..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/mermaid-puppeteer-root.json
+++ /dev/null
@@ -1,3 +0,0 @@
-{
-  "args": ["--no-sandbox", "--disable-setuid-sandbox", "--disable-dev-shm-usage"]
-}
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/mermaid-puppeteer.json b/skills/deepstream/deepstream-import-vision-model/scripts/report/mermaid-puppeteer.json
deleted file mode 100644
index 08e970da..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/mermaid-puppeteer.json
+++ /dev/null
@@ -1,3 +0,0 @@
-{
-  "args": ["--disable-dev-shm-usage"]
-}
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/pandoc-wrap-tables.lua b/skills/deepstream/deepstream-import-vision-model/scripts/report/pandoc-wrap-tables.lua
deleted file mode 100644
index 2f4f6603..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/pandoc-wrap-tables.lua
+++ /dev/null
@@ -1,70 +0,0 @@
--- SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
--- SPDX-License-Identifier: Apache-2.0
---
--- Licensed under the Apache License, Version 2.0 (the "License");
--- you may not use this file except in compliance with the License.
--- You may obtain a copy of the License at
---
--- http://www.apache.org/licenses/LICENSE-2.0
---
--- Unless required by applicable law or agreed to in writing, software
--- distributed under the License is distributed on an "AS IS" BASIS,
--- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
--- See the License for the specific language governing permissions and
--- limitations under the License.
-
--- PDF/LaTeX fixes for pandoc: wrapped table columns + breakable long paths in \texttt.
--- listings + pdflatex choke on Unicode in code blocks; normalize before LaTeX.
-
-function CodeBlock(block)
-  block.text = block.text
-    :gsub("\u{2019}", "'")
-    :gsub("\u{2018}", "'")
-    :gsub("\u{201c}", '"')
-    :gsub("\u{201d}", '"')
-    :gsub("\u{2014}", "--")
-    :gsub("\u{2013}", "-")
-    :gsub("\u{00d7}", " x ")
-    :gsub("\u{00a0}", " ")
-    :gsub("\u{2192}", "->") -- Unicode arrow (listings + pdflatex)
-  return block
-end
-
-function Table(tbl)
-  local specs = tbl.colspecs
-  if not specs or #specs == 0 then
-    return tbl
-  end
-  local n = #specs
-  local w = 1.0 / n
-  for i, spec in ipairs(specs) do
-    local align = spec[1]
-    -- Second field: fraction of \linewidth (pandoc LaTeX writer)
-    tbl.colspecs[i] = { align, w }
-  end
-  return tbl
-end
-
--- Long path-like inline code does not wrap in LaTeX \texttt; add \allowbreak after each /.
--- (Inline Code has no .format; do not gate on el.format — nil ~= "" is true in Lua and would skip all.)
-function Code(el)
-  local t = el.text
-  if not t:find("/", 1, true) then
-    return nil
-  end
-  if #t < 32 and not t:match("/home/") and not t:match("%.sh") then
-    return nil
-  end
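-  -- Escape in a single pass: chained gsubs would re-escape the braces that
-  -- \textbackslash{} introduces, and a lone "%" in a replacement string is invalid.
-  local escapes = {
-    ["\\"] = "\\textbackslash{}",
-    ["^"] = "\\textasciicircum{}",
-    ["_"] = "\\_", ["{"] = "\\{", ["}"] = "\\}",
-    ["$"] = "\\$", ["#"] = "\\#",
-    ["&"] = "\\&", ["%"] = "\\%",
-  }
-  local out = t:gsub("[\\_{}$#^&%%]", escapes)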
-  out = out:gsub("/", "/\\allowbreak ")
-  return pandoc.RawInline("latex", "\\texttt{" .. out .. "}")
-end
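Review note on LaTeX escaping for `\texttt{}`: escaping is order-sensitive when it is done as a chain of replacements, because a later step can touch the output of an earlier one. The sketch below is illustrative only. It is written in Python rather than Lua, and the names `escape_chained` and `escape_single_pass` are hypothetical. It contrasts a backslash-first chain with a single-pass substitution driven by a lookup table.

```python
import re

# LaTeX replacements for characters that are special inside \texttt{...}.
LATEX_ESCAPES = {
    "\\": r"\textbackslash{}",
    "^": r"\textasciicircum{}",
    "_": r"\_",
    "{": r"\{",
    "}": r"\}",
    "$": r"\$",
    "#": r"\#",
    "&": r"\&",
    "%": r"\%",
}
_SPECIAL = re.compile(r"[\\_{}$#^&%]")


def escape_single_pass(text: str) -> str:
    """Escape every special character exactly once."""
    return _SPECIAL.sub(lambda m: LATEX_ESCAPES[m.group(0)], text)


def escape_chained(text: str) -> str:
    """Backslash first, then braces: the braces just inserted get escaped too."""
    out = text.replace("\\", r"\textbackslash{}")
    return out.replace("{", r"\{").replace("}", r"\}")


if __name__ == "__main__":
    print(escape_single_pass("a\\b"))
    print(escape_chained("a\\b"))
```

With the chained version, a path containing a backslash ends up with literal `\{\}` after `\textbackslash`, which prints stray braces in the PDF.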
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/render-mermaid-for-pdf.py b/skills/deepstream/deepstream-import-vision-model/scripts/report/render-mermaid-for-pdf.py
deleted file mode 100644
index 4e4bc9e6..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/render-mermaid-for-pdf.py
+++ /dev/null
@@ -1,205 +0,0 @@
-#!/usr/bin/env python3
-
-# SPDX-FileCopyrightText: Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
-# SPDX-License-Identifier: Apache-2.0
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-"""
-Expand ```mermaid ... ``` blocks in a Markdown file into PNG images via mmdc,
-producing a new .md suitable for pandoc -> PDF. Does not modify the source file.
-
-Full PDF pipeline (see docs/md-to-pdf.sh and docs/build-pdf.sh):
-  1. This script: Mermaid -> PNG under docs/mermaid_pdf/<stem>/, replace blocks with ![...](...) links.
-  2. pandoc --from=gfm --listings --lua-filter=pandoc-wrap-tables.lua
-     --include-in-header=latex-pdf-wrap.tex --pdf-engine=pdflatex
-
-Use --listings (not --highlight-style): default highlighted Verbatim splits code into
-unbreakable tokens and overflows the page. The Lua filter wraps pipe tables and long
-path-like inline code; CodeBlock text is normalized for pdflatex (Unicode quotes, etc.).
-"""
-from __future__ import annotations
-
-import argparse
-import os
-import re
-import subprocess
-import sys
-from pathlib import Path
-
-MERMAID_BLOCK = re.compile(
-    r"^```mermaid\s*\n(.*?)^```\s*$",
-    re.MULTILINE | re.DOTALL,
-)
-
-
-def render_one(
-    mmdc: str,
-    body: str,
-    out_png: Path,
-    width: int,
-    scale: float,
-    puppeteer_config: Path | None,
-) -> None:
-    out_png.parent.mkdir(parents=True, exist_ok=True)
-    tmp = out_png.with_suffix(".mmd")
-    tmp.write_text(body.strip() + "\n", encoding="utf-8")
-    cmd = [
-        mmdc,
-        "-i",
-        str(tmp),
-        "-o",
-        str(out_png),
-        "-e",
-        "png",
-        "-b",
-        "white",
-        "-w",
-        str(width),
-        "-s",
-        str(scale),
-        "-q",
-    ]
-    if puppeteer_config is not None:
-        cmd.extend(["-p", str(puppeteer_config)])
-    r = subprocess.run(
-        cmd,
-        stdout=subprocess.PIPE,
-        stderr=subprocess.PIPE,
-        text=True,
-        shell=False,
-        timeout=120,
-    )
-    tmp.unlink(missing_ok=True)
-    if r.returncode != 0:
-        sys.stderr.write(r.stderr or r.stdout or "mmdc failed\n")
-        raise RuntimeError(f"mmdc failed with code {r.returncode}")
-
-
-def main() -> None:
-    ap = argparse.ArgumentParser()
-    ap.add_argument("source", type=Path, help="Input .md path")
-    ap.add_argument("output", type=Path, help="Output .md path")
-    ap.add_argument(
-        "--img-dir",
-        type=Path,
-        default=None,
-        help="Directory for PNGs (default: next to output, mermaid_pdf/)",
-    )
-    ap.add_argument("--mmdc", default="mmdc", help="Path to mmdc binary")
-    ap.add_argument("--width", type=int, default=1100)
-    ap.add_argument("--scale", type=float, default=1.5)
-    ap.add_argument(
-        "--puppeteer-config",
-        type=Path,
-        default=None,
-        help="JSON for Puppeteer (default: picked by effective uid from the two configs next to this script)",
-    )
-    args = ap.parse_args()
-    # Optional: MERMAID_PDF_WIDTH / MERMAID_PDF_SCALE (e.g. build-pdf.sh for design doc)
-    if os.environ.get("MERMAID_PDF_WIDTH"):
-        args.width = int(os.environ["MERMAID_PDF_WIDTH"])
-    if os.environ.get("MERMAID_PDF_SCALE"):
-        args.scale = float(os.environ["MERMAID_PDF_SCALE"])
-
-    script_dir = Path(__file__).resolve().parent
-
-    # Two vetted Puppeteer configs ship alongside this script:
-    #   - mermaid-puppeteer.json       : Chromium sandbox enabled. Used for
-    #                                    non-root execution (the secure
-    #                                    default for laptops, CI runners that
-    #                                    run as a non-root user, etc.).
-    #   - mermaid-puppeteer-root.json  : --no-sandbox / --disable-setuid-sandbox.
-    #                                    Used only when this script runs as
-    #                                    uid 0, because Chromium refuses to
-    #                                    start with the setuid sandbox enabled
-    #                                    when running as root (common inside
-    #                                    container build environments).
-    # Both configs also pass --disable-dev-shm-usage, which is a stability
-    # workaround for small /dev/shm in containers (not a security flag).
-    #
-    # Selection is driven by the effective uid, never by user input. Any
-    # --puppeteer-config that doesn't resolve to one of these two shipped
-    # files is rejected. This prevents an attacker-supplied config from
-    # introducing extra dangerous flags such as --remote-debugging-port
-    # (would expose a control channel to the headless browser) or
-    # --load-extension (would let arbitrary JS run in Chromium).
-    sandboxed_pc = script_dir / "mermaid-puppeteer.json"
-    root_pc = script_dir / "mermaid-puppeteer-root.json"
-
-    is_root = hasattr(os, "geteuid") and os.geteuid() == 0
-    default_pc = root_pc if is_root else sandboxed_pc
-
-    allowed = {p.resolve() for p in (sandboxed_pc, root_pc) if p.exists()}
-    if args.puppeteer_config is not None:
-        requested = args.puppeteer_config.resolve()
-        if requested not in allowed:
-            sys.stderr.write(
-                "Refusing --puppeteer-config: only the shipped configs are "
-                f"allowed ({sandboxed_pc.name}, {root_pc.name}). "
-                f"Got: {requested}\n"
-            )
-            sys.exit(2)
-        default_pc = args.puppeteer_config
-
-    puppeteer_config = default_pc if default_pc.is_file() else None
-    if puppeteer_config is not None:
-        uid_str = str(os.geteuid()) if hasattr(os, "geteuid") else "n/a"
-        sys.stderr.write(
-            f"[render-mermaid-for-pdf] using puppeteer config: "
-            f"{puppeteer_config.name} (uid={uid_str})\n"
-        )
-
-    # Validate source path exists and is a regular file
-    if not args.source.is_file():
-        sys.stderr.write(f"ERROR: source markdown not found: {args.source}\n")
-        sys.exit(1)
-
-    text = args.source.read_text(encoding="utf-8")
-    img_dir = args.img_dir
-    if img_dir is None:
-        img_dir = args.output.parent / "mermaid_pdf"
-
-    n = 0
-
-    out_parent = args.output.parent.resolve()
-
-    def repl(m: re.Match[str]) -> str:
-        nonlocal n
-        n += 1
-        body = m.group(1)
-        png_name = f"diagram_{n:02d}.png"
-        out_png = img_dir / png_name
-        render_one(
-            args.mmdc,
-            body,
-            out_png,
-            args.width,
-            args.scale,
-            puppeteer_config,
-        )
-        try:
-            rel_to_md = out_png.resolve().relative_to(out_parent)
-        except ValueError:
-            # --img-dir is outside the output directory; fall back to os.path.relpath
-            rel_to_md = Path(os.path.relpath(out_png.resolve(), out_parent))
-        return f"\n![Mermaid diagram {n}]({rel_to_md.as_posix()})\n"
-
-    new_text, count = MERMAID_BLOCK.subn(repl, text)
-    args.output.write_text(new_text, encoding="utf-8")
-    if count:
-        print(f"Rendered {count} Mermaid diagram(s) into {img_dir}", file=sys.stderr)
-
-
-if __name__ == "__main__":
-    main()
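Review note on the Mermaid substitution: the script finds fenced `mermaid` blocks with a multiline regex and replaces each with a numbered image link. The sketch below shows that substitution in isolation, with rendering left out. The function name `replace_mermaid_blocks` and the `img_dir` default are assumptions for illustration, and the fence string is built at runtime only so the example contains no literal fence.

```python
import re

FENCE = "`" * 3  # built at runtime so this example contains no literal fence
MERMAID_BLOCK = re.compile(
    rf"^{FENCE}mermaid\s*\n(.*?)^{FENCE}\s*$",
    re.MULTILINE | re.DOTALL,
)


def replace_mermaid_blocks(text: str, img_dir: str = "mermaid_pdf") -> tuple[str, list[str]]:
    """Swap each Mermaid block for an image link; return new text and block bodies.

    Rendering is skipped here; the deleted script calls mmdc for every body.
    """
    bodies: list[str] = []

    def repl(match: re.Match) -> str:
        bodies.append(match.group(1))
        n = len(bodies)  # 1-based diagram counter
        return f"\n![Mermaid diagram {n}]({img_dir}/diagram_{n:02d}.png)\n"

    return MERMAID_BLOCK.sub(repl, text), bodies


if __name__ == "__main__":
    sample = "\n".join(["# Doc", FENCE + "mermaid", "graph TD; A-->B", FENCE, "tail"])
    new_text, found = replace_mermaid_blocks(sample)
    print(new_text)
    print(found)
```

Collecting the bodies separately keeps the text rewrite testable without `mmdc` or Chromium installed.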
diff --git a/skills/deepstream/deepstream-import-vision-model/scripts/report/report-style.css b/skills/deepstream/deepstream-import-vision-model/scripts/report/report-style.css
deleted file mode 100644
index 0448d673..00000000
--- a/skills/deepstream/deepstream-import-vision-model/scripts/report/report-style.css
+++ /dev/null
@@ -1,103 +0,0 @@
-body {
-    font-family: 'Segoe UI', Arial, Helvetica, sans-serif;
-    font-size: 14px;
-    line-height: 1.6;
-    color: #1a1a1a;
-    max-width: 900px;
-    margin: 0 auto;
-    padding: 20px;
-}
-
-h1 { color: #1a237e; border-bottom: 3px solid #1a237e; padding-bottom: 8px; }
-h2 { color: #283593; border-bottom: 2px solid #c5cae9; padding-bottom: 6px; margin-top: 30px; }
-h3 { color: #3949ab; margin-top: 20px; }
-
-table {
-    border-collapse: collapse;
-    width: 100%;
-    margin: 16px 0;
-    font-size: 13px;
-    box-shadow: 0 1px 3px rgba(0,0,0,0.12);
-}
-
-thead tr {
-    background-color: #283593;
-    color: #ffffff;
-    font-weight: bold;
-}
-
-th {
-    border: 1px solid #1a237e;
-    padding: 10px 12px;
-    text-align: left;
-}
-
-td {
-    border: 1px solid #c5cae9;
-    padding: 8px 12px;
-    text-align: left;
-}
-
-tbody tr:nth-child(odd) {
-    background-color: #e8eaf6;
-}
-
-tbody tr:nth-child(even) {
-    background-color: #ffffff;
-}
-
-tbody tr:hover {
-    background-color: #c5cae9;
-}
-
-/* Bold first column in tables */
-td:first-child {
-    font-weight: 600;
-    color: #1a237e;
-}
-
-code {
-    background-color: #f5f5f5;
-    border: 1px solid #e0e0e0;
-    border-radius: 3px;
-    padding: 1px 5px;
-    font-size: 12px;
-}
-
-pre {
-    background-color: #263238;
-    color: #eeffff;
-    border-radius: 6px;
-    padding: 14px;
-    overflow-x: auto;
-    font-size: 12px;
-    line-height: 1.5;
-}
-
-pre code {
-    background: none;
-    border: none;
-    color: inherit;
-    padding: 0;
-}
-
-img {
-    max-width: 100%;
-    height: auto;
-    display: block;
-    margin: 16px auto;
-    border-radius: 4px;
-    box-shadow: 0 2px 6px rgba(0,0,0,0.15);
-}
-
-blockquote {
-    border-left: 4px solid #283593;
-    background-color: #e8eaf6;
-    padding: 10px 16px;
-    margin: 16px 0;
-}
-
-strong { color: #1a237e; }
-
-ul, ol { margin: 8px 0; }
-li { margin: 4px 0; }
diff --git a/skills/deepstream/deepstream-import-vision-model/skill-card.md b/skills/deepstream/deepstream-import-vision-model/skill-card.md
deleted file mode 100644
index 8e0a3b98..00000000
--- a/skills/deepstream/deepstream-import-vision-model/skill-card.md
+++ /dev/null
@@ -1,79 +0,0 @@
-## Description: <br>
-Use this skill to bring any vision model from HuggingFace or NVIDIA NGC into an NVIDIA DeepStream pipeline with end-to-end automation: ONNX download, SafeTensors export, TRT engine build, custom nvinfer bbox parser, multi-stream benchmark, and PDF report. <br>
-
-This skill is ready for commercial/non-commercial use. <br>
-
-## Owner
-NVIDIA <br>
-
-### License/Terms of Use: <br>
-CC-BY-4.0 AND Apache-2.0 <br>
-## Use Case: <br>
-Developers and engineers who need to import vision models from HuggingFace or NVIDIA NGC into NVIDIA DeepStream pipelines for object detection, including automated engine building, benchmarking, and performance report generation. <br>
-
-### Deployment Geography for Use: <br>
-Global <br>
-
-## Known Risks and Mitigations: <br>
-Risk: Proposals could introduce incorrect or misleading guidance into skills, so review them before execution. <br>
-Mitigation: Review and scan skill before deployment. <br>
-
-## Reference(s): <br>
-- [engine-build.md](references/engine-build.md) <br>
-- [model-acquire.md](references/model-acquire.md) <br>
-- [pipeline-run.md](references/pipeline-run.md) <br>
-- [report-generation.md](references/report-generation.md) <br>
-- [NVIDIA DeepStream SDK](https://developer.nvidia.com/deepstream-sdk) <br>
-- [NVIDIA NGC DeepStream Container](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/deepstream) <br>
-
-
-## Skill Output: <br>
-**Output Type(s):** [Shell commands, Configuration files, Code, Files] <br>
-**Output Format:** [Markdown with inline bash code blocks] <br>
-**Output Parameters:** [1D] <br>
-**Other Properties Related to Output:** [Generates TensorRT engines, nvinfer configs, custom bbox parser C++ source, benchmark logs, and PDF reports] <br>
-
-## Evaluation Agents Used: <br>
-- Claude Code (`claude-code`) <br>
-- Codex (`codex`) <br>
-
-
-
-## Evaluation Tasks: <br>
-Evaluated against 5 evaluation tasks (3 positive skill-activation, 2 negative) with 2 attempts per task via NVSkills-Eval external profile. <br>
-
-## Evaluation Metrics Used: <br>
-Reported benchmark dimensions: <br>
-- Security: Checks whether skill-assisted execution avoids unsafe behavior such as secret leakage, destructive commands, or unauthorized access. <br>
-- Correctness: Checks whether the agent follows the expected workflow and produces the correct final output. <br>
-- Discoverability: Checks whether the agent loads the skill when relevant and avoids using it when irrelevant. <br>
-- Effectiveness: Checks whether the agent performs measurably better with the skill than without it. <br>
-- Efficiency: Checks whether the agent uses fewer tokens and avoids redundant work. <br>
-
-Underlying evaluation signals used in this run: <br>
-- `skill_execution`: Verifies that the agent loaded the expected skill and workflow. <br>
-- `skill_efficiency`: Checks routing quality, decoy avoidance, and redundant tool usage. <br>
-- `accuracy`: Grades final-answer correctness against the reference answer. <br>
-- `goal_accuracy`: Checks whether the overall user task completed successfully. <br>
-- `behavior_check`: Verifies expected behavior steps, including safety expectations. <br>
-- `token_efficiency`: Compares token usage with and without the skill. <br>
-
-
-
-## Evaluation Results: <br>
-| Dimension | Num | `claude-code` | `codex` |
-|---|---:|---:|---:|
-| Security | 8 | 68% (+13%) | 72% (+18%) |
-| Correctness | 8 | 83% (-2%) | 89% (+13%) |
-| Discoverability | 8 | 61% (+0%) | 80% (+1%) |
-| Effectiveness | 8 | 80% (+2%) | 81% (+17%) |
-| Efficiency | 8 | 52% (+2%) | 70% (+2%) |
-
-## Skill Version(s): <br>
-1.2.1 (source: frontmatter) <br>
-
-## Ethical Considerations: <br>
-NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal team to ensure this skill meets requirements for the relevant industry and use case and addresses unforeseen product misuse. <br>
-
-(For Release on NVIDIA Platforms Only) <br>
-Please report quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://app.intigriti.com/programs/nvidia/nvidiavdp/detail). <br>
diff --git a/skills/deepstream/deepstream-import-vision-model/skill.oms.sig b/skills/deepstream/deepstream-import-vision-model/skill.oms.sig
deleted file mode 100644
index 97cf9273..00000000
--- a/skills/deepstream/deepstream-import-vision-model/skill.oms.sig
+++ /dev/null
@@ -1 +0,0 @@
-{"mediaType":"application/vnd.dev.sigstore.bundle.v0.3+json","verificationMaterial":{"x509CertificateChain":{"certificates":[{"rawBytes":"MIICgzCCAgmgAwIBAgIUKIyS7SxNteQIiWzK1dWj85E6520wCgYIKoZIzj0EAwMwVTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjEpMCcGA1UEAwwgTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBJQ0EgMDEwHhcNMjYwNDAxMDAwMDAwWhcNMjgwNDIyMTUzMzA5WjBUMQswCQYDVQQGEwJVUzEbMBkGA1UECgwSTlZJRElBIENvcnBvcmF0aW9uMSgwJgYDVQQDDB9OVklESUEgQWdlbnQgU2tpbGxzIFNpZ25pbmcgMDAxMHYwEAYHKoZIzj0CAQYFK4EEACIDYgAEYoRM9bQl/dGlwSRNi6bTpIJUXH8Nv9GciP6LSflJYYMLCc296kpyuTSsk5ddbAWiDcFX3C/ydX3jwc+qCLYP6uHy9XphyLjOQ27Yb2J6rBLVtRBS1mgGco/Gr7fL6ODco4GaMIGXMB0GA1UdDgQWBBRQ/5ZW3nJ6lmo9SVk7I15o7UGmpTAfBgNVHSMEGDAWgBRPGpILxMBBleJSsBGjrMKsby1CgjAMBgNVHRMBAf8EAjAAMA4GA1UdDwEB/wQEAwIHgDA3BggrBgEFBQcBAQQrMCkwJwYIKwYBBQUHMAGGG2h0dHA6Ly9vY3NwLm5kaXMubnZpZGlhLmNvbTAKBggqhkjOPQQDAwNoADBlAjAUygu/GiOCIXrgGr4SmLgeEVDcEitfFUv7ALbvLVGVyMysB3mxmO/uInZfXzWcJZsCMQDxuoxj4ZmO30jhkPIcCxGFCOvnUsnfU3TfGcouYm4M6iRpbKvtVnHPiy4bi6pcKf0="},{"rawBytes":"MIICiDCCAg6gAwIBAgIUZsIuSv9NkpJCNqtYEfCouVv5BzowCgYIKoZIzj0EAwMwUTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjElMCMGA1UEAwwcTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBDQTAgFw0yNjA0MDEwMDAwMDBaGA85OTk5MTIzMTIzNTk1OVowVTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjEpMCcGA1UEAwwgTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBJQ0EgMDEwdjAQBgcqhkjOPQIBBgUrgQQAIgNiAASI72cR3ctKGg4VWnB3bNja6g1Z2PnOmFEopkPof+QeIcPk9rT+g9MjJnq51EQXL93a7C2GJ9J985G4o2V85VD7wJ1RaXhluHW2rf3y8bQGeAYaKMr5s/hUgn+M3/9WlWejgaAwgZ0wHQYDVR0OBBYEFE8akgvEwEGV4lKwEaOswqxvLUKCMB8GA1UdIwQYMBaAFItnoAjjfuCEUvzyvWyI2vOGvwPjMBIGA1UdEwEB/wQIMAYBAf8CAQAwDgYDVR0PAQH/BAQDAgEGMDcGCCsGAQUFBwEBBCswKTAnBggrBgEFBQcwAYYbaHR0cDovL29jc3AubmRpcy5udmlkaWEuY29tMAoGCCqGSM49BAMDA2gAMGUCMQCeIMMfAbyzPDacw2MxG+Yt1cikrJX/DVxiGfXuHmkkXn6VgSzE79+lkqDErpVO2gYCMCNEColOyvUvkzZGUEI1hQ3PfMgi3FIo9tHoBKMw4/wGBLFpu/0ubtmbBXM6/UMOEw=="},{"rawBytes":"MIICRTCCAcygAwIBAgIUeJdY3rV86EdvFmG7L8LJBsyQFYkwCgYIKoZIzj0EAwMwUTELMAkGA1UEBhMCVVMxGzAZBgNVB
AoMEk5WSURJQSBDb3Jwb3JhdGlvbjElMCMGA1UEAwwcTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBDQTAgFw0yNjA0MDEwMDAwMDBaGA85OTk5MTIzMTIzNTk1OVowUTELMAkGA1UEBhMCVVMxGzAZBgNVBAoMEk5WSURJQSBDb3Jwb3JhdGlvbjElMCMGA1UEAwwcTlZJRElBIEFnZW50IENhcGFiaWxpdGllcyBDQTB2MBAGByqGSM49AgEGBSuBBAAiA2IABAYpiXCDjJ9NT2eSDhyHJVSw1Tbze18cGG2F/578oWvHxg23eQAhNRYdq88i1iOshZSO6C29doKui5Xpmo/7Ctw9Sx4PP2RzOmIuOLCuTdNtKcTRwi4GEsd5BAFvWj42M6NjMGEwHQYDVR0OBBYEFItnoAjjfuCEUvzyvWyI2vOGvwPjMB8GA1UdIwQYMBaAFItnoAjjfuCEUvzyvWyI2vOGvwPjMA8GA1UdEwEB/wQFMAMBAf8wDgYDVR0PAQH/BAQDAgEGMAoGCCqGSM49BAMDA2cAMGQCMCwtAjWLaNwgGWNCgdyNoTyvNhqWRECRJV2r3+7w8g0PL6NHLOsbkgE09BH95h8XlgIwTaQmbbUh2ChAJ5TA1wRiVDnCcvbzHlZl2jM2FcwQQZlk19LOAbyGMRixbu2Ww/rj"}]},"tlogEntries":[]},"dsseEnvelope":{"payload":"ewogICJfdHlwZSI6ICJodHRwczovL2luLXRvdG8uaW8vU3RhdGVtZW50L3YxIiwKICAic3ViamVjdCI6IFsKICAgIHsKICAgICAgIm5hbWUiOiAiZGVlcHN0cmVhbS1pbXBvcnQtdmlzaW9uLW1vZGVsIiwKICAgICAgImRpZ2VzdCI6IHsKICAgICAgICAic2hhMjU2IjogImY4MTFiYWYzNGFhY2IxNGI2ZmExZGEyNTVmNGIzM2JhMWU4M2YxYzNjN2RlYWY0ZTU5MjNiMjQzZjFkNDg0YmUiCiAgICAgIH0KICAgIH0KICBdLAogICJwcmVkaWNhdGVUeXBlIjogImh0dHBzOi8vbW9kZWxfc2lnbmluZy9zaWduYXR1cmUvdjEuMCIsCiAgInByZWRpY2F0ZSI6IHsKICAgICJyZXNvdXJjZXMiOiBbCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICJjY2VkNzUyZTYzZDc5NDM3NWRjZTZmNzRiOGJhMjYzYmI3OWM2OWIyMzZiMzJmNjI5ZTg0NDUzNGJhM2U2YWI5IiwKICAgICAgICAibmFtZSI6ICJCRU5DSE1BUksubWQiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICI5MTk0ZThmM2YyNTZkNTZiZDE2ZTZiMTQ5NDM5MzNlZTViMGIwOGQyYWI3ODg4MTM0NzZmMjM5MjQ5M2MzZTU2IiwKICAgICAgICAibmFtZSI6ICJTS0lMTC5tZCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogIjM3ZTFmYjczMWM2ZjkyZjQwY2JmYzQzNTYwY2Q3NWI2NTE5YmFjZWZhMDFmNTJhYTNhZWNlY2E5ZDFhMDcyMmIiLAogICAgICAgICJuYW1lIjogImV2YWxzL2V2YWxzLmpzb24iCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICIxMzFlNTE4MTRiZTM0Y2ZlYzQwNDE1NTdmNTZiMzNkODQ0MWQ2ZjIxYjllZDc0ZjQ2NmZkNmQ3YmU4Nzd
hNzMwIiwKICAgICAgICAibmFtZSI6ICJyZWZlcmVuY2VzL2VuZ2luZS1idWlsZC5tZCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogIjRlNWJmZDQwZjUzNmU4MjQ2ODY2ZDU5ZWExYThiMDA1ZjA1Mjg2MzY1NTdhNTFiNGI3MzM0NzE1NGIxYmZiMzciLAogICAgICAgICJuYW1lIjogInJlZmVyZW5jZXMvbW9kZWwtYWNxdWlyZS5tZCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogIjliZmUwMThlMmRhNDEyYzkzY2FkN2FmZTg3YzgzMGE0YzM0ZDg0YjIyOWYyN2EyY2Y1N2QxZWNkZmI5Nzg3YmIiLAogICAgICAgICJuYW1lIjogInJlZmVyZW5jZXMvcGlwZWxpbmUtcnVuLm1kIgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiY2FiM2M2MzZjYzFmNGMxZjJhMzFmNWQ4MTM1OGYwMzZjY2ExNGEzMzM1NDg1ZGNhMGYzY2MyNTFjMGE2NzlhMiIsCiAgICAgICAgIm5hbWUiOiAicmVmZXJlbmNlcy9yZXBvcnQtZ2VuZXJhdGlvbi5tZCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogImYzZGYxNzZkODRhNjdkOTU0NDI0ZGJiNjE2OWMwNzQ4NmI4NzBkMTQxYTIzODQ3ZWZmZWEyZjgyYmNhMDViNjEiLAogICAgICAgICJuYW1lIjogInNjcmlwdHMvZGVlcHN0cmVhbS9iZW5jaG1hcmstZHMuc2giCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICJjZDE3NjVhNmFiNjA5ZTJmZjdkOWVjODExMWEzM2RiYTNlZmYxMDczODI2OTViMjI3NTQwOTRkNjQzNjNmMTUzIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL2RlZXBzdHJlYW0vZHMta2l0dGktZHVtcC5zaCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogImQxNWYxZGRmYjgzM2JlN2Y5OWU4MWUwOGY5M2RhZmNjMzMwMWExZGU4NjgyN2Y2NTFkYjUwMGNjZGU4OGYxY2UiLAogICAgICAgICJuYW1lIjogInNjcmlwdHMvZGVlcHN0cmVhbS9kcy1wZXJmLXJ1bi5zaCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogIjZiMGUwZTc2MGRkMDdlNGQyNzI1NDIyZWY1MGYxZWIyNzE3NDdiNDlmMTNjNzg0MmUzZjExZGI0NjQ2OWFjNWMiLAogICAgICAgICJuYW1lIjogInNjcmlwdHMvZGVlcHN0cmVhbS9kcy1zaW5nbGUtc3RyZWFtLnNoIgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiZWJkMjlkMjU5MDQ1MjBlMTQzOGUxOWI3ZWM4Y2U0MWY5NWExZWEyNDU4YTQ5ZDA4ZDE1MDc5NjIwYWU2YjFlOSIsCiAgICAgICAgIm5hbWUiOiAic2N
yaXB0cy9kZWVwc3RyZWFtL2RzLXN3ZWVwLnNoIgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiZmU4ZDgwNmY0NmNlYWU4MTA4NjUzZjYwN2UwOGRiNTA5MTE1YTM3ODYyYTUzZWY5NTQ2ZTQzN2Q5YmQ0NTFmNCIsCiAgICAgICAgIm5hbWUiOiAic2NyaXB0cy9kZWVwc3RyZWFtL2V4dHJhY3QtZnJhbWUuc2giCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICI3NDUyYTQ2N2Y1NmJlZGIwODM4ZDhlMzZmZjk3MGFjNjQ2MDA3MzdjOWJlY2NhZTVmNGNjYWNjMzA5ZGM1YWM5IiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL2VuZ2luZS9iZW5jaG1hcmstdHJ0ZXhlYy5zaCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogIjQxMDlmOTUxYWRmN2EwMzg4N2NjMTMxN2U3YzBjODZkMWMyZDMwODNkYzlhYjk3ZWIwZjRmNzcwYTM5ZWMwZjQiLAogICAgICAgICJuYW1lIjogInNjcmlwdHMvbW9kZWwvY2xlYW51cC5zaCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogImIwZDYwMDBlMjlmNzVlOTk1ZDk3ZTgyZWUyOTRmODg3ZjZjYWU0OGM1MjdmNmM2NTJmMjg2MDAzYTAzZjM4MTEiLAogICAgICAgICJuYW1lIjogInNjcmlwdHMvbW9kZWwvaGYtZG93bmxvYWQtY29uZmlnLnNoIgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiZmU3M2E3ODA1MGYzNzVkMDJmNjBjYjg3ZmE3NmQxNTc1NjBiMTUzZTVjY2Y5ZDRiNmQ4ZDhkMTMwMzg1NGZlZCIsCiAgICAgICAgIm5hbWUiOiAic2NyaXB0cy9tb2RlbC9oZi1saXN0LWZpbGVzLnNoIgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiNTk1NjMwNWIwNGM4MTFkZDQxZjMxMGY2ODhiZmM3MzU4NGFiMWZkMmZjNWYyYzg2ZDUzZWU5NjVkYjlmMGVhNyIsCiAgICAgICAgIm5hbWUiOiAic2NyaXB0cy9tb2RlbC9pbnNwZWN0LW9ubngucHkiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICIxZTNiNjIyYWEwMDVkNzMxOGFlYzcyZWI2YzBlOTY1OTE3ZTE3Y2Q0ZWEzMGI5MmJiMDcxMzk0ZmExYmQ2N2RjIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL21vZGVsL21ha2Utc3RhdGljLWJhdGNoLW9ubngucHkiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICI0NGIyY2NhYThjZGRiNzBkNTIxZDc3MDYyMWEzYWExZDVlZTU0MmNmNmY1MTMwZjg1ODAyN2EyOTk3MTYzZDNiIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL21vZGVsL25nYy1kb3d
ubG9hZC5zaCIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogImUzZTVlMDllNjhjMmZmZGY0MTNmNzMzMTI1NGM1ZWU4ZGQxYzdiOGUzYzUwNzJlYTVhMTBiYWExZTFhOTVmNDAiLAogICAgICAgICJuYW1lIjogInNjcmlwdHMvbW9kZWwvbmdjLWxpc3QtZmlsZXMuc2giCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICJjY2M5YWJhNjg3ZTg2MDFjMmQyNTRjZmVlMGI3NTBkNmMwNDIyODBiNzM2MmJhNGU3YzM5NmFhMDQ4MmY5NTJiIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL21vZGVsL3NhZmV0ZW5zb3JzLXRvLW9ubnguc2giCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICIzMjVhN2EzYTRhMzc0MGU2YTNjODY1OTFmMzI0ZTljYjUxMmM2NWJkY2UwMTBmN2Q5NTQ2MDIxODI2ODFlOWUzIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL3JlcG9ydC9nZW5lcmF0ZS1iZW5jaG1hcmstY2hhcnRzLnB5IgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiMmIyOGE0ZTQ4MzQ5ZDE0MDc2MmFhMjBmMTJjZWE5NjJhNjkyY2I3ZmVlNzUzNzE1Y2I0MDFmZjlhMzNkZDM5ZCIsCiAgICAgICAgIm5hbWUiOiAic2NyaXB0cy9yZXBvcnQvbGF0ZXgtcGRmLXdyYXAudGV4IgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiYTk4MTc3NDIzNDdjMWMyMDNjY2YzZjkxYWM3NTM2MGY5MWRmYmE2MDVjMmQ1YWUwZGFlNWU3OTM0ODY1MGM0OCIsCiAgICAgICAgIm5hbWUiOiAic2NyaXB0cy9yZXBvcnQvbWQtdG8taHRtbC1wZGYucHkiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICJkZGVhYzU4Njg2ODM1YjQxZjJkMGNjZjA0N2YwNzVkMTUyZmIyOTZjMWM2OGIxZDNmYzNiZDNkYThmMTRjMTZiIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL3JlcG9ydC9tZC10by1wZGYuc2giCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICI4OTZlMmQ4ZjNiZDllODYyMmVkOWM3MDlmYTljNTk3ZmJlM2I5ODk3NmY4MDhiOGViMjJkZGIwYTcxNWJkZDUzIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL3JlcG9ydC9tZXJtYWlkLXB1cHBldGVlci1yb290Lmpzb24iCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICIwM2FlODFmZGRlYzk3YjdmMzdhNDA4NmYxYWNjZGY1MGJjNGE4MDE4ZDY4MGQyZGU5YjhmNGRmZDBmYjYwYzBmIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL3JlcG9ydC9tZXJtYWlkLXB
1cHBldGVlci5qc29uIgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiYjY3N2NlMDc5MDhiYmQ0YTBjMzNkZjM2NWU5YjFjM2Q5M2M1MmM2ZjBkYWY4MjA4MGNkNDFhZGM4MTE0MjVhNyIsCiAgICAgICAgIm5hbWUiOiAic2NyaXB0cy9yZXBvcnQvcGFuZG9jLXdyYXAtdGFibGVzLmx1YSIKICAgICAgfSwKICAgICAgewogICAgICAgICJhbGdvcml0aG0iOiAic2hhMjU2IiwKICAgICAgICAiZGlnZXN0IjogIjhjMGY1ZjM1ODRiOWMxM2QxNDY2NzQ3ZTJkODQzNDM0YjYyNDRiYzM4NmEyNTQ3Zjc3OGFhMGRhMDVlZWUxZTMiLAogICAgICAgICJuYW1lIjogInNjcmlwdHMvcmVwb3J0L3JlbmRlci1tZXJtYWlkLWZvci1wZGYucHkiCiAgICAgIH0sCiAgICAgIHsKICAgICAgICAiYWxnb3JpdGhtIjogInNoYTI1NiIsCiAgICAgICAgImRpZ2VzdCI6ICJlYWU5YjQ3OTI5MTUyN2QyYWZiZGJmMGI0NWI2NDM3N2M3YjE1ZjgzOTJiMmU3ZDI4YzNmNDg4ZDcyNWZhMDAzIiwKICAgICAgICAibmFtZSI6ICJzY3JpcHRzL3JlcG9ydC9yZXBvcnQtc3R5bGUuY3NzIgogICAgICB9LAogICAgICB7CiAgICAgICAgImFsZ29yaXRobSI6ICJzaGEyNTYiLAogICAgICAgICJkaWdlc3QiOiAiZDRjMWVlOWQzMDZjNTQxZTMwYjg5MzhlM2QxYmQwYmU5NzY0OTY4YTNhYjJmOWFlZTE4ZDcxMWFmODgzYTkwNCIsCiAgICAgICAgIm5hbWUiOiAic2tpbGwtY2FyZC5tZCIKICAgICAgfQogICAgXSwKICAgICJzZXJpYWxpemF0aW9uIjogewogICAgICAiYWxsb3dfc3ltbGlua3MiOiBmYWxzZSwKICAgICAgImlnbm9yZV9wYXRocyI6IFsKICAgICAgICAiLmdpdCIsCiAgICAgICAgIi5naXRodWIiLAogICAgICAgICIuZ2l0YXR0cmlidXRlcyIsCiAgICAgICAgIi5naXRpZ25vcmUiCiAgICAgIF0sCiAgICAgICJoYXNoX3R5cGUiOiAic2hhMjU2IiwKICAgICAgIm1ldGhvZCI6ICJmaWxlcyIKICAgIH0KICB9Cn0=","payloadType":"application/vnd.in-toto+json","signatures":[{"sig":"MGUCMQCEV5I6zIgU/dOK6aY+nkZyW9vt6Ip5WFApKUaaR6oraeICX5lADTYo5Ek4z5ZHsSgCMCqm7kO/HaDDG7oTsf5n1DLda1/aUpTypNLmt2OGYM/3DttfOc1djnhdACH+qRrYjA==","keyid":""}]}}
\ No newline at end of file