fix: preserve API-provided verifier SHA in bundle creation by shihan-fleet · Pull Request #62 · fleet-ai/fleet-sdk

shihan-fleet · 2026-02-16T23:07:34Z

No description provided.

When verifier code contains multiple functions (e.g., a main verifier function and helper functions), the helper functions were not accessible from the main function due to namespace isolation. The exec() call created functions in local_namespace, but the main function's __globals__ pointed to exec_globals which didn't contain the helper functions. This caused NameError when the main function tried to call helpers, which was silently caught and returned 0.0. Fix: Merge local_namespace into exec_globals after exec() so all defined functions are accessible when the verifier is called.

…mespace fix: allow verifier helper functions to be called from main verifier

InstanceRequest changes: - Add: profile_id, async_provision, instance_mode, ssh_public_keys, snapshot_interval_minutes, version (deprecated) - Fix: region default changed from 'us-west-1' to None (server decides) - Fix: created_from default changed from None to 'api' TaskRequest changes: - Add: verifier_func, project_key, data_id, data_version, writer_metadata - Add: model_config with extra='ignore' and populate_by_name=True - Add: alias='env_id' for environment_id field - Remove: metadata (doesn't exist in orchestrator TaskRequest, only in TaskResponse)

…API" This reverts commit 9a0af14.

add metadata to tasks in SDK

bump version

consolidate

…odels Add factual_answer field to support research/factual tasks: - Task model: stores expected answer for verification - TaskRequest: accept factual_answer when creating tasks - TaskResponse: return factual_answer from API Part of: https://linear.app/fleet-ai/issue/ENG-843/import-script-needs-to-support-output-json-schemas Co-authored-by: Cursor <cursoragent@cursor.com>

Co-authored-by: Cursor <cursoragent@cursor.com>

feat: add factual_answer field to Task and API models

Add task_modality field to Task and TaskResponse models to support copying task modality (computer_use, tool_use, browser) when importing tasks via the SDK. Changes: - Add task_modality to TaskResponse model (API response) - Add task_modality to Task model (SDK model) - Pass task_modality from TaskResponse to Task in load_tasks Co-authored-by: Cursor <cursoragent@cursor.com>

Addresses Bugbot comment: load_task_from_json wasn't extracting task_modality from JSON data, causing tasks loaded from JSON files to have task_modality=None even when the JSON contains this field. Co-authored-by: Cursor <cursoragent@cursor.com>

Co-authored-by: Cursor <cursoragent@cursor.com>

Add task_modality field to async Task model, TaskResponse model, and update load_task_from_json and load_tasks to preserve task_modality. Co-authored-by: Cursor <cursoragent@cursor.com>

- Change ScenarioResponse.id from str to int - Change task_scenario_id from Optional[str] to Optional[int] in Task and TaskResponse models - Bump version to 0.2.112 Co-authored-by: Cursor <cursoragent@cursor.com>

fix: use int for scenario IDs to match database schema

The API returns `environment_id` but load_task_from_json was only looking for `env_id` or `env_key`. Now it checks all three field names. Bump version to 0.2.113. Co-authored-by: Cursor <cursoragent@cursor.com>

fix: handle environment_id in load_task_from_json

Previously, import_single_task would catch all exceptions and return None, making it impossible to debug import failures. Now it raises the exception so callers can handle or report the actual error. Bump version to 0.2.114. Co-authored-by: Cursor <cursoragent@cursor.com>

…dling fix: propagate errors from import_single_task instead of swallowing

This field was missing from the SDK, causing the lifecycle status to be lost when copying tasks. The API returns this field but the SDK wasn't capturing it. Changes: - Add task_lifecycle_status field to Task model (sync and async) - Map task_lifecycle_status in load_task_from_json (sync and async) - Bump version to 0.2.115 Co-authored-by: Cursor <cursoragent@cursor.com>

The API returns 'environment_id', so just use that directly instead of a fallback chain of env_id/env_key/environment_id. Co-authored-by: Cursor <cursoragent@cursor.com>

The database uses env_key, so the SDK model should match. Added alias="environment_id" so the API response still maps correctly. Updated all references: - Task.env_id -> Task.env_key - TaskInfo.env_id -> TaskInfo.env_key - Updated docstrings and examples Co-authored-by: Cursor <cursoragent@cursor.com>

The API expects env_id (or environment_id), so we map env_key to env_id in import_single_task before sending. This keeps the SDK using env_key internally (matching DB) while maintaining API compatibility. No API changes needed - this is SDK-only. Co-authored-by: Cursor <cursoragent@cursor.com>

The task_lifecycle_status field was added to the Task model but was missing from: - TaskResponse model (sync and async) - needed to parse API response - load_tasks method - needed to pass the field to Task constructor This completes the task_lifecycle_status support in the SDK. Co-authored-by: Cursor <cursoragent@cursor.com>

The field was renamed to env_key but there was already a property with the same name, causing infinite recursion. Renamed the property to get_env_key() method. Also restored fallback for env_key in load_task_from_json to support JSON files that use env_key field. Co-authored-by: Cursor <cursoragent@cursor.com>

The field was renamed to env_key but there was already a property with the same name, causing infinite recursion. Renamed the property to get_env_key() method. Also restored env_id fallback in load_task_from_json for backward compatibility with existing JSON files. Co-authored-by: Cursor <cursoragent@cursor.com>

The make() method was using self.env_key (raw field) instead of self.get_env_key() (computed method with version). This would cause environments to be created without the version suffix. Co-authored-by: Cursor <cursoragent@cursor.com>

The API returns env_id but TaskInfo was renamed to use env_key. Added alias="env_id" so Pydantic accepts both field names during deserialization of API responses. Co-authored-by: Cursor <cursoragent@cursor.com>

When export_tasks serializes tasks, it outputs env_key. The loading function needs to check for env_key first (canonical name), then fallback to environment_id (API) and env_id (legacy). Co-authored-by: Cursor <cursoragent@cursor.com>

- TaskResponse: rename environment_id -> env_key (alias="environment_id") - TaskRequest: rename environment_id -> env_key (alias="environment_id") - Add ConfigDict(populate_by_name=True) for alias support - Add Task.env_spec property for env_key:version string - Use task.env_spec in Task.make() and make_for_task() - Clean up load_tasks to use task_response.env_key directly - Remove scattered inline env_key:version string building Co-authored-by: Cursor <cursoragent@cursor.com>

- data_spec: renamed from data_key (data_key kept as alias) - has_verifier: whether task has verifier_func or verifier - is_research_based: whether task has a factual_answer - is_action_based: inverse of is_research_based Co-authored-by: Cursor <cursoragent@cursor.com>

TaskInfo has alias="env_id" on env_key field but was missing model_config = ConfigDict(populate_by_name=True). Without this, creating TaskInfo(env_key="...") would fail since only the alias name was accepted. Co-authored-by: Cursor <cursoragent@cursor.com>

Co-authored-by: Cursor <cursoragent@cursor.com>

feat: Add task_lifecycle_status field to Task model

The PUT /v1/tasks/{task_key} endpoint can return environment_id: null, which caused a Pydantic validation error since env_key was required. This made update_task crash instead of returning a TaskResponse. - TaskResponse.env_key: str -> Optional[str] - Task.env_key: str -> Optional[str] - Task.env_spec now returns None when env_key is absent Co-authored-by: Cursor <cursoragent@cursor.com>

When a task has env_key=None, make_for_task would pass None to make() causing a TypeError at ":" in env_key. Now raises a clear ValueError matching the guard in Task.make(). Co-authored-by: Cursor <cursoragent@cursor.com>

fix: make TaskResponse.env_key optional to handle null API responses

…al-env-key" This reverts commit 3a4f711, reversing changes made to 7ec526b.

…v-key revert: restore env_key as required in TaskResponse and Task

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

cursor · 2026-02-16T23:08:43Z

                    raise OSError(f"Cannot create bundle for {self.key}: {e}")

+            if self._bundle_sha is None:
+                self._bundle_sha = _get_bundle_sha(self._bundle_data)


Empty SHA now bypasses hash generation

Medium Severity

_get_or_create_bundle now recomputes the hash only when _bundle_sha is None, but API-loaded verifiers pass missing hashes as "". That preserves an empty sha256 instead of deriving one from bundle_data, so later check/execute calls can run with an invalid SHA.

Additional Locations (1)

fleet/_async/verifiers/verifier.py#L85-L87

mikesklar and others added 30 commits January 13, 2026 12:21

Update agent.py

b301d67

update gemini cua agent with latest updates

32fa85f

update name

8852db9

Merge pull request #38 from fleet-ai/fix/verifier-helper-functions-na…

5f89234

…mespace fix: allow verifier helper functions to be called from main verifier

Bump version to 0.2.104

54feffd

add metadata to tasks

f895cb6

fixes

fef862d

Revert "fix: align InstanceRequest and TaskRequest with orchestrator …

9cdd7e6

…API" This reverts commit 9a0af14.

Merge pull request #40 from fleet-ai/zz/add-metadata-0121

acc58ed

add metadata to tasks in SDK

bump version

58f8faf

Merge pull request #41 from fleet-ai/zz/2.105

5dbb76e

bump version

consolidate

ee6a8a5

Consolidate all metadata into "metadata" in TaskResponse

05112f7

consolidate

README.md

afb516c

README.md

c08b76b

Update README.md

e303638

Update README.md

0c4d149

Delete export_tasks_filtered.py

f0737e5

Update README.md

53609ae

Update README.md

8b92120

chore: bump version to 0.2.107

33459b2

Co-authored-by: Cursor <cursoragent@cursor.com>

chore: update lockfile for 0.2.107

7dbbe39

Co-authored-by: Cursor <cursoragent@cursor.com>

Merge pull request #47 from fleet-ai/add-factual-answer-support

1bcfb21

feat: add factual_answer field to Task and API models

chore: bump version to 0.2.108

875a297

Co-authored-by: Cursor <cursoragent@cursor.com>

feat: add task_modality support to async SDK

aa03cd0

Add task_modality field to async Task model, TaskResponse model, and update load_task_from_json and load_tasks to preserve task_modality. Co-authored-by: Cursor <cursoragent@cursor.com>

andrew-stelmach-fleet and others added 28 commits February 4, 2026 22:15

fix: use int for scenario IDs to match database schema

90bea61

- Change ScenarioResponse.id from str to int - Change task_scenario_id from Optional[str] to Optional[int] in Task and TaskResponse models - Bump version to 0.2.112 Co-authored-by: Cursor <cursoragent@cursor.com>

Merge pull request #51 from fleet-ai/fix/scenario-id-type

73219dc

fix: use int for scenario IDs to match database schema

fix: handle environment_id in load_task_from_json

a5eafaf

The API returns `environment_id` but load_task_from_json was only looking for `env_id` or `env_key`. Now it checks all three field names. Bump version to 0.2.113. Co-authored-by: Cursor <cursoragent@cursor.com>

Merge pull request #52 from fleet-ai/fix/load-task-from-json-env-id

42bea78

fix: handle environment_id in load_task_from_json

Merge pull request #53 from fleet-ai/fix/import-single-task-error-han…

cc239a1

…dling fix: propagate errors from import_single_task instead of swallowing

refactor: Simplify env_id mapping in load_task_from_json

44a5beb

The API returns 'environment_id', so just use that directly instead of a fallback chain of env_id/env_key/environment_id. Co-authored-by: Cursor <cursoragent@cursor.com>

fix: Add alias for TaskInfo env_key field to support env_id

747d945

The API returns env_id but TaskInfo was renamed to use env_key. Added alias="env_id" so Pydantic accepts both field names during deserialization of API responses. Co-authored-by: Cursor <cursoragent@cursor.com>

fix: Update example files to use task.env_key instead of task.env_id

2d438b3

Co-authored-by: Cursor <cursoragent@cursor.com>

Merge pull request #55 from fleet-ai/feat/add-task-lifecycle-status

7ec526b

feat: Add task_lifecycle_status field to Task model

Merge pull request #56 from fleet-ai/fix/task-response-optional-env-key

3a4f711

fix: make TaskResponse.env_key optional to handle null API responses

Revert "Merge pull request #56 from fleet-ai/fix/task-response-option…

5715f73

…al-env-key" This reverts commit 3a4f711, reversing changes made to 7ec526b.

Merge pull request #57 from fleet-ai/revert/task-response-optional-en…

bb30e38

…v-key revert: restore env_key as required in TaskResponse and Task

export_tasks

942a9af

fix: preserve API-provided verifier SHA in bundle creation

8782e96

cursor Bot reviewed Feb 16, 2026

View reviewed changes

gg2001 force-pushed the main branch from 51131ab to e3c5571 Compare April 14, 2026 22:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: preserve API-provided verifier SHA in bundle creation#62

fix: preserve API-provided verifier SHA in bundle creation#62
shihan-fleet wants to merge 65 commits intomainfrom
verifier-issue

shihan-fleet commented Feb 16, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot Feb 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

shihan-fleet commented Feb 16, 2026

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot Feb 16, 2026

Choose a reason for hiding this comment

Empty SHA now bypasses hash generation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants