You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The input data is stored in `tasks/eval/agents/web_agents/chat_history_seed.json`. Currently, this file contains **sample data with only one mission** for testing and development purposes.
415
+
416
+
#### Sample Data Structure
417
+
418
+
Each record in the input file represents one step (turn) of a mission:
419
+
415
420
```json
416
421
{
417
-
"id": "mission_1_step_1",
418
-
"mission_id": "mission_1",
419
-
"turn": 1,
420
-
"mission": "Book a flight from NYC to LAX",
421
-
"navigational_directions": "Click on the search button",
422
+
"id": "mission_01_2",
423
+
"mission_id": "mission_01",
424
+
"mission": "search for one way flight from hyd to chennai on nov 1 2025",
425
+
"date": "2025-11-11 15:12:56",
426
+
"navigational_directions": "",
427
+
"turn": 2,
428
+
"chat_history": [
429
+
{
430
+
"role": "system",
431
+
"content": [
432
+
{
433
+
"text": "You are a web automation agent...",
434
+
"type": "text"
435
+
}
436
+
]
437
+
},
438
+
{
439
+
"role": "user",
440
+
"content": [
441
+
{
442
+
"text": "Help me now to complete the assigned mission...",
443
+
"type": "text"
444
+
}
445
+
]
446
+
},
447
+
{
448
+
"content": "I'll help you search for a one-way flight...",
449
+
"role": "assistant",
450
+
"tool_calls": [
451
+
{
452
+
"id": "tooluse_O5Dr64r9RC-lW8BNsdHTng",
453
+
"type": "function",
454
+
"function": {
455
+
"name": "screenshot_tool",
456
+
"arguments": "{\"take_screenshot\": true}"
457
+
}
458
+
}
459
+
]
460
+
},
461
+
{
462
+
"role": "tool",
463
+
"tool_call_id": "tooluse_O5Dr64r9RC-lW8BNsdHTng",
464
+
"name": "screenshot_tool",
465
+
"content": "success"
466
+
}
467
+
],
468
+
"current_user_text": "You are now midway through the assigned mission...",
0 commit comments