Skip to content

feat: add WorkloadExecuting status to evaluation runs#1391

Open
mjnovice wants to merge 1 commit intomainfrom
feat/add-workload-executing-eval-status
Open

feat: add WorkloadExecuting status to evaluation runs#1391
mjnovice wants to merge 1 commit intomainfrom
feat/add-workload-executing-eval-status

Conversation

@mjnovice
Copy link
Contributor

@mjnovice mjnovice commented Feb 28, 2026

Summary

  • Adds WorkloadExecuting status to distinguish agent execution from evaluation/scoring in the eval run lifecycle
  • Eval runs are now created with WorkloadExecuting (4) status instead of IN_PROGRESS (1)
  • After agent execution completes, a new AGENT_EXECUTION_COMPLETED event transitions the run to IN_PROGRESS before evaluators execute
  • New flow: WorkloadExecuting → Running → Completed/Failed

Depends on https://github.com/UiPath/Agents/pull/4529

Changes

  • events.py: Add AGENT_EXECUTION_COMPLETED event type and AgentExecutionCompletedEvent model
  • runtime.py: Publish AGENT_EXECUTION_COMPLETED after agent finishes, before evaluators run
  • _progress_reporter.py: Add WORKLOAD_EXECUTING=4 to enum, create eval runs with new status, handle event to transition to IN_PROGRESS

Related

Test plan

  • Run evaluations end-to-end and verify eval run starts in WorkloadExecuting status
  • Verify status transitions to Running after agent execution completes
  • Verify status transitions to Completed/Failed after evaluators finish
  • Verify resume mode still works correctly
  • Run existing eval tests (pytest tests/cli/eval/)

🤖 Generated with Claude Code

@github-actions github-actions bot added test:uipath-langchain Triggers tests in the uipath-langchain-python repository test:uipath-llamaindex Triggers tests in the uipath-llamaindex-python repository labels Feb 28, 2026
…n runs

Differentiate between agent execution phase (WorkloadExecuting) and
evaluation/scoring phase (Running) in eval run status transitions.

New flow: Pending → WorkloadExecuting → Running → Completed/Failed
with WorkloadFailed as a terminal state for agent execution failures.

Changes:
- Add WORKLOAD_EXECUTING=4 and WORKLOAD_FAILED=5 to EvaluationStatus enum
- Create eval runs with WorkloadExecuting status (was IN_PROGRESS)
- Add MOVE_EVAL_RUN_TO_RUNNING event to transition after agent succeeds
- Add EvalRunStatusUpdateEvent model for status-only updates
- Add workload_failed flag to EvalRunUpdatedEvent for proper status selection
- Add move_eval_run_status method and handler in progress reporter

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@mjnovice mjnovice force-pushed the feat/add-workload-executing-eval-status branch from 3ab9261 to e61b12a Compare March 10, 2026 00:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

test:uipath-langchain Triggers tests in the uipath-langchain-python repository test:uipath-llamaindex Triggers tests in the uipath-llamaindex-python repository

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant