Skip to content

[Harbor] Support agents beyond terminus 2 such as open hands #1184

@CharlieFRuan

Description

@CharlieFRuan

Currently we only support RL on Terminus2:

We'd want to RL on other agent harness via Harbor as well.

This might include plumbing Harbor and make changes if needed in SkyRL internals.

Need to be especially careful about whether open hands does off policy things that make chat history non-strictly appending (e.g. summarization).

We should support RL on other agents with strictly appending chat history first, and then we can support step-wise training for all agents.

Final deliverable: a curve on perhaps CodeContest, compare against Terminus2

Hardware needed:

  • 1xA100 for development (final curve can be run by SkyRL maintainers)
  • Modal/Daytona (8 sandbox concurrency should be enough)

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions