Skip to content

feat: add Harbor integration#751

Merged
bxyu-nvidia merged 18 commits intoNVIDIA-NeMo:mainfrom
grace-lam:feat/harbor_integration
Mar 11, 2026
Merged

feat: add Harbor integration#751
bxyu-nvidia merged 18 commits intoNVIDIA-NeMo:mainfrom
grace-lam:feat/harbor_integration

Conversation

@grace-lam
Copy link
Copy Markdown
Contributor

Adds the full Harbor agent integration in NeMo Gym for multi-turn RL training workflows, including custom agent/env wiring, rollout conversion, and operational docs.

Scope

This PR covers the full responses_api_agents/harbor_agent/ package:

  • app.py (Harbor job orchestration + response shaping)
  • utils.py (trajectory → NeMo Gym output conversion)
  • custom_agents/ (Terminus-2 NeMo Gym compatibility path)
  • custom_envs/singularity/ (HPC-oriented execution environment)
  • configs/harbor_agent.yaml (default agent config)
  • tests/test_app.py (integration/unit behavior checks)
  • README.md

Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
@grace-lam grace-lam force-pushed the feat/harbor_integration branch from dad532c to 524cfd3 Compare February 24, 2026 05:44
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
Signed-off-by: Grace Lam <gralam@nvidia.com>
@cmunley1
Copy link
Copy Markdown
Contributor

cmunley1 commented Mar 7, 2026

Have you been able to run training or reward profiling?

https://docs.nvidia.com/nemo/gym/latest/contribute/environments/new-environment.html#contribution-workflow

@grace-lam
Copy link
Copy Markdown
Contributor Author

Yes, here are sample training and validation plots for an overfitting experiment.

A few other team members are currently testing out the agent and/or building on top of this code.

harbor agent rl rewards

@bxyu-nvidia bxyu-nvidia merged commit 7998527 into NVIDIA-NeMo:main Mar 11, 2026
6 checks passed
MahanFathi pushed a commit that referenced this pull request Mar 24, 2026
Adds the full Harbor agent integration in NeMo Gym for multi-turn RL
training workflows, including custom agent/env wiring, rollout
conversion, and operational docs.

### Scope
This PR covers the full responses_api_agents/harbor_agent/ package:

- app.py (Harbor job orchestration + response shaping)
- utils.py (trajectory → NeMo Gym output conversion)
- custom_agents/ (Terminus-2 NeMo Gym compatibility path)
- custom_envs/singularity/ (HPC-oriented execution environment)
- configs/harbor_agent.yaml (default agent config)
- tests/test_app.py (integration/unit behavior checks)
- README.md

---------

Signed-off-by: Grace Lam <gralam@nvidia.com>
jsw-zorro pushed a commit to niletron/Gym that referenced this pull request Apr 7, 2026
Adds the full Harbor agent integration in NeMo Gym for multi-turn RL
training workflows, including custom agent/env wiring, rollout
conversion, and operational docs.

### Scope
This PR covers the full responses_api_agents/harbor_agent/ package:

- app.py (Harbor job orchestration + response shaping)
- utils.py (trajectory → NeMo Gym output conversion)
- custom_agents/ (Terminus-2 NeMo Gym compatibility path)
- custom_envs/singularity/ (HPC-oriented execution environment)
- configs/harbor_agent.yaml (default agent config)
- tests/test_app.py (integration/unit behavior checks)
- README.md

---------

Signed-off-by: Grace Lam <gralam@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants