Releases: OpenMLRL/LLM_Collab_Code_Completion

v1.3.2

29 Jan 15:05
9241b8a

Changelog

Reconstruct the reward system and align with CoMLRL 1.3.2.

ClassEval

IAC and MAGRPO were tested on an H200 (33 h and 48 h, respectively); MAAC needs a B200 (estimated time 50 h).

v1.0.0

20 Nov 21:33
fbf1d69

This version supports training scenarios in which two agents collaborate.

Model links:

https://huggingface.co/OpenMLRL/CE_2agents_TAKE_JOB_2t_code_feedback_0p
https://huggingface.co/OpenMLRL/CE_2agents_TAKE_JOB_2t_code_feedback_1p

turn_1
turn_2