Open Issues Need Help
Run OpenHands baseline (about 2 months ago)
AI Summary: The task is to run an OpenHands baseline with an open-weights model on the existing CodeContests, small-repository, and large-repository benchmarks. This involves adapting the existing scripts to drive the OpenHands agent, then evaluating and summarizing the results in the same way as the existing Claude and Codex baselines.
Complexity: 4/5
Labels: enhancement, good first issue