An open-source benchmark to evaluate the security and robustness of AI coding agents.

2 stars 1 forks 2 watchers Python GNU General Public License v3.0
3 Open Issues Need Help Last updated: Sep 6, 2025

Open Issues Need Help

View All on GitHub
enhancement good first issue

An open-source benchmark to evaluate the security and robustness of AI coding agents.

Python
good first issue testing

An open-source benchmark to evaluate the security and robustness of AI coding agents.

Python
enhancement good first issue provider

An open-source benchmark to evaluate the security and robustness of AI coding agents.

Python