samcodesign

samcodesign/securedev-bench

An open-source benchmark to evaluate the security and robustness of AI coding agents.

2 stars 1 forks 2 watchers Python GNU General Public License v3.0

3 Open Issues Need Help Last updated: Sep 6, 2025

Open Issues Need Help

View All on GitHub

Add Color and Emojis to Final CLI Report 10 months ago

enhancement good first issue

samcodesign/securedev-bench

2

An open-source benchmark to evaluate the security and robustness of AI coding agents.

Python

[DX] Create a Fast Smoke-Test Command for Contributors 10 months ago

good first issue testing

samcodesign/securedev-bench

2

An open-source benchmark to evaluate the security and robustness of AI coding agents.

Python

Add Ollama Provider for Local LLM Evaluation 10 months ago

enhancement good first issue provider

samcodesign/securedev-bench

2

An open-source benchmark to evaluate the security and robustness of AI coding agents.

Python