Open Issues Need Help
View All on GitHub Add Color and Emojis to Final CLI Report 2 months ago
enhancement good first issue
An open-source benchmark to evaluate the security and robustness of AI coding agents.
Python
good first issue testing
An open-source benchmark to evaluate the security and robustness of AI coding agents.
Python
Add Ollama Provider for Local LLM Evaluation 2 months ago
enhancement good first issue provider
An open-source benchmark to evaluate the security and robustness of AI coding agents.
Python