Open Issues Need Help
View All on GitHub good-first-issue: Add synthetic case for a missing drug-disease or safety gate pattern about 2 hours ago
good first issue data
Clinician-built benchmark and live leaderboard for medical AI safety evaluation.
Python
#ai-benchmark#ai-safety#clinical-nlp#clinician-review#evaluation-framework#failure-analysis#failure-atlas#healthcare-ai#huggingface-spaces#language-model-evaluation#llm-evaluation#llm-safety#medfailbench#medical-ai#medical-ai-safety#medical-language-models#medical-llm#patient-safety#source-verification#turkish-medical-ai
good-first-issue: Add usage example notebook to README or docs about 2 hours ago
documentation good first issue
Clinician-built benchmark and live leaderboard for medical AI safety evaluation.
Python
#ai-benchmark#ai-safety#clinical-nlp#clinician-review#evaluation-framework#failure-analysis#failure-atlas#healthcare-ai#huggingface-spaces#language-model-evaluation#llm-evaluation#llm-safety#medfailbench#medical-ai#medical-ai-safety#medical-language-models#medical-llm#patient-safety#source-verification#turkish-medical-ai