Evidence-grounded scientific agents with replayable evaluation and human review — the public integration + evaluation layer for AI-for-science workflows.

agent-evaluation ai-for-science autonomous-science chemistry evidence-grounded llm materials-science scientific-agents
4 Open Issues Need Help Last updated: Jul 1, 2026

Open Issues Need Help

View All on GitHub
enhancement good first issue

Evidence-grounded scientific agents with replayable evaluation and human review — the public integration + evaluation layer for AI-for-science workflows.

Python
#agent-evaluation#ai-for-science#autonomous-science#chemistry#evidence-grounded#llm#materials-science#scientific-agents

Evidence-grounded scientific agents with replayable evaluation and human review — the public integration + evaluation layer for AI-for-science workflows.

Python
#agent-evaluation#ai-for-science#autonomous-science#chemistry#evidence-grounded#llm#materials-science#scientific-agents
enhancement good first issue

Evidence-grounded scientific agents with replayable evaluation and human review — the public integration + evaluation layer for AI-for-science workflows.

Python
#agent-evaluation#ai-for-science#autonomous-science#chemistry#evidence-grounded#llm#materials-science#scientific-agents
enhancement good first issue

Evidence-grounded scientific agents with replayable evaluation and human review — the public integration + evaluation layer for AI-for-science workflows.

Python
#agent-evaluation#ai-for-science#autonomous-science#chemistry#evidence-grounded#llm#materials-science#scientific-agents