Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

benchmark forecastbench forecasting llm-benchmarking
8 Open Issues Need Help Last updated: Jul 1, 2026

Open Issues Need Help

View All on GitHub
leaderboard good first issue

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking
leaderboard good first issue

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking
leaderboard good first issue

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking
good first issue external forecasts

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking
data good first issue

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking

Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.

Python
#benchmark#forecastbench#forecasting#llm-benchmarking