Import Code to GitHub

资讯

Provider-agnostic, open-source evaluation infrastructure for language models

OpenBench provides standardized, reproducible benchmarking for LLMs across 30+ evaluation suites (and growing) spanning knowledge, math, reasoning, coding, science, reading comprehension, health, long ...

GitHub6 天

GitHub - terryso/claude-auto-resume

A shell script utility that automatically resumes Claude CLI tasks when usage limits are lifted, or executes custom shell commands after waiting periods. It detects Claude usage restrictions, waits ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果

资讯

Provider-agnostic, open-source evaluation infrastructure for language models

GitHub - terryso/claude-auto-resume

今日热点