A curated, accuracy-first list of benchmarks for evaluating LLMs on scientific reasoning and discovery — math, physics, chemistry, materials, biology, and agentic science.
subinium/Awesome-Scientific-LLM-Benchmarks ranks #974 on the trending leaderboard with a pulse of 10/100. No cross-source channels are firing yet, so the signal so far comes from GitHub stars only.
It sits at 8 stars without a fresh weekly delta on record — the trending placement here is steady-state interest in the AI agent / LLM tooling stack rather than a 7-day breakout.
Watch-outs: no tagged release on record (treat as pre-stable).
git clone https://github.com/subinium/Awesome-Scientific-LLM-Benchmarks.git
Then follow the README in the cloned directory.