Reproducible recipe: serve abliterated Gemma-4-12B (gemma4_unified) at 50-118 tok/s on no-NVLink Blackwell (SM120) via vLLM nightly + ModelOpt FP8/NVFP4 + MTP spec-decode.
lna-lab/gemma4-12b-vllm-sm120 is sitting at #758 on the trending leaderboard with a pulse of 18/100 with no cross-source channels firing yet — GitHub-stars-only signal so far.
It sits at 13 stars without a fresh weekly delta on record — the trending placement here is steady-state interest in the AI agent / LLM tooling stack rather than a 7-day breakout.
Watch-outs: no tagged release on record (treat as pre-stable).
git clone https://github.com/lna-lab/gemma4-12b-vllm-sm120.gitThen follow the README in the cloned directory.
//COMMENTS · 0
Sign in to join the discussion