Speech-aware KV cache pruning for long-form speech LLMs (Qwen2-Audio, SALMONN). Token/head/chunk-level pruners + eval on LibriSpeech-long & GigaSpeech.
jelllott/speechkv-trim has added +12 stars since the first tracked point, with current momentum at 16.90.