[ICML2026] Official Implementation of AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in Unified Multimodal Models via Decompositional Verifiable Reward
huangrh99/AlphaGRPO has gained 27 stars since the first tracked point, with a current momentum of 16.80.