3

mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR
arXiv 2026
Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments
arXiv 2026
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data
ACL 2026
Token Distillation: Attention-aware Input Embeddings for New Tokens
ICLR 2026