Hybrid-Gym: Training Coding Agents to Generalize Across Tasks https://arxiv.org/abs/2602.16819 https://www.alphaxiv.org/ru/overview/2602.16819 https://github.com/yiqingxyq/Hybrid-Gym
Hybrid-Gym: Training Coding Agents to Generalize Across Tasks…
0 viewsОткрыть в Telegram →
Из этого канала
- #6087Privileged Information Distillation for Language Models…
Privileged Information Distillation for Language Models https://arxiv.org/abs/2602.04942 https://www.alphaxiv.org/ru/overview/2602.04942
- #6088Self-Distillation Enables Continual Learning…
Self-Distillation Enables Continual Learning https://www.arxiv.org/abs/2601.19897 https://www.alphaxiv.org/ru/overview/2601.19897…
- #6089Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning…
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning https://www.arxiv.org/abs/2402.13669 https://www.alphaxiv.org/ru/overview/2402.13669…
- #6085Your Transformer is Secretly an EOT Solver…
Your Transformer is Secretly an EOT Solver https://elonlit.com/scrivings/your-transformer-is-secretly-an-eot-solver/
- #6083https://huggingface.co/Nanbeige/Nanbeige4.1-3B
https://huggingface.co/Nanbeige/Nanbeige4.1-3B