https://github.com/researchim-ai/re-rl заехало
0 viewsОткрыть в Telegram →
Из этого канала
- #5968I trained a 1.8M params model from scratch on a total of ~40M tokens.…
I trained a 1.8M params model from scratch on a total of ~40M tokens.…
- #5969надо в модельки дома затащить)
надо в модельки дома затащить)
- #5971Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations…
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations https://arxiv.org/abs/2602.05885 https://www.alphaxiv.org/overview/2602.05885…
- #5966BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem…
BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving https://arxiv.org/abs/2502.03438…
- #5964https://huggingface.co/datasets/internlm/Lean-Github
https://huggingface.co/datasets/internlm/Lean-Github