Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning https://www.arxiv.org/abs/2402.13669 https://www.alphaxiv.org/ru/overview/2402.13669 https://github.com/sail-sg/sdft