Mathy AI Substack

Mathy AI Substack

Share this post

Mathy AI Substack
Mathy AI Substack
Fine-Tuning Methods for LLMs(SFT and RL): Explanations, Objectives and Gradients

Fine-Tuning Methods for LLMs(SFT and RL…

Mike Erlihson, Mathy AI
Apr 25
2

Share this post

Mathy AI Substack
Mathy AI Substack
Fine-Tuning Methods for LLMs(SFT and RL): Explanations, Objectives and Gradients
1

Based on Appendix A in the GRPO paper: https://arxiv.org/pdf/2402.03300

Read →
Comments
User's avatar
© 2025 Mike E.
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share