Mathy AI Substack
Subscribe
Sign in
Share this post
Mathy AI Substack
Fine-Tuning Methods for LLMs(SFT and RL): Explanations, Objectives and Gradients
Copy link
Facebook
Email
Notes
More
Fine-Tuning Methods for LLMs(SFT and RL…
Mike Erlihson, Mathy AI
Apr 25
2
Share this post
Mathy AI Substack
Fine-Tuning Methods for LLMs(SFT and RL): Explanations, Objectives and Gradients
Copy link
Facebook
Email
Notes
More
1
Based on Appendix A in the GRPO paper: https://arxiv.org/pdf/2402.03300
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
Fine-Tuning Methods for LLMs(SFT and RL…
Share this post
Based on Appendix A in the GRPO paper: https://arxiv.org/pdf/2402.03300