Mike’s Substack
Subscribe
Sign in
Share this post
Mike’s Substack
Fine-tuning LLM with RL from the angle of the memory
Copy link
Facebook
Email
Notes
More
Fine-tuning LLM with RL from the angle of the…
Mike Erlihson, Mathy AI
Feb 4
3
Share this post
Mike’s Substack
Fine-tuning LLM with RL from the angle of the memory
Copy link
Facebook
Email
Notes
More
How many models you need to store in memory for PPO and GRPO?
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
Fine-tuning LLM with RL from the angle of the…
Share this post
How many models you need to store in memory for PPO and GRPO?