Mathy AI Substack
Subscribe
Sign in
Share this post
Mathy AI Substack
An unusual look of the KL-Divergence Term in Deep Seek R1 Training Objective?
Copy link
Facebook
Email
Notes
More
An unusual look of the KL-Divergence Term in…
Mike Erlihson, Mathy AI
Jan 28
5
Share this post
Mathy AI Substack
An unusual look of the KL-Divergence Term in Deep Seek R1 Training Objective?
Copy link
Facebook
Email
Notes
More
DeepSeek Blog Series: based on http://joschu.net/blog/kl-approx.html
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
An unusual look of the KL-Divergence Term in…
Share this post
DeepSeek Blog Series: based on http://joschu.net/blog/kl-approx.html