← All Posts
#reinforcement learning
#reinforcement learning
reinforcement learning
3 Posts
AI News
TRL v1.0: The Post-Training Library That Learned to Stop Breaking Things
Hugging Face just dropped TRL v1.0, and it's a bigger deal than a version bump....
AI News
NousCoder-14B: Open-Source Coding Model Catches Up Fast, Trained in Four Days
Nous Research's NousCoder-14B matches or beats larger proprietary coding models, trained in just four days...
AI News
David Silver left DeepMind and raised $1.1B to build an AI that doesn’t need us
Former DeepMind researcher David Silver raised $1.1B for Ineffable Intelligence, aiming to build AI that...