How Google Uses Reinforcement Learning
Two articles about reinforcement learning from Google's engineering team
Articles
Introduces Tunix, a new open-source, JAX-native library for post-training large language models (LLMs).
Srikanth Kilaru, Tianshu Bao
7 min read
Includes Code
Has Summary
--
Gemma 3 is the latest version of the Gemma open-model family, adding multimodality, longer context windows, and improved reasoning.
Omar Sanseviero, Philipp Schmid
5 min read
Includes Code
Has Summary