How Google Uses Reinforcement Learning

2 engineering articles about Reinforcement Learning from Google's engineering team

Other Google Technologies

Gemini(219)Google Cloud(149)Golang(109)Firebase(102)JAX(102)Vertex AI(73)

Other Companies Using Reinforcement Learning

Articles

Filter:

Google

Advanced

Introducing Tunix: A JAX-Native Library for LLM Post-Training

The article introduces Tunix, a new open-source, JAX-native library designed for post-training of large language models (LLMs).

FlaxGoogle CloudJAXReinforcement LearningRLHF

Srikanth Kilaru, Tianshu Bao

7 min read

Includes Code

Has Summary

Google

Intermediate

Introducing Gemma 3: The Developer Guide

Gemma 3 is the latest version of the Gemma open-model family, boasting enhanced capabilities such as multimodality, longer context windows, and improved reasoning.

Hugging FaceJAXOllamaReinforcement LearningRLHFTransformersVertex AI

Omar Sanseviero, Philipp Schmid

5 min read

Includes Code

Has Summary

You've reached the end! All 2 articles loaded.