How Google Uses Transformers
21 engineering articles about Transformers from Google's engineering team
Other Google Technologies
Other Companies Using Transformers
Articles
Filter:
The article discusses how to fine-tune the Gemma 3 270M model for on-device applications, enabling developers to create custom AI models without the need for expensive hardware.
Ian Ballantyne, Jason Mayes
5 min read
Includes Code
Has Summary
--
The article provides an in-depth exploration of the EmbeddingGemma architecture, detailing its origins, embedding generation process, and the comprehensive training methodology.
Henrique Schechter Vera, Juyeong Ji, Sahil Dua
7 min read
Includes Code
Has Summary
--
EmbeddingGemma is an innovative open embedding model designed for on-device AI applications, featuring 308 million parameters for efficient performance.
Min Choi, Sahil Dua, Alice Lisak
5 min read
Has Summary
--
The article introduces Gemma 3 270M, a compact AI model designed for hyper-efficient task-specific fine-tuning.
Olivier Lacombe, Kathleen Kenealy, Kat Black, Ravin Kumar, Francesco Visin, Jiageng Zhang
5 min read
Has Summary
--
The article introduces Gemma 3n, a mobile-first architecture designed for on-device AI, highlighting its multimodal capabilities and architectural innovations.
Omar Sanseviero, Ian Ballantyne
9 min read
Includes Code
Has Summary
--
The article discusses the new features and improvements in Gemma 3, highlighting its vision-language capabilities, architectural changes for memory efficiency, and enhanced multilingual support.
Ju-yeong Ji, Ravin Kumar
9 min read
Includes Code
Has Summary
--
Gemma 3 is the latest version of the Gemma open-model family, boasting enhanced capabilities such as multimodality, longer context windows, and improved reasoning.
Omar Sanseviero, Philipp Schmid
5 min read
Includes Code
Has Summary
--
The article discusses the launch of ShieldGemma 2, a safety content classifier model built on Gemma 3, aimed at detecting harmful content in both synthetic and natural images.
Dana Kurniawan, Wenjun Zeng, Ryan Mullins
3 min read
Has Summary
--
PaliGemma 2 mix is an advanced vision-language model designed for multiple tasks, allowing developers to utilize a single model for various applications such as image captioning, object detection, ...
Omar Sanseviero, Andreas Steiner
3 min read
Includes Code
Has Summary
--
PaliGemma 2 is the latest vision-language model from Google, designed to simplify the process of building advanced AI that can interpret visual inputs.
Daniel Keysers, Andreas Steiner
3 min read
Has Summary
--
The Web AI Summit 2024, hosted by Google on October 18, 2024, focused on client-side AI for developers, showcasing how machine learning models can operate offline in web browsers.
Jason Mayes
10 min read
Has Summary
--
The article discusses the expansion of the Responsible Generative AI Toolkit, introducing new tools designed for various large language models (LLMs) like Gemma and Gemini.
Ryan Mullins
3 min read
Has Summary
--
The article explores the RecurrentGemma architecture, a hybrid model that combines gated linear recurrences with local sliding window attention, enhancing performance for long context prompts.
Ju-yeong Ji, Ravin Kumar
6 min read
Includes Code
Has Summary
--
The article provides an overview of the Gemma model family architectures, detailing its lightweight, state-of-the-art open models derived from Gemini research.
Ju-yeong Ji, Ravin Kumar
9 min read
Includes Code
Has Summary
--
This article provides a comprehensive guide on using Gemma with Ray on Vertex AI, detailing the steps to set up, fine-tune, and deploy machine learning models.
Ju-yeong Ji, Ivan Nardini
12 min read
Includes Code
Has Summary
--
The article discusses the release of the Gemma 2 model with 27 billion parameters, highlighting its capabilities in Keras and integration with JAX for efficient model training.
Martin Görner
5 min read
Includes Code
Has Summary
--
The article introduces PaliGemma, an open vision-language model, along with Gemma 2, the next generation of the Gemma models, and updates to the Responsible AI Toolkit.
Tris Warkentin, Xiaohua Zhai, Ludovic Peran
4 min read
Has Summary
--
This article discusses how to publish Keras models on Kaggle and Hugging Face, highlighting the ease of sharing fine-tuned models with the community.
Martin Görner
4 min read
Includes Code
Has Summary
--
The article introduces the expansion of the Gemma family with two new models, CodeGemma and RecurrentGemma, designed specifically for developers and researchers.
Tris Warkentin, Jane Fine
4 min read
Has Summary
--
The article highlights the achievements and activities of Google Machine Learning communities in the second quarter of 2023, showcasing various training campaigns, community events, and innovative ...
GolangGoogle CloudGPT-4Hugging FaceJAXKerasKubernetesLangChainMachine LearningPaLMStable DiffusionTensorFlowTransformersVertex AI
Nari Yoon, Bitnoori Keum, Hee Jung, Soonson Kwon
14 min read
Has Summary
--
The article highlights the achievements of Machine Learning Google Developer Experts (GDEs) in Q2 2021, showcasing their contributions to the global ML ecosystem through various events, projects, a...
AutoMLBERTDockerGeminiGolangGoogle CloudJAXKerasKongMachine LearningNeural NetworksServerlessTensorFlowTransformersVertex AI
HyeJung Lee, MJ You, ML Ecosystem Community Managers
6 min read
Has Summary
--
You've reached the end! All 21 articles loaded.