How Google Uses Embedding

18 engineering articles about Embedding from Google's engineering team

Other Google Technologies

Gemini(219)Google Cloud(149)Golang(109)Firebase(102)JAX(102)Vertex AI(73)

Other Companies Using Embedding

Articles

Filter:

Google

Intermediate

Gemma explained: EmbeddingGemma Architecture and Recipe

The article provides an in-depth exploration of the EmbeddingGemma architecture, detailing its origins, embedding generation process, and the comprehensive training methodology.

EmbeddingFine-tuningGeminiHugging FaceTransformerTransformersVertex AI

Henrique Schechter Vera, Juyeong Ji, Sahil Dua

7 min read

Includes Code

Has Summary

Google

Beginner

Gemini Batch API now supports Embeddings and OpenAI Compatibility

The article discusses the recent enhancements to the Gemini Batch API, which now includes support for the Gemini Embedding model and compatibility with the OpenAI SDK.

EmbeddingGemini

Lucia Loher, Patrick Löber

2 min read

Includes Code

Has Summary

Google

Intermediate

From Fine-Tuning to Production: A Scalable Embedding Pipeline with Dataflow

This article discusses the integration of Google's EmbeddingGemma model with Google Cloud's Dataflow to create a scalable embedding pipeline for AI applications.

ApacheEmbeddingGeminiGoogle CloudHugging FaceLarge Language ModelsRetrieval Augmented Generation

Danny McCormick, Ian Ballantyne, Olivier Lacombe

5 min read

Includes Code

Has Summary

Google

Intermediate

Introducing EmbeddingGemma: The Best-in-Class Open Model for On-Device Embeddings

EmbeddingGemma is an innovative open embedding model designed for on-device AI applications, featuring 308 million parameters for efficient performance.

EmbeddingGeminiHugging FaceLangChainOllamaRetrieval Augmented GenerationTransformersVertex AI

Min Choi, Sahil Dua, Alice Lisak

5 min read

Has Summary

Google

Intermediate

Gemini Embedding: Powering RAG and context engineering

The article discusses the Gemini Embedding text model and its applications in various industries, highlighting its effectiveness in enhancing AI applications through context engineering and retriev...

EmbeddingGemini

Vishal Dharmadhikari, Janie Zhang

4 min read

Includes Code

Has Summary

Google

Intermediate

Gemini Embedding now generally available in the Gemini API

The article announces the general availability of the Gemini Embedding text model, gemini-embedding-001, in the Gemini API and Vertex AI.

EmbeddingGeminiVertex AI

Min Choi, Janie Zhang

3 min read

Includes Code

Has Summary

Google

Intermediate

Gemini API I/O updates

The article discusses the latest updates to the Gemini API, highlighting new models and functionalities that enhance developers' ability to create applications using generative AI.

EmbeddingGeminiJSON

Shrestha Basu Mallick, Logan Kilpatrick, Alisa Fortin, Ivan Solovyev

7 min read

Includes Code

Has Summary

Google

Advanced

Build and train a recommender system in 10 minutes using Keras and JAX

The article introduces Keras Recommenders, a new library designed to simplify the creation of state-of-the-art recommendation systems using Keras with JAX, TensorFlow, or PyTorch.

EmbeddingGRUJAXKerasPyTorchTensorFlow

Yufeng Guo, Monica Song

3 min read

Includes Code

Has Summary

Google

Intermediate

Gemma explained: What’s new in Gemma 3

The article discusses the new features and improvements in Gemma 3, highlighting its vision-language capabilities, architectural changes for memory efficiency, and enhanced multilingual support.

BERTEmbeddingGeminiTransformers

Ju-yeong Ji, Ravin Kumar

9 min read

Includes Code

Has Summary

Google

Beginner

State-of-the-art text embedding via the Gemini API

The article discusses the introduction of the Gemini Embedding text model (gemini-embedding-exp-03-07) available through the Gemini API.

EmbeddingGeminiVertex AI

Logan Kilpatrick, Zach Gleicher, Parashar Shah

3 min read

Includes Code

Has Summary

Google

Advanced

Vertex AI RAG Engine: A developers tool

The article discusses the Vertex AI RAG Engine, a tool designed to help developers build grounded generative AI applications by addressing challenges like hallucinations and outdated knowledge.

EmbeddingGenerative AIGoogle CloudLarge Language ModelsRetrieval Augmented GenerationVertex AI

Crispin Velez, Holt Skinner

6 min read

Has Summary

Google

Intermediate

The article discusses the use of multimodal embeddings to enhance visual search capabilities, particularly for artists and enterprise-scale document search.

EmbeddingFirebaseGoogle CloudJavaScriptShellSvelteVertex AI

Anthony Tripaldi

10 min read

Includes Code

Has Summary

Google

Intermediate

Gemma explained: PaliGemma architecture

The article discusses the PaliGemma architecture, a lightweight open vision-language model (VLM) inspired by PaLI-3.

EmbeddingFine-tuningJAXKeras

Ju-yeong Ji, Ravin Kumar

6 min read

Includes Code

Has Summary

Google

Advanced

Gemma explained: RecurrentGemma architecture

The article explores the RecurrentGemma architecture, a hybrid model that combines gated linear recurrences with local sliding window attention, enhancing performance for long context prompts.

EmbeddingNeural NetworksTransformerTransformers

Ju-yeong Ji, Ravin Kumar

6 min read

Includes Code

Has Summary

Google

Intermediate

Gemma explained: What’s new in Gemma 2

The article discusses the release of Gemma 2, a new suite of open models that sets a new standard for performance and accessibility in conversational AI.

EmbeddingFine-tuningGoogle CloudGPTHugging FaceJAXKeras

Ju-yeong Ji, Ravin Kumar

5 min read

Includes Code

Has Summary

Google

Intermediate

Gemma explained: An overview of Gemma model family architectures

The article provides an overview of the Gemma model family architectures, detailing its lightweight, state-of-the-art open models derived from Gemini research.

BERTEmbeddingGeminiGPTHugging FaceKerasT5TransformerTransformers

Ju-yeong Ji, Ravin Kumar

9 min read

Includes Code

Has Summary

Google

Advanced

AI Edge Torch Generative API for Custom LLMs on Device

The article introduces the AI Edge Torch Generative API, designed to enable developers to create high-performance LLMs in PyTorch for deployment on edge devices using the TensorFlow Lite runtime.

EmbeddingPyTorchTensorFlow

Cormac Brick, Haoliang Zhang

10 min read

Includes Code

Has Summary

Google

Intermediate

Summer updates from Coral

The article discusses the latest updates from Coral, including a partnership with balena, new open-source tools, and enhancements to their ML software stack.

Artificial IntelligenceAutoMLEmbeddingGeminiGolangMachine LearningTensorFlow

Coral Team

5 min read

Includes Code

Has Summary

You've reached the end! All 18 articles loaded.