Leveraging Multimodal LLMs for Shopify’s Global Catalogue: Recap of Expo Talk at ICLR 2025

Shopify’s Global Catalogue demonstrates the impact of multimodal LLMs on one of commerce’s hardest problems: building a unified, structured, and continuously evolving understanding of billions of product listings created by millions of merchants.

Audrey-Anne Guindon
13 min readintermediate
--
View Original

Overview

The article discusses Shopify's Global Catalogue, which utilizes multimodal Large Language Models (LLMs) to standardize and enrich product data across its platform. It details the challenges of fragmented product data and the engineering solutions implemented to enhance product discovery and AI-driven commerce.

What You'll Learn

1

How to leverage multimodal LLMs for product data standardization

2

Why structured product data is crucial for AI-driven commerce

3

How to implement a data curation pipeline using LLMs

4

When to apply selective field extraction during model training

Prerequisites & Requirements

  • Understanding of product data structures and AI/ML concepts
  • Familiarity with LLMs and data processing frameworks(optional)

Key Questions Answered

What are the main challenges of fragmented product data on e-commerce platforms?
Fragmented product data leads to issues such as unstructured data, schema heterogeneity, data quality problems, and difficulties in semantic search. These challenges hinder AI agents from delivering accurate product discovery experiences, making it essential to unify and standardize product data for effective AI-driven commerce.
How does Shopify's Global Catalogue improve product discovery?
The Global Catalogue enhances product discovery by standardizing and enriching product data through a four-layer architecture that includes data foundation, understanding, matching, and reconciliation. This structure allows for better semantic search, recommendations, and conversational commerce, ultimately improving the shopping experience.
What is the role of fine-tuning in optimizing LLM performance?
Fine-tuning involves training pre-trained language models on specific datasets to improve their performance for particular tasks. Shopify fine-tunes smaller open-source vision LLMs to achieve better accuracy and control while reducing computational costs, which is crucial given their high volume of product updates.
What infrastructure supports Shopify's LLM-powered inferences?
Shopify's infrastructure supports 40 million LLM calls daily, utilizing technologies like the Triton Inference Server for model serving, a Kafka-based dataflow streaming pipeline for real-time data processing, and FP8 quantization to optimize GPU memory usage. This setup enables efficient handling of vast amounts of product data.

Key Statistics & Figures

Daily product updates processed
10 million
This figure represents the volume of product updates Shopify handles daily from various sources, ensuring that the Global Catalogue remains current.
Daily LLM calls
40 million
Shopify's infrastructure supports this high volume of calls, allowing for extensive product inferences and data processing.
Reduction in median latency
from 2 seconds to 500 milliseconds
This improvement is achieved through selective field prompting, enhancing the responsiveness of the system.
Reduction in GPU usage
40%
This reduction allows Shopify to serve more requests with the same hardware, improving cost-effectiveness.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Infrastructure
Triton Inference Server
Orchestrates model serving across Shopify's GPU fleet.
Data Processing
Kafka
Used for real-time data streaming and processing.
Performance Optimization
Fp8 Quantization
Reduces GPU memory footprint while maintaining inference accuracy.

Key Actionable Insights

1
Implementing a unified product data schema can significantly enhance AI-driven search capabilities.
By standardizing product data across platforms, businesses can improve the accuracy and relevance of search results, leading to better customer experiences and increased sales.
2
Utilizing selective field extraction during model training can improve model generalization and reduce latency.
This approach allows models to adapt to varying extraction requirements without retraining, which is crucial for maintaining performance in dynamic environments.
3
Establishing a robust data curation pipeline is essential for maintaining high-quality product data.
A well-structured pipeline that combines automated and human review processes can enhance data quality and ensure that AI models have access to reliable information.

Common Pitfalls

1
Overlooking the importance of data quality can lead to poor AI model performance.
Without a focus on high-quality, structured data, AI models may struggle to deliver accurate results, leading to a negative impact on user experience and business outcomes.
2
Failing to implement a scalable infrastructure can hinder growth.
As the volume of data increases, a lack of scalable infrastructure can lead to bottlenecks, affecting the speed and efficiency of data processing and model inference.

Related Concepts

Ai-driven Commerce
Data Standardization Techniques
Multimodal Llm Applications
Product Data Management