Shopify logo

How Shopify Uses PySpark

6 engineering articles about PySpark from Shopify's engineering team

Articles

Filter:
Shopify logo
Shopify
Intermediate
The article discusses the complexities of tax compliance for U. S. merchants and details the development of Shopify's Tax Insights feature.
Siraj Ali
12 min read
Has Summary
--
Shopify logo
Shopify
Beginner
The article discusses the implementation of double entry transition tables at Shopify to effectively track state changes for merchants using Shopify Balance.
Justin Pauley
9 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Beginner
This article discusses the development of Seamster, a production-grade SQL modeling workflow created by Shopify to improve data reporting efficiency.
Michelle Ark
12 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Intermediate
This article outlines the process of building an email experimentation pipeline from scratch, addressing the challenges faced by Shopify's data teams in conducting A/B tests for external channels.
Mojan Benham
10 min read
Has Summary
--
Shopify logo
Shopify
Intermediate
The article discusses how to track historical state using Type 2 dimensional models in application databases, contrasting it with the traditional Type 1 dimension approach.
Ian Whitestone
13 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Advanced
The article discusses the challenges and methodologies involved in categorizing products at scale on the Shopify platform, which has over 1 million business owners and billions of products.
Jeet Mehta
13 min read
Includes Code
Has Summary
--

You've reached the end! All 6 articles loaded.