How we built it: Stripe Radar

Ryan Drapeau

What makes Radar, Stripe’s fraud prevention solution, so powerful? Here are some of the key decisions made—and lessons learned—in the years it has taken to build it.

Stripe

•

Ryan Drapeau

•11 min read•advanced•

--

•View Original

ChatGPTElasticsearchXGBoost

Overview

The article discusses the development of Stripe Radar, a fraud prevention solution that evaluates transactions in real-time to prevent fraud. It highlights the evolution of machine learning architectures used in Radar, the importance of feature engineering, and the necessity of providing clear explanations for fraud decisions to users.

What You'll Learn

1

How to evolve machine learning architectures for fraud detection

2

Why feature engineering is critical in improving model performance

3

How to effectively communicate fraud decision explanations to users

Prerequisites & Requirements

Understanding of machine learning concepts and architectures
Experience with feature engineering in machine learning(optional)

Key Questions Answered

How does Stripe Radar assess the likelihood of fraud in transactions?

Stripe Radar evaluates over 1,000 characteristics of a transaction in under 100 milliseconds to determine its likelihood of being fraudulent. This rapid assessment allows legitimate transactions to proceed while blocking or diverting risky ones, achieving an incorrect block rate of only 0.1%.

What improvements were made to the machine learning architecture of Stripe Radar?

In mid-2022, Stripe Radar transitioned from a Wide & Deep model that included XGBoost to a pure deep neural network (DNN) model. This change resulted in faster training times, better scalability, and the ability to leverage advanced ML techniques, significantly enhancing model performance.

What role does feature engineering play in Stripe Radar's effectiveness?

Feature engineering is crucial for improving model performance in Stripe Radar. The team analyzes past fraud attempts to identify new features that can enhance the model's understanding of fraudulent behavior, leading to better detection and reduced false positives.

Why is explaining fraud decisions important for users of Stripe Radar?

Explaining fraud decisions is vital because it helps users understand why transactions are flagged or approved. This transparency reduces frustration and allows users to improve their data quality and customize fraud detection rules, ultimately supporting their business needs.

Key Statistics & Figures

Incorrect block rate

0.1%

This statistic reflects the accuracy of Stripe Radar in allowing legitimate transactions while blocking fraudulent ones.

Reduction in training time

over 85%

The transition to a DNN-only model reduced the training time to less than two hours, significantly improving the model's experimentation velocity.

Increase in training data

10x

Experiments showed that a tenfold increase in training transaction data led to significant improvements in model performance.

Technologies & Tools

Machine Learning

Xgboost

Initially used in the Wide & Deep model for its memorization capabilities.

Machine Learning

Deep Neural Network (dnn)

The current architecture of Stripe Radar, enhancing scalability and performance.

Key Actionable Insights

1
Continuously evolve your machine learning architecture to leverage advancements in technology.
As ML technology progresses, regularly assess your current architecture against new methods to ensure optimal performance and adaptability.

2
Implement robust feature engineering processes to enhance model accuracy.
By analyzing historical fraud patterns and user behaviors, you can identify valuable features that significantly improve your model's predictive capabilities.

3
Invest in clear communication tools for users regarding fraud decisions.
Providing users with insights into why transactions are flagged can improve their trust in the system and help them make informed decisions about data quality.

Common Pitfalls

1

Becoming complacent with existing machine learning architectures can hinder progress.

As technology evolves, sticking to outdated models can limit your ability to leverage new techniques that could significantly enhance performance.

Related Concepts

Machine Learning

Feature Engineering

Fraud Detection Techniques

Deep Learning Architectures