Project RADAR: Intelligent Early Fraud Detection System with Humans in the Loop

Sergey Zelvenskiy, Garvit Harisinghani, Tiffany Yu, Edwin Ng, Robin Wei

Uber

•

Sergey Zelvenskiy, Garvit Harisinghani, Tiffany Yu, Edwin Ng, Robin Wei

•14 min read•advanced•

--

•View Original

ApacheApache SparkPySparkScalaSQL

Overview

The article discusses Project RADAR, an intelligent fraud detection system developed by Uber that integrates machine learning and human expertise to identify and mitigate fraudulent activities in real-time. It emphasizes the importance of explainability in decision-making and outlines the architecture and processes involved in the system's operation.

What You'll Learn

1

How to implement an intelligent fraud detection system using AI and human oversight

2

Why explainability is crucial in fraud detection systems

3

How to analyze time series data for fraud detection

4

When to engage risk analysts in the fraud detection process

5

How to utilize machine learning models for real-time fraud detection

Prerequisites & Requirements

Understanding of AI/ML concepts and fraud detection mechanisms
Familiarity with Apache Spark and data processing frameworks(optional)

Key Questions Answered

What is the purpose of RADAR in fraud detection?

RADAR is an AI-driven fraud detection system that monitors Uber's marketplace for fraudulent activities. It detects the onset of fraud attacks and generates rules to mitigate them, involving human analysts for review and approval to ensure accuracy and effectiveness.

How does RADAR utilize human expertise in its processes?

RADAR incorporates human analysts to review and approve the rules generated by the system. This human-in-the-loop approach ensures that decisions are explainable and that analysts can provide insights into new fraud patterns that automated systems may not catch.

What types of payment fraud does RADAR focus on?

RADAR primarily addresses two types of payment fraud: DNS (do not settle) fraud, where payments cannot be collected after service completion, and chargebacks, where users dispute transactions after payment has been made.

How does RADAR handle time series data for fraud detection?

RADAR analyzes time series data by defining order time (OT) and payment settlement maturity time (PSMT) to detect anomalies. It uses these dimensions to forecast potential fraud attacks and assess their severity over time.

Technologies & Tools

Data Processing

Apache Spark®

Used for processing data and generating time series for fraud detection.

Job Scheduling

Peloton

Helps schedule jobs and scale processing as needed.

Time Series Modeling

Orbit

An open-source package used for Bayesian time series modeling and forecasting.

Pattern Mining

Fp-growth

Used for discovering common patterns in categorical data related to fraud.

Key Actionable Insights

1
Integrate human oversight into your AI systems to enhance decision-making accuracy.
Human analysts can provide critical insights that automated systems may overlook, ensuring that fraud detection processes are more robust and adaptable to new threats.

2
Utilize time series analysis to monitor transaction patterns for signs of fraud.
By analyzing transaction data over time, organizations can identify anomalies that may indicate fraudulent activity, allowing for quicker responses and mitigation efforts.

3
Implement a feedback loop using tools like Jira to track fraud detection tasks.
This ensures that all anomalies are logged and reviewed, which helps refine detection algorithms and improves overall system performance.

4
Focus on explainability in AI-driven fraud detection systems.
Clear explanations of how decisions are made can help build trust with users and ensure compliance with regulatory standards.

5
Adopt a hybrid approach combining machine learning and traditional fraud detection methods.
This allows for the identification of both known and emerging fraud patterns, enhancing the effectiveness of the detection system.

Common Pitfalls

1

Over-reliance on automated systems can lead to missed fraud patterns.

Automated systems may not adapt quickly to new fraud tactics, making human oversight essential for identifying emerging threats.

2

Ignoring the importance of explainability in AI decisions.

Without clear explanations for decisions made by AI, users may lose trust in the system, leading to potential backlash against the organization.

Related Concepts

Machine Learning

Fraud Detection

Time Series Analysis

Human-in-the-loop Systems