Recommend API

Katrina Ni

Slack, as a product, presents many opportunities for recommendation, where we can make suggestions to simplify the user experience and make it more delightful. Each one seems like a terrific use case for machine learning, but it isn’t realistic for us to create a bespoke solution for each. Instead, we developed a unified framework we…

Slack

•

Katrina Ni

•13 min read•intermediate•

--

•View Original

ChefJenkinsKubernetesMachine LearningPrometheusPythonSHAPXGBoost

Overview

The article discusses the Recommend API developed by Slack, which serves as a unified framework for generating recommendations using machine learning. It highlights the infrastructure behind the API, its various applications within Slack, and the importance of MLOps in deploying and maintaining machine learning models effectively.

What You'll Learn

1

How to implement a unified recommendation system using the Recommend API

2

Why MLOps is crucial for deploying machine learning models in production

3

When to apply machine learning for user experience improvements in products

Prerequisites & Requirements

Understanding of machine learning concepts and MLOps
Familiarity with data processing tools like Airflow(optional)

Key Questions Answered

What is the purpose of the Recommend API in Slack?

The Recommend API serves as a unified framework that allows Slack to quickly bootstrap new recommendation use cases, leveraging a common infrastructure for data processing, model training, candidate generation, and monitoring. This enables improved customer experiences across various contexts within the product.

How does Slack ensure data privacy while training ML models?

Slack ensures data privacy by de-identifying training data, using numeric IDs instead of names, and relying on metadata rather than actual content for most features. This approach prevents any risk of leaking customer data while still allowing effective model training.

What metrics does Slack monitor to evaluate ML model performance?

Slack tracks various metrics including reliability metrics from the backend, efficiency metrics from the model serving service, and online metrics such as clickthrough rates and discounted cumulative gain. These metrics help ensure the models are performing well in production.

What challenges does Slack face with cold start problems in recommendations?

The cold start problem occurs when there is insufficient data to generate recommendations for new users. Slack addresses this by initially using hand-tuned models based on heuristics to provide recommendations until sufficient interaction data is gathered for model training.

Key Statistics & Figures

Percentage of corporate AI initiatives struggling to move beyond test stages

up to 88%

This statistic highlights the challenges many companies face in successfully implementing AI solutions.

Increase in clickthrough rate for Composer DMs after migrating to a logistic regression model

+38.86%

This improvement demonstrates the effectiveness of using machine learning models over hand-tuned approaches.

Increase in clickthrough rate for Slackbot channel suggestions after migrating to an XGBoost classification model

+123.57% for leave and +31.92% for archive suggestions

These metrics illustrate the significant impact that advanced machine learning models can have on user engagement.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Data Processing

Airflow

Used for scheduling tasks to process data sequentially.

Container Orchestration

Kubernetes

Used for running jobs and serving machine learning models.

Machine Learning

Xgboost

Experimented with for ranking models in the recommendation system.

Machine Learning

Logistic Regression

Used as a model for improving recommendations in various features.

Key Actionable Insights

1
Implementing a unified recommendation system can streamline the development process for new features across products.
By using a common framework like the Recommend API, teams can avoid duplicating efforts and reduce the time to market for new recommendation features.

2
Monitoring metrics is essential for maintaining the performance of machine learning models in production.
Regularly tracking metrics such as clickthrough rates and latency can help identify issues early and ensure that the models continue to meet user needs.

3
De-identifying training data is crucial for maintaining user privacy while still leveraging machine learning.
By using numeric IDs and focusing on metadata, Slack can develop effective models without compromising customer data security.

Common Pitfalls

1

Neglecting to monitor the performance of machine learning models can lead to degraded user experiences.

Without regular monitoring, issues such as increased latency or reduced accuracy may go unnoticed, negatively impacting user engagement and satisfaction.

2

Failing to de-identify training data can result in privacy violations.

Using identifiable information in training data poses risks of data leaks, which can lead to legal and reputational consequences for organizations.

Related Concepts

Mlops

Machine Learning

Data Privacy

Recommendation Systems