Speech AI Spotlight: How Pendulum Nabs Harmful Narratives Online

David Taubenheim

Over 55% of the global population uses social media, easily sharing online content with just one click. While connecting with others and consuming entertaining…

NVIDIA

•

David Taubenheim

•7 min read•intermediate•

--

•View Original

AWSDeep LearningHelm

Overview

The article discusses how Pendulum leverages AI and natural language processing to detect and mitigate harmful narratives online, particularly on social media platforms. It highlights the importance of early detection in preventing the spread of misinformation and outlines the technological advancements that enable Pendulum's capabilities.

What You'll Learn

1

How to utilize NVIDIA Riva for accurate speech recognition

2

Why early detection of harmful narratives is crucial for online reputation management

3

How to implement a hybrid, zero-shot learning algorithm for narrative detection

Prerequisites & Requirements

Basic understanding of speech AI and natural language processing concepts(optional)
Familiarity with NVIDIA Riva and AWS services(optional)

Key Questions Answered

How does Pendulum's platform detect harmful narratives online?

Pendulum's platform uses accelerated speech AI and natural language processing to analyze vast amounts of media content. It employs the Intelligence Explorer and Narrative Engine to perform deep searches, identifying harmful narratives across various platforms by transcribing and analyzing audio and video content.

What types of falsehoods has Pendulum identified?

Pendulum has identified various harmful narratives, including false information about celebrities, threats against employees, COVID-19 vaccine disinformation, and conspiracies related to the war in Ukraine. These narratives can spread rapidly across social media, making detection critical.

What technology does Pendulum use for speech recognition?

Pendulum utilizes NVIDIA Riva Enterprise for speech recognition, which offers high accuracy and throughput. This technology allows for efficient transcription of audio and video content, enabling the detection of harmful narratives in real-time.

What challenges does Pendulum face in narrative detection?

Pendulum faces challenges such as the presence of non-speech elements in videos that can obscure narratives and the need to accurately transcribe content amidst distracting advertisements. These factors complicate the detection process and require ongoing technical improvements.

Key Statistics & Figures

Videos identified supporting false COVID vaccine narrative

3,360

These videos accounted for 38 million views, highlighting the scale of misinformation present online.

Videos still available on the platform

1,600

These videos have accumulated 16 million views as of the article's writing.

Detection precision

0.74

This indicates that 74% of snippets predicted to support harmful narratives indeed do.

Recall rate

0.83

This means that 83% of snippets supporting the narrative were successfully identified by Pendulum's methods.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Backend

Nvidia Riva

Used for speech recognition to transcribe audio and video content efficiently.

Cloud

AWS

Provides the infrastructure for scaling Pendulum's services and processing large volumes of media.

Backend

Nvidia Triton Inference Server

Handles multiple requests against Pendulum's AI models to optimize processing.

Key Actionable Insights

1
Implementing a robust speech recognition system like NVIDIA Riva can significantly enhance your ability to analyze audio and video content for harmful narratives.
This is particularly relevant for organizations looking to protect their online reputation and respond swiftly to misinformation.

2
Utilizing AI to automate the detection of harmful narratives can save time and resources, allowing teams to focus on strategic responses.
As misinformation spreads rapidly, having automated tools can help organizations stay ahead of potential threats.

3
Adopting a hybrid, zero-shot learning algorithm can improve the precision and recall of narrative detection efforts.
This approach allows for effective identification of harmful content without the need for extensive labeled datasets.

Common Pitfalls

1

Failing to account for non-speech elements in videos can lead to incomplete narrative detection.

This happens because automatic speech recognition relies solely on transcribed speech, missing critical context provided by visuals or music.

2

Over-reliance on metadata instead of raw content can limit the effectiveness of narrative detection.

Many approaches focus on metadata, which may not capture the full scope of harmful narratives present in the actual content.

Related Concepts

Natural Language Processing

Speech Recognition

Misinformation Management

AI In Social Media