NVIDIA Deep Learning Institute Offers Multilingual AI Training at GTC Paris

Rita Fernandes Neves

Large language models (LLMs) are capable of recognizing, summarizing, translating, predicting, and generating content. Yet even the most powerful LLMs face…

NVIDIA

•

Rita Fernandes Neves

•6 min read•intermediate•

--

•View Original

Deep Learning

Overview

NVIDIA Deep Learning Institute is offering a hands-on workshop at GTC Paris focused on enhancing large language models (LLMs) for multilingual applications. The workshop aims to equip developers with the skills to adapt open source LLMs for specialized domains and diverse languages, addressing the limitations of current multilingual models.

What You'll Learn

1

How to build custom evaluation benchmarks using NVIDIA NeMo Evaluator

2

How to implement advanced data cleaning and preparation pipelines with NeMo Curator

3

How to apply advanced distillation, quantization, and pruning techniques for model optimization

Prerequisites & Requirements

Understanding of large language models and multilingual AI concepts
Familiarity with NVIDIA NeMo tools(optional)

Key Questions Answered

What are the limitations of current multilingual large language models?

Current multilingual LLMs often lack accuracy and cultural nuance, primarily because they are predominantly trained on English data, with models like Llama 2 using less than 5% of non-English language data. This can lead to misleading claims about their multilingual capabilities and costly deployment issues.

What skills will be developed in the Adding New Knowledge to LLMs workshop?

Participants will learn to evaluate LLMs, curate data, inject targeted knowledge, and optimize models for domain-specific and multilingual applications. This includes utilizing tools like NVIDIA NeMo Evaluator and Curator to enhance model performance across various languages.

How does NVIDIA DLI address challenges in multilingual AI?

The NVIDIA DLI workshop focuses on overcoming challenges such as fragmented benchmarks, translation artifacts, and task imbalances by teaching systematic evaluation, advanced data curation, and targeted knowledge injection, thereby improving model accuracy and usability in diverse contexts.

Key Statistics & Figures

Percentage of non-English language data in Llama 2 training

less than 5%

This statistic highlights the significant gap in multilingual training data for many LLMs.

Percentage of English data in Llama 2 training

89.7%

This indicates the overwhelming dominance of English in the training datasets, which affects the model's multilingual capabilities.

Technologies & Tools

Tool

Nvidia Nemo Evaluator

Used for building custom evaluation benchmarks to assess LLM limitations.

Tool

Nvidia Nemo Curator

Facilitates advanced data cleaning and preparation for multilingual datasets.

Tool

Nvidia Nemo Model Optimizer

Used for applying optimization techniques like distillation and quantization.

Tool

Nvidia Tensorrt-llm

Enhances model inference efficiency while maintaining performance.

Key Actionable Insights

1
Developing custom evaluation benchmarks is crucial for understanding the limitations of LLMs in specialized domains.
By using tools like NVIDIA NeMo Evaluator, developers can track progress and define metrics that truly reflect the performance needed for specific applications, ensuring better deployment outcomes.

2
Implementing advanced data cleaning and preparation pipelines can significantly enhance the quality of training datasets.
Utilizing NVIDIA NeMo Curator allows developers to create high-quality datasets that address the complexities of multilingual content, which is essential for training effective AI models.

3
Optimizing models through techniques like distillation and quantization can reduce operational costs while maintaining performance.
This is particularly important for deploying AI systems in resource-constrained environments, ensuring that models remain efficient and effective across all targeted languages.

Common Pitfalls

1

Relying on surface-level metrics like BLEU and ROUGE can lead to misleading evaluations of model performance.

These metrics often penalize valid variations in word order and may not accurately reflect the model's true capabilities, leading to suboptimal deployment decisions.

2

Assuming that a model labeled as multilingual is effective across all languages without rigorous testing.

This can result in costly deployment issues, especially in regions where users prefer to interact in their native languages.

Related Concepts

Multilingual AI

Large Language Models

Data Curation Techniques

Model Optimization Strategies