Simulate Real-World Data Centers in the Cloud with NVIDIA Air

The advent of AI has introduced a new type of data center, the AI factory, purpose-built from the ground up to handle AI workloads.

Sophia Schuur
6 min readintermediate
--
View Original

Overview

The article introduces NVIDIA Air, a cloud-based platform designed for simulating real-world data center environments, specifically tailored for AI workloads. It emphasizes the ability to create digital twins of network infrastructure, enabling organizations to accelerate their AI initiatives without the need for physical hardware.

What You'll Learn

1

How to build digital twins of network infrastructure using NVIDIA Air

2

Why NVIDIA Air eliminates the need for a hypervisor in data center simulations

3

How to utilize the drag-and-drop builder for custom topologies in NVIDIA Air

Key Questions Answered

What is NVIDIA Air and how does it benefit organizations?
NVIDIA Air is a cloud-based platform that allows organizations to simulate real-world data center environments. It provides tools for modeling network infrastructure, enabling faster testing and validation of configurations, which accelerates time to AI and enhances return on investment.
How can users create simulations in NVIDIA Air?
Users can create simulations in NVIDIA Air using the Demo Marketplace, which offers prebuilt labs, or through a drag-and-drop builder that allows for fully custom topologies. This flexibility enables users to explore various configurations and learn from existing setups.
What types of network operating systems can be used with NVIDIA Air?
NVIDIA Air supports various network operating systems, including NVIDIA Cumulus and SONiC, and allows users to bring their own operating systems into the platform. This versatility helps users tailor their simulations to specific needs.
What are the benefits of using the drag-and-drop builder in NVIDIA Air?
The drag-and-drop builder in NVIDIA Air simplifies the process of creating custom network topologies. Users can easily add servers and switches, configure their properties, and connect them, making it accessible for users to design complex simulations without extensive technical knowledge.

Technologies & Tools

Cloud-based Simulation Platform
Nvidia Air
Used for simulating real-world data center environments and modeling network infrastructure.
Network Operating System
Nvidia Cumulus
One of the supported operating systems for switches in NVIDIA Air.
Network Operating System
Sonic
Another supported operating system for switches in NVIDIA Air.

Key Actionable Insights

1
Leverage the Demo Marketplace to quickly spin up simulations for learning and experimentation.
The Demo Marketplace provides prebuilt labs that users can launch instantly, allowing them to explore various configurations and learn from established setups without starting from scratch.
2
Utilize the drag-and-drop builder for creating tailored network topologies that meet specific project requirements.
This feature allows users to customize their simulations easily, enabling them to experiment with different configurations and optimize their network designs for performance.
3
Enable the out-of-band management network for easier configuration and management of nodes.
By enabling this feature, users can connect nodes more efficiently and manage configurations via SSH, streamlining the setup process and enhancing control over the simulation environment.

Common Pitfalls

1
Failing to enable the out-of-band management network can complicate node configuration.
Without this setting, users may find it challenging to manage their nodes effectively, especially when needing to transfer files or scripts for configuration.