We're Cutting L40S Prices In Half

We just lowered the prices on NVIDIA L40s GPUs to $1.25 per hour. Why? Because our feet are cold and we burn processor cycles for heat. But also other reasons. Let’s back up. We offer 4 different NVIDIA GPU models; in increasing order of performanc

Kurt Mackey
6 min readintermediate
--
View Original

Overview

Fly.io has announced a significant price reduction for their NVIDIA L40S GPUs, now available at $1.25 per hour. This move aims to cater to the increasing demand for GPU-accelerated AI workloads, particularly for inference tasks, while also addressing customer preferences and market dynamics.

What You'll Learn

1

How to leverage L40S GPUs for AI workloads on Fly.io

2

Why A10 GPUs are preferred for inference tasks over more powerful models

3

When to choose L40S GPUs for specific AI applications

Key Questions Answered

What are the benefits of using L40S GPUs for AI workloads?
L40S GPUs provide AI-optimized performance comparable to A100 GPUs while being priced affordably at $1.25 per hour. This makes them suitable for various AI applications, including inference tasks, without the need for higher-end GPUs that may be more costly and less accessible.
Why are A10 GPUs more popular than expected?
The A10 GPUs are favored by users for their capability to handle random inference tasks effectively, despite being older and less powerful. Their affordability and sufficient performance for mid-sized generative AI workloads make them a preferred choice among customers.
How does Fly.io's pricing strategy impact GPU usage?
Fly.io's pricing strategy, which includes halving the cost of L40S GPUs to $1.25 per hour, encourages users to adopt these GPUs for AI workloads. This strategy aims to attract customers who require efficient and cost-effective solutions for their GPU-accelerated tasks.
What specific applications can be run on L40S GPUs?
L40S GPUs can be used for various applications, including running Llama 3.1 for LLM jobs, Flux for generative AI images, Whisper for automated speech recognition, and even gaming applications like DOOM Eternal, showcasing their versatility in handling different workloads.

Key Statistics & Figures

New price for L40S GPUs
$1.25 per hour
This price reduction aims to attract more users to Fly.io's GPU offerings.
Popularity of A10 GPUs
Most popular GPU in inventory
Despite being the least capable GPU offered, the A10 has become the most sought-after model among users.

Technologies & Tools

Hardware
Nvidia L40s
Used for AI-optimized workloads on Fly.io
Hardware
Nvidia A10
Popular choice for random inference tasks

Key Actionable Insights

1
Consider using L40S GPUs for your next AI project to benefit from their cost-effective pricing and performance.
With the L40S priced at $1.25 per hour, it provides an excellent opportunity for developers to run demanding AI workloads without incurring high costs.
2
Evaluate the specific needs of your AI applications to determine whether A10 or L40S GPUs are more suitable.
Understanding the differences in performance and pricing can help you make informed decisions that align with your project requirements.
3
Leverage Fly.io's infrastructure to optimize the deployment of your AI models.
By utilizing Fly.io's fast networking and object storage, you can enhance the performance of your AI applications significantly.

Common Pitfalls

1
Assuming that higher-end GPUs are always necessary for AI workloads.
Many users overlook the fact that older, less powerful GPUs like the A10 can effectively handle specific tasks, particularly inference, which may not require the latest technology.

Related Concepts

GPU Pricing Strategies
AI Workload Optimization
Inference Vs. Training Workloads