This post is the second in a series (Part 1) that addresses the challenges of training an accurate deep learning model using a large public dataset and…
Overview
This article is the second part of a series on building a real-time redaction application using NVIDIA DeepStream. It focuses on deploying a trained RetinaNet model on an NVIDIA Jetson AGX Xavier device to redact faces in real-time across multiple video streams.
What You'll Learn
How to deploy a trained ONNX model on an NVIDIA Jetson device using DeepStream SDK
Why using TensorRT is essential for low-latency inference in real-time applications
How to build a custom bounding box parser for RetinaNet in DeepStream
When to use a tracker to optimize inference performance in video analytics
Prerequisites & Requirements
- NVIDIA Jetson AGX Xavier device or any NVIDIA GPU
- DeepStream SDK for real-time video analytics
Key Questions Answered
How do you deploy a trained ONNX model using DeepStream SDK?
What are the performance metrics for the RetinaNet model on Jetson AGX Xavier?
What modifications are needed to create a DeepStream redaction app?
When should you use a tracker in a video analytics application?
Key Statistics & Figures
Technologies & Tools
Key Actionable Insights
1Optimize your video analytics application by implementing a tracker to reduce the computational load on the GPU.Using a tracker allows you to skip inference on certain frames, which can significantly improve performance, especially when processing multiple video streams.
2Experiment with different batch sizes when deploying your model to find the optimal performance for your specific hardware.The batch size can greatly affect inference speed and resource utilization, so testing various configurations can lead to improved application responsiveness.
3Utilize the DeepStream SDK's built-in features to streamline the development of your video analytics pipeline.DeepStream provides various tools and plugins that can help you efficiently build and deploy applications, reducing the time and effort needed for custom implementations.