Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice

Over 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate…

Ahmed Harouni
9 min readadvanced
--
View Original

Overview

The article discusses the NVIDIA VISTA-3D foundation model designed for segmenting organs and diseases in computed tomography (CT) images. It highlights the model's capabilities, deployment through NVIDIA NIM microservices, and practical implementation steps for users.

What You'll Learn

1

How to run the VISTA-3D NIM microservice on your data

2

Why using an FTP server is essential for large medical images

3

How to set up a Docker environment for running NIM microservices

Prerequisites & Requirements

  • Docker, Docker Compose, and NVIDIA drivers installed
  • Basic understanding of medical imaging and segmentation concepts(optional)

Key Questions Answered

What is the VISTA-3D model and its capabilities?
The VISTA-3D model is a foundation model developed by NVIDIA for segmenting full-body CT images. It is trained on over 12,000 volumes and can accurately segment more than 100 organs and various lesions, providing both automatic and interactive segmentation capabilities.
How can I test the VISTA-3D NIM microservice?
You can test the VISTA-3D NIM microservice by signing up for a personal key on the NVIDIA API Catalog, which provides 1000 free credits for trying out the microservices. After obtaining the key, you can run inference on sample data using provided code examples.
What are the steps to run VISTA-3D with my own data?
To run VISTA-3D with your own data, you need to set up an FTP server to host your medical images. The VISTA-3D NIM microservice will download the images from this server to perform inference and return the results.
What is the difference between LLM and VISTA-3D NIM microservices?
The main difference is that LLM NIM services accept text directly in the payload, while VISTA-3D requires a URL to the image hosted on an FTP server. This is due to the large size of medical images compared to text payloads.

Key Statistics & Figures

Number of CT scans performed globally
Over 300M
This statistic highlights the scale of computed tomography usage and the potential impact of improved segmentation models.
Number of CT scans performed in the US
85M
This figure emphasizes the demand for efficient and accurate medical imaging solutions in a major healthcare market.
Number of anatomical structures and lesions VISTA-3D is trained on
127 types
This extensive training dataset contributes to the model's accuracy and versatility in segmenting various medical conditions.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

AI/ML Model
Nvidia Vista-3d
Used for segmenting organs and diseases in CT images.
Microservice
Nvidia Nim
Facilitates the deployment and usage of the VISTA-3D model.
Containerization
Docker
Used to run the VISTA-3D NIM microservice locally.
File Transfer Protocol
Ftp
Required for hosting large medical images for inference.

Key Actionable Insights

1
Implementing the VISTA-3D NIM microservice can significantly enhance your medical imaging workflow by providing accurate organ and disease segmentation.
This is particularly beneficial for radiologists looking to improve report generation speed and accuracy, as the model is designed to handle complex cases efficiently.
2
Setting up an FTP server is crucial for managing large medical images when using the VISTA-3D model.
This approach allows for efficient data handling and ensures that the inference process is streamlined, avoiding issues related to large payloads.
3
Using Docker to run NIM microservices locally can provide flexibility and control over your environment.
This setup allows you to leverage your own hardware resources while ensuring that you can customize the deployment according to your specific needs.

Common Pitfalls

1
Failing to set up an FTP server can lead to issues when trying to run inference on large medical images.
Since medical images are typically large, they cannot be sent directly in the API payload. Without an FTP server, users may encounter errors or delays in processing.
2
Neglecting to secure the NGINX server can expose sensitive medical data.
It's crucial to limit access to the NGINX server to prevent unauthorized access, which can be achieved by modifying the Docker Compose configuration.

Related Concepts

Medical Imaging Techniques
AI In Healthcare
Segmentation Algorithms
Nvidia AI Solutions