JVM Profiler: An Open Source Tool for Tracing Distributed JVM Applications at Scale

Bo Yang, Nan Zhu, Felix Cheung, Xu Ning

Uber

•

Bo Yang, Nan Zhu, Felix Cheung, Xu Ning

•9 min read•advanced•

--

•View Original

ApacheApache KafkaApache SparkJavaMySQLSQLTransformer

Overview

The article discusses the JVM Profiler, an open-source tool developed by Uber for tracing distributed JVM applications at scale. It highlights the challenges of profiling in a distributed environment and explains how the JVM Profiler addresses these challenges by providing fine-grained insights into resource usage and performance metrics.

What You'll Learn

1

How to use the JVM Profiler to collect performance metrics from Java applications

2

Why profiling is essential for optimizing resource usage in distributed systems

3

How to integrate JVM Profiler metrics with data infrastructure tools like Kafka and Hive

Prerequisites & Requirements

Understanding of JVM and distributed systems concepts
Familiarity with Apache Kafka and Apache Hive(optional)

Key Questions Answered

What are the main features of the JVM Profiler?

The JVM Profiler includes a Java agent for collecting metrics, advanced profiling capabilities to trace Java methods and arguments, and data analytics reporting to send metrics to systems like Kafka and Hive for analysis.

How does the JVM Profiler improve resource allocation for Spark applications?

By using memory metrics from the JVM Profiler, Uber tracks actual memory usage for each executor, allowing them to set the proper value for the Spark 'executor-memory' argument, leading to optimized resource allocation.

What challenges does the JVM Profiler address in distributed environments?

The JVM Profiler addresses challenges such as correlating metrics across multiple processes and making metrics collection non-intrusive, enabling automatic profiling without modifying user code.

How can metrics from the JVM Profiler be utilized for data analysis?

Metrics collected by the JVM Profiler can be sent to Kafka topics and ingested into HDFS, allowing users to query the data using Hive or Spark, facilitating cluster-wide data analysis.

Key Statistics & Figures

Memory allocation reduction per executor

2GB

This reduction was achieved for one of Uber's largest Spark applications, saving a total of 2TB of memory.

Percentage of applications using less than 80% of allocated memory

70%

This finding indicates potential for further memory optimization across Uber's applications.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Data Streaming

Apache Kafka

Used for sending metrics collected by the JVM Profiler for further analysis.

Data Warehousing

Apache Hive

Used for querying metrics ingested from the JVM Profiler.

Data Processing

Apache Spark

The JVM Profiler is primarily designed to profile Spark applications.

Key Actionable Insights

1
Utilize the JVM Profiler to monitor HDFS NameNode RPC latency to identify performance bottlenecks in Spark applications.
This insight allows developers to pinpoint slow method calls and optimize their applications, enhancing overall performance.

2
Integrate JVM Profiler metrics with Apache Kafka for real-time data analysis and monitoring.
This integration enables teams to leverage metrics for immediate insights, improving decision-making and operational efficiency.

3
Use the JVM Profiler to dynamically inject profiling code into Java methods without altering the original source code.
This capability allows for non-intrusive performance monitoring, making it easier to adapt to changing application requirements.

Common Pitfalls

1

Failing to automatically launch the profiler with each process can lead to missed metrics.

In a distributed environment, processes can start and stop dynamically, so it's essential to ensure the profiler is integrated into the startup process to capture all relevant data.

2

Modifying user code to collect metrics can introduce errors and increase maintenance overhead.

Using non-intrusive profiling methods, such as those provided by the JVM Profiler, helps avoid these issues and allows for easier updates and changes in the application.

Related Concepts

Distributed Systems

Performance Profiling

Resource Optimization

Java Virtual Machine (jvm)