ClickHouse Release 24.12

The ClickHouse Team
16 min readintermediate
--
View Original

Overview

ClickHouse version 24.12 introduces 16 new features, 16 performance optimizations, and 36 bug fixes, enhancing usability and performance for users. Key improvements include Enum usability enhancements, support for JSON subcolumns as primary keys, and automatic JOIN reordering.

What You'll Learn

1

How to utilize JSON subcolumns as primary keys in ClickHouse

2

Why automatic JOIN reordering can improve query performance

3

When to use Enum usability improvements for better query handling

Key Questions Answered

What are the new features introduced in ClickHouse version 24.12?
ClickHouse version 24.12 introduces 16 new features, including Enum usability improvements, Iceberg REST catalog support, reverse table ordering, and the ability to use JSON subcolumns as primary keys. These enhancements aim to improve usability and performance for users.
How does automatic JOIN reordering enhance performance in ClickHouse?
Automatic JOIN reordering in ClickHouse optimizes query execution by determining the most efficient order for JOIN operations based on table sizes. This feature can significantly reduce memory usage and execution time, making queries faster and more efficient.
What performance improvements are seen with the new parallel hash join?
The new parallel hash join in ClickHouse can process 765.04 million rows in just 5.099 seconds, compared to 38.305 seconds with the previous default hash join strategy. This represents an approximately 8-fold performance improvement, showcasing the efficiency of the new join strategy.

Key Statistics & Figures

Number of new features
16
Included in ClickHouse version 24.12
Performance improvement with parallel hash join
8 times faster
Compared to the previous default hash join strategy
Number of bug fixes
36
Included in ClickHouse version 24.12

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Key Actionable Insights

1
Leverage the new JSON subcolumns feature to optimize data storage and retrieval in ClickHouse.
Using JSON subcolumns as primary keys can enhance query performance and data compression, making it a valuable strategy for applications dealing with large JSON datasets.
2
Implement automatic JOIN reordering to improve query execution times.
By allowing ClickHouse to automatically determine the best order for JOIN operations, you can achieve significant performance gains, especially in complex queries involving large datasets.
3
Utilize Enum usability improvements to simplify query writing and reduce errors.
With enhanced support for Enum types, queries that involve conditions on Enum fields can be written more intuitively, reducing the likelihood of encountering type-related errors.

Common Pitfalls

1
Failing to utilize the new Enum usability improvements can lead to query errors.
Without leveraging the new capabilities, users may encounter type-related exceptions that could have been avoided with the updated functionality.
2
Not using automatic JOIN reordering may result in inefficient queries.
By neglecting to enable this feature, users might miss out on significant performance improvements, especially in complex queries with multiple JOINs.

Related Concepts

JSON Data Handling
Data Compression Techniques
Performance Optimization Strategies