Overview
ClickHouse Release 23.5 introduces 29 new features, 22 performance optimizations, and 85 bug fixes, enhancing the overall functionality and performance of the database. Key highlights include the new Azure Blob Storage table function, improvements to ClickHouse Keeper, and faster Parquet file reading capabilities.
What You'll Learn
1
How to query Azure Blob Storage using the new azureBlobStorage function
2
Why ClickHouse Keeper is essential for distributed systems
3
How to utilize the keeper-client for managing ClickHouse Keeper
4
When to use the new Parquet reading optimizations for better performance
Key Questions Answered
What new features are introduced in ClickHouse Release 23.5?
ClickHouse Release 23.5 introduces 29 new features, including the azureBlobStorage function for querying Azure Blob Storage, improvements to ClickHouse Keeper, and enhanced Parquet file reading capabilities. Additionally, it includes 22 performance optimizations and 85 bug fixes.
How does the azureBlobStorage function work?
The azureBlobStorage function allows users to query files in Azure Blob Storage by specifying a connection string, container, and blob path. It supports various file formats and can be used for both querying and exporting data.
What improvements have been made to Parquet file reading in ClickHouse 23.5?
In ClickHouse 23.5, Parquet file reading has been optimized for speed by allowing out-of-order reading and reducing mutex locks, resulting in faster query execution times. For example, a query that took 0.367 seconds in version 23.4 now takes only 0.240 seconds.
What is ClickHouse Keeper and why is it important?
ClickHouse Keeper is a component that provides strongly consistent storage for cluster coordination in ClickHouse, essential for features like data replication and service discovery. It is now production ready and supports large-scale deployments.
Key Statistics & Figures
New features
29
Total new features introduced in ClickHouse Release 23.5.
Performance optimizations
22
Total performance optimizations included in the release.
Bug fixes
85
Total bug fixes addressed in this release.
Query execution time reduction
0.367 seconds to 0.240 seconds
Improvement in query execution time for reading Parquet files from version 23.4 to 23.5.
Technologies & Tools
Some links below are affiliate links. We may earn a commission if you make a purchase.
Database
Clickhouse
Used as the primary database system for analytics and data storage.
Cloud Storage
Azure Blob Storage
Used for storing and querying data in the new azureBlobStorage function.
Key Actionable Insights
1Leverage the new azureBlobStorage function to streamline data queries from Azure Blob Storage, which can enhance your data ingestion processes.This function simplifies the querying of data stored in Azure, making it easier to integrate cloud storage with ClickHouse for analytics.
2Utilize the keeper-client tool for easier management and introspection of ClickHouse Keeper, improving your administrative workflow.This tool simplifies interactions with ClickHouse Keeper, making it more user-friendly for managing cluster coordination tasks.
3Take advantage of the performance improvements in Parquet file reading to optimize your data queries, especially for large datasets.These enhancements can significantly reduce query execution times, making your analytics more efficient.
Common Pitfalls
1
Failing to properly configure the connection string and parameters when using the azureBlobStorage function can lead to query errors.
Ensure that all required parameters are correctly specified to avoid issues when querying data from Azure Blob Storage.
Related Concepts
Azure Blob Storage
Clickhouse Keeper
Parquet File Format
Distributed Systems