ClickHouse Engineering Blog & Tech Articles
High-performance open-source columnar database management system designed for real-time analytics and data warehousing at scale
272 engineering articles, tutorials, and technical insights from ClickHouse's engineering team
Top Technologies
Latest Articles
Filter:
The article discusses how PeerDB facilitates large-scale PostgreSQL migrations, specifically achieving a 1TB migration in just 2 hours.
15 min read
Includes Code
Has Summary
--
The article discusses AI-powered migrations from PostgreSQL to ClickHouse, emphasizing the importance of safe deployment practices and the integration of AI tools to streamline the migration proces...
2 min read
Includes Code
Has Summary
--
The article discusses the pg_stat_ch extension for PostgreSQL, which facilitates the export of metrics to ClickHouse.
1 min read
Has Summary
--
The article discusses the evolving role of metrics in observability, emphasizing that while they remain important, their function is shifting towards being an optimization layer rather than the cor...
7 min read
Includes Code
Has Summary
--
ClickHouse 26. 1 is a major release featuring 25 new features, 43 performance optimizations, and 176 bug fixes.
17 min read
Includes Code
Has Summary
--
This article details how ClickPy, a free Python download statistics platform powered by ClickHouse, scaled to over 2 trillion rows by replacing its legacy cron-based ingestion pipeline with ClickPi...
8 min read
Includes Code
Has Summary
--
This article explains how ClickHouse optimizes Top-N queries (ORDER BY . LIMIT N) using granule-level data skipping indexes.
Tom Schreiber
10 min read
Includes Code
Has Summary
--
20 min read
Includes Code
Has Summary
--
ClickHouse 25. 9 introduces streaming secondary indices, a fundamental change to how secondary indexes (minmax, set, bloom filter, vector, text) are evaluated during query execution.
The article discusses the new functionality in ClickHouse that allows for multiple lightweight projections to behave like true secondary indexes, significantly enhancing query performance without d...
Tom Schreiber
7 min read
Includes Code
Has Summary
--
The article discusses the integration of Bindplane with ClickStack, focusing on how Bindplane facilitates the management of OpenTelemetry collectors at scale.
The Bindplane Team
4 min read
Has Summary
--
The article discusses the development of chDB, a Python library that integrates ClickHouse with Pandas DataFrames for high-performance SQL querying.
The article reviews the significant developments and features introduced in ClickStack over its first seven months since launch, highlighting advancements such as JSON support, integration with Cli...
14 min read
Includes Code
Has Summary
--
The article discusses the need for improved observability in AI Site Reliability Engineering (SRE) rather than relying solely on larger models.
At ClickHouse, we don't like the word "impossible." We believe that with the right tools, everything is a data problem. To prove it, we decided to complete the 2025 Advent of Code unconventionally: using pure ClickHouse SQL.
The article presents a curated list of technology blogs that focus on performance optimizations, algorithms, database development, and more, aimed at helping software engineers find valuable resour...
The article reviews significant demo developments at ClickHouse in 2025, highlighting various applications that showcase its performance and capabilities.
The article discusses key benchmarks conducted over the past year that reveal insights into performance optimization in ClickHouse.
The article discusses the implementation of a PostgreSQL setting (GUC) for the pg_clickhouse extension, focusing on optimizing the parsing of key/value pairs for session settings.
David Wheeler
10 min read
Includes Code
Has Summary
--
The article highlights new functions introduced in ClickHouse in 2025, detailing their usage and benefits for developers.
Mark Needham
10 min read
Includes Code
Has Summary
--
The article discusses the introduction of parallel merge for small GROUP BY in ClickHouse 25.
Jianfei Hu
7 min read
Includes Code
Has Summary
--
The article discusses ClickHouse's journey of integrating Rust into its predominantly C++ codebase without undertaking a complete rewrite.
The article highlights Alexey's favorite features introduced in ClickHouse throughout 2025, including lightweight updates, data lake support, and advancements in text and vector indexing.
12 min read
Includes Code
Has Summary
--
The November 2025 edition of What's New in ClickStack highlights several new features and improvements in the open-source observability stack built for ClickHouse.
ClickHouse version 25. 11 introduces significant enhancements, including 24 new features, 27 performance optimizations, and 97 bug fixes.
The ClickHouse Team
16 min read
Includes Code
Has Summary
--
This article benchmarks five major cloud data warehouses—Snowflake, Databricks, ClickHouse Cloud, BigQuery, and Redshift—across various scales of data to compare their cost-performance.
Tom Schreiber & Lionel Palacin
16 min read
Includes Code
Has Summary
--
This article provides a detailed analysis of how the five major cloud data warehouses—Snowflake, Databricks, ClickHouse Cloud, Google BigQuery, and Amazon Redshift Serverless—calculate compute cost...
Tom Schreiber & Lionel Palacin
27 min read
Has Summary
--
This article discusses the transition from OpenTelemetry (OTel) to Rotel, an open-source Rust project that enhances tracing capabilities at petabyte scale.
The article details the journey of upgrading the chDB kernel from ClickHouse v25. 5 to v25. 8. 2.
The article discusses the development of StockHouse, a real-time market analytics application that leverages ClickHouse, Massive, and Perspective to handle high-frequency financial data.
Lionel Palacin
7 min read
Includes Code
Has Summary
--
This article details the transformation of ClickHouse's internal data warehouse from a traditional BI-first approach to an AI-first model, significantly enhancing user accessibility to analytics.
ClickPipes for PostgreSQL has introduced support for failover replication slots, enhancing reliability and flexibility for users.
4 min read
Has Summary
--
The article discusses the creation of a website for tracking team activity across GitHub repositories, initially intended as a single report but evolved into a comprehensive tool for comparing vari...
The October 2025 edition of What's New in ClickStack highlights significant updates to the open-source observability stack for ClickHouse, including the introduction of alerting features, customiza...
9 min read
Includes Code
Has Summary
--
ClickHouse version 25. 10 introduces significant enhancements, including 20 new features, 30 performance optimizations, and 103 bug fixes.
The article discusses how weather data can be derived from airplane telemetry using ADS-B data.
Alexey Milovidov
7 min read
Includes Code
Has Summary
--
This article discusses how to enhance log compression through log clustering techniques in ClickHouse, focusing on transforming unstructured logs into structured data for efficient storage.
This article explores the process of tracing OpenAI agents using ClickStack, demonstrating how to build an OpenAI agent that interacts with ClickHouse and visualizes the decision-making process.
The article discusses the introduction of QBit, a new column type in ClickHouse that allows for flexible precision in vector search queries.
The article discusses how to efficiently compress Nginx logs using columnar storage, achieving over 170x compression by structuring raw logs into columns, optimizing data types, and ordering data e...
Lionel Palacin & Dale McDiarmid
17 min read
Includes Code
Has Summary
--
The article discusses the potential of lakehouses using open table formats like Apache Iceberg and Delta Lake for observability, highlighting their advantages in scalability, cost-effectiveness, an...
Melvyn Peignon & Dale McDiarmid
24 min read
Includes Code
Has Summary
--
The article discusses the new capabilities of ClickHouse Cloud to query Iceberg and Delta Lake tables through the DataLakeCatalog engine.
The article discusses the enhancements made to the JSON data type in ClickHouse v25. 8, which significantly improve performance and memory efficiency for analytical queries.
Pavel Kruglov
8 min read
Includes Code
Has Summary
--
ClickHouse Release 25. 9 introduces significant enhancements, including 25 new features, 22 performance optimizations, and 83 bug fixes.
The article discusses the development of a trace similarity system built on ClickHouse to analyze crash stack traces from a Continuous Integration (CI) system.
Misha Shiryaev
10 min read
Includes Code
Has Summary
--
The article discusses the rising costs associated with observability in software engineering and proposes a shift towards open, cost-efficient architectures.
Mike Shi
13 min read
Has Summary
--
This article serves as a practical guide for building ClickHouse-powered APIs in web applications, specifically using React and MooseStack.
This article discusses performance optimizations for ClickHouse on Intel's ultra-high core count processors, addressing challenges such as lock contention, memory management, and algorithm optimiza...
Jiebin Sun, Zhiguo Zhou, Wangyang Guo, Tianyou Li
25 min read
Includes Code
Has Summary
--
ClickHouse version 25. 8 introduces 45 new features, 47 performance optimizations, and 119 bug fixes, enhancing its capabilities as a high-performance analytical database.
ClickHouse Team
15 min read
Includes Code
Has Summary
--
This article discusses how to instrument a Next. js application using OpenTelemetry and ClickStack, focusing on the integration of observability and analytics through ClickHouse.