#
SQL Programming Tutorials & Engineering Articles
729 SQL tutorials, guides, and engineering insights from ClickHouse, NVIDIA, Uber, and more
Companies Using This
SQL Articles & Tutorials
Filter:
The article reviews Airbnb's significant advancements in research and technology throughout 2025, focusing on their participation in key conferences and the impactful papers presented.
Malay Haldar
12 min read
Has Summary
--
ClickHouse 26. 1 is a major release featuring 25 new features, 43 performance optimizations, and 176 bug fixes.
17 min read
Includes Code
Has Summary
--
OpenAI and Snowflake have announced a multi-year, $200 million partnership that integrates OpenAI's frontier AI models directly into Snowflake's data platform, including Snowflake Cortex AI and Sno...
OpenAI built a bespoke internal AI data agent powered by GPT-5.
Bonnie Xu
13 min read
Has Summary
--
This article demonstrates how to port a Matrix homeserver from traditional infrastructure (Synapse on VPS) to Cloudflare Workers, creating a serverless, zero-maintenance deployment with automatic p...
Nick Kuntz
9 min read
Includes Code
Has Summary
--
TRUSTBANK partnered with Recursive to build Choice AI, a multi-agent AI system powered by OpenAI's GPT-4. 1 series, to help users navigate Japan's Furusato Nozei (hometown tax donation) program.
OpenAI Team
6 min read
Has Summary
--
OpenAI details how they scaled PostgreSQL to support 800 million ChatGPT users, achieving millions of queries per second through a single-primary architecture with nearly 50 read replicas across mu...
Bohan Zhang
13 min read
Has Summary
--
The article discusses the NVIDIA Multi-Agent Intelligent Warehouse (MAIW), an AI command layer designed to enhance operational efficiency and supply chain intelligence in automated warehouses.
Tarik Hammadou
10 min read
Includes Code
Has Summary
--
The article discusses the importance of evaluations (evals) for AI agents, emphasizing how they help teams identify and resolve issues before they reach production.
The article discusses the development of chDB, a Python library that integrates ClickHouse with Pandas DataFrames for high-performance SQL querying.
The article reviews the significant developments and features introduced in ClickStack over its first seven months since launch, highlighting advancements such as JSON support, integration with Cli...
14 min read
Includes Code
Has Summary
--
The article discusses the need for improved observability in AI Site Reliability Engineering (SRE) rather than relying solely on larger models.
At ClickHouse, we don't like the word "impossible." We believe that with the right tools, everything is a data problem. To prove it, we decided to complete the 2025 Advent of Code unconventionally: using pure ClickHouse SQL.
The article reviews significant demo developments at ClickHouse in 2025, highlighting various applications that showcase its performance and capabilities.
The article discusses key benchmarks conducted over the past year that reveal insights into performance optimization in ClickHouse.
Learn how to build modular and reliable agentic applications using 8 effective multi-agent design patterns with the Agent Development Kit (ADK).
NVIDIA's Sirius, an open-source GPU-native SQL engine, has set a new performance record on ClickBench, enhancing DuckDB with GPU-accelerated analytics.
Xiangyao Yu
6 min read
Has Summary
--
The article discusses ClickHouse's journey of integrating Rust into its predominantly C++ codebase without undertaking a complete rewrite.
The article highlights Alexey's favorite features introduced in ClickHouse throughout 2025, including lightweight updates, data lake support, and advancements in text and vector indexing.
12 min read
Includes Code
Has Summary
--
Litestream VFS is a new SQLite plugin that enables querying databases directly from S3-compatible object storage without downloading the entire database.
The November 2025 edition of What's New in ClickStack highlights several new features and improvements in the open-source observability stack built for ClickHouse.
This article discusses the improvements made to MySQL cluster uptime at Uber through the implementation of MySQL Group Replication (MGR).
This article provides a detailed analysis of how the five major cloud data warehouses—Snowflake, Databricks, ClickHouse Cloud, Google BigQuery, and Amazon Redshift Serverless—calculate compute cost...
Tom Schreiber & Lionel Palacin
27 min read
Has Summary
--
The article discusses the integration of NVIDIA Nemotron RAG with Microsoft SQL Server 2025, showcasing how this collaboration enables the development of scalable AI applications on enterprise data.
The article details the journey of upgrading the chDB kernel from ClickHouse v25. 5 to v25. 8. 2.
The article discusses the development of StockHouse, a real-time market analytics application that leverages ClickHouse, Massive, and Perspective to handle high-frequency financial data.
Lionel Palacin
7 min read
Includes Code
Has Summary
--
The article discusses Uber's implementation of I/O observability for its massive petabyte-scale data lake, focusing on the challenges and solutions in monitoring data access patterns across its hyb...
Arnav Balyan, Kartik Bommepally, Amruth Sampath, Jing Zhao, Akshayaprakash Sharma
10 min read
Has Summary
--
This article details the transformation of ClickHouse's internal data warehouse from a traditional BI-first approach to an AI-first model, significantly enhancing user accessibility to analytics.
The article discusses the creation of a website for tracking team activity across GitHub repositories, initially intended as a single report but evolved into a comprehensive tool for comparing vari...
The October 2025 edition of What's New in ClickStack highlights significant updates to the open-source observability stack for ClickHouse, including the introduction of alerting features, customiza...
9 min read
Includes Code
Has Summary
--
ClickHouse version 25. 10 introduces significant enhancements, including 20 new features, 30 performance optimizations, and 103 bug fixes.
Thomas Ptacek argues that every developer should build an LLM agent to truly understand the technology, demonstrating through progressive Python code examples that a functional agent with tool use ...
The article reflects on a decade of AI platform development at Pinterest, detailing the evolution from fragmented machine learning stacks to a unified AI platform that supports various models.
AutoMLDockerEmbeddingGenerative AIJavaKubernetesLightGBMPySparkPythonPyTorchSeedSQLTensorFlowThriftTransformer
Pinterest Engineering
22 min read
Has Summary
--
This article discusses how to enhance log compression through log clustering techniques in ClickHouse, focusing on transforming unstructured logs into structured data for efficient storage.
This article explores the process of tracing OpenAI agents using ClickStack, demonstrating how to build an OpenAI agent that interacts with ClickHouse and visualizes the decision-making process.
The article discusses how Meta is scaling its Privacy Aware Infrastructure (PAI) to address privacy challenges in the era of Generative AI (GenAI) product innovation.
Rituraj Kirti
11 min read
Has Summary
--
The article discusses the potential of lakehouses using open table formats like Apache Iceberg and Delta Lake for observability, highlighting their advantages in scalability, cost-effectiveness, an...
Melvyn Peignon & Dale McDiarmid
24 min read
Includes Code
Has Summary
--
This article discusses the rebuilding of Uber's Apache Pinot™ query architecture, focusing on the transition from Neutrino to a new query system that utilizes Pinot's Multi-Stage Engine Lite Mode.
The article discusses the collaboration between IBM and NVIDIA to enhance large-scale data analytics through GPU-native Velox and NVIDIA cuDF, highlighting significant performance improvements over...
Gregory Kimball
7 min read
Has Summary
--
The article discusses practical security advice for Large Language Model (LLM) applications based on findings from the NVIDIA AI Red Team.
Rich Harang
7 min read
Includes Code
Has Summary
--
The article reflects on Cloudflare's 15-year journey and the initiatives launched during Birthday Week 2025, emphasizing their commitment to building a better Internet.
Nikita Cano
8 min read
Has Summary
--
This article discusses how Netflix built a resilient data platform using a Write-Ahead Log (WAL) to address data consistency, reliability, and operational efficiency challenges at scale.
The article discusses the relevance of the Common Vulnerabilities and Exposures (CVE) system in relation to AI models, arguing that CVEs should be focused on the frameworks and applications that ut...
Rich Harang
7 min read
Has Summary
--
The article announces the Cloudflare Data Platform, which includes three key products: Cloudflare Pipelines for data ingestion, R2 Data Catalog for managing metadata, and R2 SQL for querying data.
Uber's migration from Spark 2. 4 to Spark 3. 3 involved upgrading over 2 million Spark applications, utilizing innovative automation tools like Iron Dome.
Amruth Sampath, Arnav Balyan, Nimesh Khandelwal, Sumit Singh, Parth Halani, Suprit Acharya
8 min read
Has Summary
--
The article discusses how Palantir optimizes Elasticsearch to enhance its defensive capabilities against poor access patterns, particularly focusing on indexing refresh semantics.
Palantir
18 min read
Includes Code
Has Summary
--
The article discusses the rising costs associated with observability in software engineering and proposes a shift towards open, cost-efficient architectures.
Mike Shi
13 min read
Has Summary
--
Ramp built an internal AI agent called Ramp Research that serves as an agentic data analyst, answering data questions directly in Slack to eliminate the bottleneck of relying on a single on-call an...
Faiz Hilaly, Cesar Duran, Jay Sobel
5 min read
Has Summary
--