#
AWS EC2 Programming Tutorials & Engineering Articles
65 AWS EC2 tutorials, guides, and engineering insights from Netflix, ClickHouse, NVIDIA, and more
Companies Using This
AWS EC2 Articles & Tutorials
Filter:
This article delves into the development and implementation of minions, Stripe's unattended coding agents, which autonomously produce pull requests without human-written code.
20 min read
Includes Code
Has Summary
--
ClickHouse 25. 9 introduces streaming secondary indices, a fundamental change to how secondary indexes (minmax, set, bloom filter, vector, text) are evaluated during query execution.
The article discusses the development of chDB, a Python library that integrates ClickHouse with Pandas DataFrames for high-performance SQL querying.
This article discusses the transition from OpenTelemetry (OTel) to Rotel, an open-source Rust project that enhances tracing capabilities at petabyte scale.
ClickHouse version 25. 10 introduces significant enhancements, including 20 new features, 30 performance optimizations, and 103 bug fixes.
The article discusses the security risks associated with AI-driven applications that generate and execute code autonomously.
ClickHouse Release 25. 9 introduces significant enhancements, including 25 new features, 22 performance optimizations, and 83 bug fixes.
This article discusses the implementation of Change Data Capture (CDC) from Delta Lake to ClickHouse, detailing the architecture, components, and a reference implementation in Python.
The article discusses how ClickHouse efficiently queries Parquet files, a key storage format for Lakehouse architectures, without requiring data ingestion.
The article discusses the shift in user demographics on Fly. io's platform, highlighting how robots have become the primary drivers of growth.
Kurt Mackey
10 min read
Includes Code
Has Summary
--
This article explores the various input formats supported by ClickHouse for data ingestion, focusing on performance and efficiency.
ClickHouse version 25. 1 introduces significant enhancements, including 15 new features, 36 performance optimizations, and 77 bug fixes.
The article discusses the One Billion Documents JSON Challenge, comparing the performance of ClickHouse against other popular databases like MongoDB, Elasticsearch, DuckDB, and PostgreSQL in storin...
Tom Schreiber
33 min read
Includes Code
Has Summary
--
ClickHouse version 24. 12 introduces 16 new features, 16 performance optimizations, and 36 bug fixes, enhancing usability and performance for users.
Montai Therapeutics is leveraging NVIDIA BioNeMo to develop a multimodal AI platform for drug discovery, focusing on Anthromolecule chemistry.
Vega Shah
4 min read
Has Summary
--
The article discusses how Petrobras leverages the NVIDIA Grace CPU to enhance the performance of linear solvers used in reservoir simulation, achieving significant improvements in time-to-solution,...
The article discusses Pinterest's transition from HBase, its first NoSQL datastore, to a new serving architecture with a unified storage service.
The article explores the significant contributions of women in the Data Engineering team at Slack, highlighting their roles in managing complex data systems and fostering a diverse work culture.
The article discusses Slack's migration from AWS Instance Metadata Service version 1 (IMDSv1) to version 2 (IMDSv2), emphasizing the security enhancements and challenges faced during the transition.
Archie Gunasekara
13 min read
Includes Code
Has Summary
--
This article discusses Pinterest's implementation of a finer-grained access control (FGAC) framework to manage data access securely and efficiently within their data engineering platform.
The article details the construction of ClickHouse's Internal Data Warehouse (DWH), emphasizing its architecture, data sources, and operational strategies.
This article provides a comprehensive guide on deploying NVIDIA Riva for speech AI applications using Kubernetes, focusing on autoscaling and load balancing techniques.
Maggie Zhang
13 min read
Includes Code
Has Summary
--
This article discusses the challenges faced by Netflix when migrating a Java microservice to a larger AWS instance, which unexpectedly resulted in suboptimal performance.
Netflix Technology Blog
11 min read
Has Summary
--
The article discusses how Slack utilizes Terraform for managing its infrastructure across multiple cloud providers, including AWS, DigitalOcean, NS1, and GCP.
The article provides an insightful glimpse into the daily routine of Georgi Knox, a Senior Cloud Engineer at Slack Australia.
Georgi Knox
10 min read
Has Summary
--
The article discusses the advantages of utilizing ephemeral compute in Kubernetes environments, particularly through the implementation of short-lived nodes in the Rubix platform.
Palantir
8 min read
Has Summary
--
The article discusses BuildRock, Slack's new build platform designed to enhance the efficiency and safety of code deployment.
Joel Bartlett
13 min read
Has Summary
--
The article discusses AutoTransform, an open-source framework developed by Slack to automate the maintenance, modification, and upgrading of large codebases.
The article discusses the transition to remote development environments at Slack, highlighting the challenges faced with local setups and the benefits of using AWS EC2 instances for development.
The article discusses the implementation of background effects, specifically background blur and background image replacement, for Slack Clips, utilizing web technologies like WebGL and WebAssembly...
Albert Xing
8 min read
Has Summary
--
The article discusses the advantages of using remote ephemeral workspaces to enhance developer productivity at Palantir.
Palantir
10 min read
Has Summary
--
The article discusses how Airbnb has implemented dynamic Kubernetes cluster scaling to optimize cloud spending in response to fluctuating traffic demands.
David Morrison
11 min read
Includes Code
Has Summary
--
This article details Pinterest's experience upgrading their Batch Processing Platform, Monarch, from Hadoop 2. 7. 1 to Hadoop 2. 10. 0.
The article discusses the development of a unified PubSub client library at Pinterest, aimed at improving the scalability, stability, and developer velocity of the Logging Platform.
Pinterest Engineering
12 min read
Includes Code
Has Summary
--
The article discusses Pinterest's transition to executing end-to-end UI tests before code submissions, emphasizing the benefits of shifting testing left in the development process.
The article discusses Pinterest's transition to running a large end-to-end UI test suite before every commit in their Android and iOS repositories.
This article discusses Pinterest's Batch Processing Platform, Monarch, focusing on efficient resource management to ensure quality of service (QoS) while maintaining cost efficiency.
NVIDIA has launched the Morpheus Early Access Program, providing developers with access to its AI development framework for cybersecurity applications.
The article discusses advancements in AutoML using NVIDIA GPUs and RAPIDS, highlighting how AutoGluon simplifies the process of achieving state-of-the-art machine learning accuracy while significan...
AutoMLAWSAWS EC2CatBoostDeep LearningGoogle AutoMLLightGBMMachine LearningPandasPythonscikit-learnXGBoost
Carol McDonald
15 min read
Includes Code
Has Summary
--
The article discusses Agustinus Nalwan's interactive AI bot, Qrio, which was awarded the Jetson Project of the Month.
Nefi Alarcon
2 min read
Has Summary
--
The article discusses the evolution of Netflix Conductor, a workflow orchestration engine that has gained significant adoption within Netflix for managing core workflows.
Netflix Technology Blog
11 min read
Has Summary
--
The article discusses Pinterest's implementation of Presto, an open-source distributed SQL query engine, detailing the challenges faced and solutions developed to manage large-scale data analysis.
The article discusses optimizing End-to-End Memory Networks (MemN2N) using SigOpt and GPUs, focusing on hyperparameter tuning methods to enhance performance in Question Answering (QA) systems.
The article discusses improvements made to Vector, an open-source host-level performance monitoring framework at Netflix, which now includes latency heatmaps and enhanced container support using eB...
Netflix Technology Blog
6 min read
Has Summary
--
The article discusses Netflix's approach to preventing credential compromise in AWS environments by introducing two additional layers of security: API enforcement and metadata protection.
Netflix Technology Blog
11 min read
Has Summary
--
The article discusses Netflix's methodology for detecting credential compromise in AWS environments, emphasizing the importance of monitoring temporary security credentials.
This article discusses how to optimize deep learning training times using NVIDIA V100 Tensor Core GPUs in the AWS Cloud, reducing training durations from weeks to days.
Nefi Alarcon
2 min read
Has Summary
--
Netflix Cloud Security SIRT releases Diffy: A Differencing Engine for Digital Forensics in the Cloud
Netflix's Security Intelligence and Response Team (SIRT) introduces Diffy, a triage tool designed for digital forensics and incident response (DFIR) in cloud environments.