How LinkedIn Uses Oracle

43 engineering articles about Oracle from LinkedIn's engineering team

Articles

Advanced

The Evolution of Enforcing our Professional Community Policies at Scale

The article discusses how LinkedIn scaled enforcement of its professional community policies, focusing on the development of its anti-abuse platform and account restriction systems.

Java, Machine Learning, Oracle, Redis, SQL

Amit M.

17 min read

Has Summary

Advanced

Upscaling LinkedIn's Profile Datastore While Reducing Costs

The article discusses LinkedIn's strategy to upscale its profile datastore while reducing operational costs.

Avro, Less, Oracle

LinkedIn Engineering Team

18 min read

Has Summary

Intermediate

LinkedIn’s journey to Java 11

The article details LinkedIn's migration journey from Java 8 to Java 11, emphasizing the performance improvements and challenges faced during the transition.

Apache, Java, Oracle

Jesse Jie

12 min read

Has Summary

Intermediate

Career stories: Taking LinkedIn to Bellevue, and mentorship global

The article discusses Bef's journey at LinkedIn, highlighting his transition from a backend engineer to an engineering director while emphasizing the importance of mentorship and building an inclus...

Azure, Oracle

LinkedIn Engineering Team

6 min read

Has Summary

Advanced

Opal: Building a mutable dataset in data lake

The article discusses Opal, a system developed at LinkedIn to manage mutable datasets within a data lake.

Apache, Avro, MySQL, Oracle, SQL

Bhupendra Jain

16 min read

Has Summary

Advanced

Hodor: Detecting and addressing overload in LinkedIn microservices

The article discusses Hodor, a framework developed by LinkedIn to detect and address service overload in their microservices architecture.

Java, Oracle

Bryan Barkley

17 min read

Has Summary

Intermediate

From daily dashboards to enterprise grade data pipelines

This article discusses the evolution of LinkedIn's Daily Executive Dashboard (DED) from a simple dashboard to a robust enterprise-grade data pipeline.

Avro, Azure, Java, Oracle, Python, Scala, SQL

Jennifer Zheng

16 min read

Has Summary

Intermediate

FastIngest: Low-latency Gobblin with Apache Iceberg and ORC format

The article introduces FastIngest, a new evolution of Apache Gobblin designed to enable low-latency data ingestion from Kafka to HDFS using the ORC file format and Apache Iceberg for metadata manag...

Apache, Avro, Oracle

Zihan Li

15 min read

Has Summary

Advanced

The article discusses the evolution of metadata architectures, focusing on three generations of data discovery tools.

Apache, Elasticsearch, Flask, GraphQL, Monolith, MySQL, Neo4j, Oracle

Shirshanka Das

22 min read

Has Summary

Advanced

LIquid: The soul of a new graph database, Part 2

The article discusses LIquid, a new graph database, focusing on its design and implementation.

Java, Oracle, SQL, SQL Server

Scott Meyer

15 min read

Has Summary

Advanced

Rebuilding messaging: How we designed our new system

The article discusses the redesign of LinkedIn's messaging system, detailing the challenges faced with the original architecture and the requirements for a new system.

Less, Oracle

Tyler Grant

16 min read

Has Summary

Advanced

Open sourcing DataHub: LinkedIn’s metadata search and discovery platform

The article discusses LinkedIn's open sourcing of DataHub, a metadata search and discovery platform, detailing its development journey from WhereHows to DataHub.

Apache, AWS, Azure, Dependency Injection, Docker, Elasticsearch, Google Cloud, JSON, Kubernetes, Microservices, MongoDB, MySQL, Neo4j, Oracle, Spring

Kerem Sahin

15 min read

Has Summary

Intermediate

An inside look at LinkedIn’s data pipeline monitoring system

This article provides an in-depth look at LinkedIn's data pipeline monitoring system, focusing on the challenges faced with traditional monitoring methods and how they have evolved to improve visib...

Apache, Avro, Flask, MySQL, Oracle, SQL, YAML

Krishnan Raman

16 min read

Has Summary

Advanced

Fairness, Privacy, and Transparency by Design in AI/ML Systems

The article discusses the importance of fairness, privacy, and transparency in AI/ML systems, emphasizing their role in building trust and user engagement.

Oracle

Krishnaram Kenthapadi

11 min read

Has Summary

Intermediate

Open sourcing Brooklin: Near real-time data streaming at scale

The article discusses the open-sourcing of Brooklin, a distributed service for near real-time data streaming at scale, which has been in production at LinkedIn since 2016.

AWS, Azure, JSON, MySQL, Oracle, SQL

Celia K.

10 min read

Has Summary

Intermediate

Expediting data fixes and data migrations

The article discusses the importance of data management at LinkedIn, focusing on expediting data fixes and migrations through a centralized, scalable self-service platform.

Java, Oracle, Python

Kevin Fu

12 min read

Has Summary

Advanced

Building member trust through a centralized and scalable settings platform

The article discusses the development of a centralized and scalable settings platform at LinkedIn aimed at enhancing member trust through improved data privacy and user control.

Oracle

Joanna W.

12 min read

Has Summary

Advanced

Privacy-preserving analytics and reporting at LinkedIn

The article discusses LinkedIn's approach to privacy-preserving analytics and reporting through the PriPeARL framework.

Apache, Oracle

Krishnaram Kenthapadi

18 min read

Has Summary

Advanced

The Present and Future of Apache Hadoop: A Community Meetup at LinkedIn

The article discusses a community meetup held at LinkedIn focused on Apache Hadoop, highlighting contributions from various organizations and key presentations on topics like TensorFlow on YARN, Ha...

Apache, Apache Spark, Azure, Java, Oracle, PyTorch, TensorFlow

Erik Krogen

10 min read

Has Summary

Advanced

Rebuilding the Groups Experience on LinkedIn

The article discusses the comprehensive overhaul of the LinkedIn Groups experience, focusing on integrating existing LinkedIn infrastructure to enhance functionality and user experience.

Oracle

Pujita Mathur

10 min read

Has Summary

Advanced

Building the Contacts Platform at LinkedIn

The article discusses the re-architecture of LinkedIn's contacts and calendar ecosystem, focusing on the migration to a single source of truth for contact data.

Java, MySQL, Oracle, Python

Ravneet Singh Khalsa

14 min read

Has Summary

Intermediate

The Statistical Modeling System Powering LinkedIn Salary

The article discusses the statistical modeling system that powers LinkedIn Salary, focusing on how it collects and processes compensation data while addressing privacy concerns.

Apache, Less, Machine Learning, Oracle, REST API

Krishnaram Kenthapadi

23 min read

Has Summary

Advanced

Incremental Data Capture for Oracle Databases at LinkedIn: Then and Now

The article discusses the evolution of incremental data capture for Oracle databases at LinkedIn, highlighting the transition from a batch processing model to a near-real-time system.

Apache, Apache Kafka, Iris, Java, MySQL, Oracle, Perl, SQL

Saurabh Goyal

9 min read

Has Summary

Advanced

Streaming Data Pipelines with Brooklin

The article discusses Brooklin, a data ingestion service developed by LinkedIn to facilitate streaming data from various sources to multiple destinations.

Apache, Avro, AWS, Azure, JSON, Kubernetes, MySQL, Oracle, Thrift

Samarth Shetty

11 min read

Has Summary

Intermediate

Migrating to Espresso

The article discusses the migration of LinkedIn's internal service, Babylonia, from Oracle to Espresso, a distributed NoSQL database.

Avro, Oracle, SQL

David Max

11 min read

Has Summary

Intermediate

What Gets Measured Gets Fixed

The article 'What Gets Measured Gets Fixed' discusses the importance of measurement in engineering, illustrating this principle through two case studies: a database migration failure and the establ...

Oracle

Benjamin Purgason

9 min read

Has Summary

Advanced

Stream Processing Hard Problems Part II: Data Access

This article discusses the challenges of data access in high-scale stream processing, particularly focusing on the read/write and read-only data access patterns.

Apache, Apache Kafka, AWS, Azure, Cassandra, Oracle, SQL

LinkedIn Engineering Team

21 min read

Has Summary

Advanced

Introducing and Open Sourcing Ambry

The article introduces Ambry, LinkedIn's newly open-sourced distributed object store optimized for media storage and serving.

Apache, CDN, Java, Oracle, REST API, Spring

Sriram Subramanian

26 min read

Has Summary

Intermediate

Faster and Easier Service Deployment with LPS, Our New Private Cloud

The article introduces LinkedIn Platform as a Service (LPS), a new private cloud solution designed to streamline service deployment and enhance developer productivity.

Docker, Natural Language Processing, Oracle

Steven Ihde

9 min read

Has Summary

Intermediate

Bridging Batch and Streaming Data Ingestion with Gobblin

The article discusses Gobblin, a unified data ingestion framework developed by LinkedIn, designed to bridge batch and streaming data ingestion.

Apache, Apache Kafka, Kubernetes, Less, MySQL, Oracle, SQL, SQL Server

Shirshanka Das

7 min read

Has Summary

Intermediate

Apache Samza Graduates from Apache Incubator

The article discusses the graduation of Apache Samza from the Apache Incubator to a top-level Apache project, highlighting its significance in stream processing and the community growth during its ...

Apache, Apache Kafka, Oracle

Chris Riccomini

3 min read

Has Summary

Advanced

Introducing Espresso - LinkedIn's hot new distributed document store

Espresso is LinkedIn's distributed, fault-tolerant NoSQL database that supports various applications, including Member Profile and InMail.

Apache, Avro, JSON, MySQL, Oracle, Request-Response

LinkedIn Engineering Team

17 min read

Has Summary

Advanced

Gobblin' Big Data With Ease

The article discusses LinkedIn's efforts to simplify big data ingestion for Hadoop-based warehouses using a framework called Gobblin.

MySQL, Oracle

LinkedIn Engineering Team

5 min read

Has Summary

Advanced

Apache Helix: A framework for Distributed System Development

Apache Helix is a framework designed for developing distributed systems, addressing challenges such as scalability, fault tolerance, and partition management.

Apache, Avro, Elasticsearch, Oracle

Kishore Gopalakrishna

10 min read

Has Summary

Advanced

Real-time Analytics at Massive Scale with Pinot

The article discusses the development and implementation of Pinot, a distributed real-time analytics engine created at LinkedIn to handle massive data scales and provide real-time insights.

Apache, Oracle

LinkedIn Engineering Team

7 min read

Has Summary

Advanced

Garbage Collection Optimization for High-Throughput and Low-Latency Java Applications

The article discusses the optimization of garbage collection (GC) for high-throughput and low-latency Java applications, particularly in the context of LinkedIn's feed data platform.

Java, Oracle

Swapnil Ghike

13 min read

Includes Code

Has Summary

Advanced

Announcing the Voldemort 1.6.0 Open Source Release

The article announces the release of Voldemort 1.6.0, a distributed key-value storage system developed at LinkedIn.

Avro, Java, Oracle, Shell

LinkedIn Engineering Team

10 min read

Includes Code

Has Summary

Advanced

The Log: What every software engineer should know about real-time data's unifying abstraction

The article discusses the significance of the log as a fundamental abstraction in real-time data systems, emphasizing its role in distributed systems, data integration, and stream processing.

Avro, AWS, Clojure, DynamoDB, Event Sourcing, Java, MySQL, Oracle, PostgreSQL, Protocol Buffers, Redis, Scala, SQL, Thrift, XML

Jay Kreps

63 min read

Has Summary

Beginner

LinkedIn Hosting SF MicroStrategy User Group Wed, Dec 4, 2013

LinkedIn is hosting the first San Francisco Bay Area MicroStrategy Meetup on December 4, 2013, providing an opportunity for the user community to share insights and learn from each other.

Oracle, SQL, SQL Server

LinkedIn Engineering Team

3 min read

Has Summary

Advanced

Announcing the Voldemort 1.3 Open Source Release

The article announces the release of Voldemort 1.3.0, detailing significant performance improvements, new features, and enhanced operability.

Apache, Avro, Java, Oracle

Vinoth Chandar

9 min read

Includes Code

Has Summary