# Large Language Models Programming Tutorials & Engineering Articles

110 Large Language Models tutorials, guides, and engineering insights from NVIDIA, Cloudflare, Google, and more

Large Language Models Articles & Tutorials

Google
Intermediate
Google announces the public preview of the Developer Knowledge API and its associated Model Context Protocol (MCP) server, providing a canonical, machine-readable gateway to Google's official devel...
Jess Kuras
3 min read
Includes Code
Has Summary
--
Google
Advanced
This tutorial demonstrates how to fine-tune FunctionGemma, a small language model for translating natural language into API calls, using Google's Tunix library on TPUs.
Wei Wei
4 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the NVIDIA Nemotron 3, a family of open models designed for agentic AI systems, emphasizing its efficiency and accuracy through innovative architectures and techniques.
--
Pinterest
Intermediate
The article discusses Pinterest's approach to enhancing its observability tools by integrating AI and the Model Context Protocol (MCP).
Pinterest Engineering
12 min read
Has Summary
--
Slack
Advanced
Slack's Security Engineering team describes how they built an AI agent-based system to automate and streamline security investigations.
Dominic Marks
12 min read
Has Summary
--
NVIDIA
Intermediate
The NVIDIA Blackwell architecture has achieved the fastest training times across all MLPerf Training v5.1 benchmarks, showcasing significant advancements in AI training performance.
--
Meta
Intermediate
The article discusses Meta's evolution in infrastructure over 21 years, highlighting the significant changes brought about by AI.
Yee Jiun Song
20 min read
Has Summary
--
Google
Intermediate
The article discusses the integration of the Apigee Operator for Kubernetes with the GKE Inference Gateway to enhance API management for AI and Large Language Models (LLMs).
Sanjay Pujare, Jennifer Bennett
4 min read
Includes Code
Has Summary
--
Cloudflare
Advanced
The article discusses a new approach to bot management that leverages behavioral anomaly detection tailored for individual customers.
--
Google
Intermediate
The article discusses the integration of Google’s Agent Development Kit (ADK) for Java with the LangChain4j LLM framework, enabling developers to utilize a variety of Large Language Models (LLMs) f...
Guillaume Laforge
5 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses how to enhance the efficiency of Large Language Models (LLMs) during inference by utilizing CPU-GPU memory sharing through NVIDIA's NVLink C2C technology.
Afroze Syed
6 min read
Includes Code
Has Summary
--
Google
Intermediate
This article discusses the integration of Google's EmbeddingGemma model with Google Cloud's Dataflow to create a scalable embedding pipeline for AI applications.
Danny McCormick, Ian Ballantyne, Olivier Lacombe
5 min read
Includes Code
Has Summary
--
Cloudflare
Advanced
The article discusses Cloudflare's introduction of unsafe content moderation integrated into its Firewall for AI, aimed at protecting Large Language Models (LLMs) from malicious prompts that could ...
Radwa Radwan
8 min read
Includes Code
Has Summary
--
Cloudflare
Intermediate
The article discusses the transformative impact of AI on various industries and introduces Cloudflare's AI Week 2025, focusing on enhancing security and control over AI technologies.
Kenny Johnson
7 min read
Has Summary
--
Google
Advanced
This article provides a comprehensive guide on how to train a GPT-2 model using JAX on TPU, highlighting the ease of leveraging Google TPUs for free.
--
Cloudflare
Intermediate
Cloudflare has partnered with OpenAI to integrate their new open-weight models into Cloudflare Workers AI, allowing developers to leverage these models for enhanced AI capabilities.
Michelle Chen
4 min read
Includes Code
Has Summary
--
Shopify
Intermediate
The article discusses Shopify's Global Catalogue, which utilizes multimodal Large Language Models (LLMs) to standardize and enrich product data across its platform.
Audrey-Anne Guindon
13 min read
Has Summary
--
Google
Advanced
The article introduces GenAI Processors, an open-source Python library from Google DeepMind aimed at simplifying the development of sophisticated AI applications using Large Language Models (LLMs).
Andre Elisseeff, Alexey Guseynov, Oskar Bunyan, Shrestha Basu Mallick
6 min read
Includes Code
Has Summary
--
Cloudflare
Intermediate
The article explores the changing dynamics of web crawling and referral traffic due to the rise of AI and Large Language Models (LLMs).
David Belson
8 min read
Includes Code
Has Summary
--
ClickHouse
Beginner
This article discusses how to build an agentic application using ClickHouse MCP Server and CopilotKit, focusing on creating a customizable analytics dashboard for the UK real estate market.
Lionel Palacin
10 min read
Includes Code
Has Summary
--
LinkedIn
Advanced
The article discusses JUDE, LinkedIn's platform for generating high-quality embeddings for job recommendations using fine-tuned Large Language Models (LLMs).
--
NVIDIA
Advanced
This article serves as a comprehensive guide for benchmarking Large Language Models (LLMs) using NVIDIA's GenAI-Perf tool alongside NVIDIA NIM.
Vinh Nguyen
11 min read
Includes Code
Has Summary
--
Pinterest
Advanced
The article discusses the implementation of a Large Language Model (LLM)-based relevance system for Pinterest Search, detailing its technical design, model architecture, and the results from both o...
--
LinkedIn
Advanced
The article discusses the evolution of LinkedIn's Nuage control plane, highlighting its transition from a self-service platform to a comprehensive control plane solution for managing data infrastru...
Aashish Nagpal
21 min read
Has Summary
--
Netflix
Intermediate
The article discusses Netflix's development of a Foundation Model for Personalized Recommendation, which aims to centralize member preference learning and enhance the efficiency of their recommenda...
Netflix Technology Blog
13 min read
Has Summary
--
NVIDIA
Advanced
This article provides a comprehensive guide on Vision Language Models (VLMs) and their evolution from single-image understanding to advanced video comprehension.
Shubham Agrawal
11 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the Marco framework, a configurable graph-based task-solving and multi-AI agent system designed to streamline chip design processes.
Mark Ren
8 min read
Has Summary
--
NVIDIA
Intermediate
NVIDIA has released a new Generative AI Teaching Kit aimed at enhancing education in generative AI technologies.
--
Google
Intermediate
The article discusses Gemma, a family of lightweight generative AI models, and introduces the concept of Agentic AI, which allows AI to make proactive decisions and utilize external tools.
Ju-yeong Ji
7 min read
Includes Code
Has Summary
--
Google
Advanced
The article discusses the Vertex AI RAG Engine, a tool designed to help developers build grounded generative AI applications by addressing challenges like hallucinations and outdated knowledge.
--
Anthropic
Intermediate
The article discusses the upgraded Claude 3.5 Sonnet model, which achieved a score of 49% on the SWE-bench Verified benchmark, surpassing the previous state-of-the-art model's score of 45%.
15 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the implementation of data-efficient knowledge distillation using NVIDIA NeMo-Aligner during supervised fine-tuning (SFT).
Anna Shors
5 min read
Has Summary
--
Uber
Intermediate
The article introduces the Prompt Engineering Toolkit developed by Uber, which aims to streamline the process of creating and managing prompts for Large Language Models (LLMs).
--
Palantir
Intermediate
The article discusses the ethical implications and operational realities of implementing AI Decision Support Systems (AI-DSS) in military contexts.
--
Slack
Intermediate
The article discusses how Slack is utilizing AI-powered tools to enhance developer productivity and streamline processes.
Anirudh Janga
10 min read
Has Summary
--
Cloudflare
Advanced
The article discusses the challenges and solutions involved in scaling the AI Gateway on the Cloudflare Developer Platform, specifically focusing on extending log storage capabilities from 30 minut...
Catarina Pires Mota
11 min read
Includes Code
Has Summary
--
Pinterest
Advanced
This article discusses the implementation of Ray Batch Inference at Pinterest, highlighting its advantages over previous solutions like Apache Spark and Torch Dataloader.
Pinterest Engineering
11 min read
Includes Code
Has Summary
--
Cloudflare
Intermediate
The article discusses Cloudflare's new tools that empower site owners to audit and control how AI models access their content.
Sam Rhea
12 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
NVIDIA showcased its AI security expertise at the Black Hat USA and DEF CON conferences, focusing on the evolving landscape of AI in cybersecurity.
--
NVIDIA
Intermediate
The article discusses the deployment of diverse AI applications using Multi-LoRA support on NVIDIA RTX AI PCs and workstations.
Annamalai Chockalingam
9 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses how NVIDIA NVLink and NVSwitch enhance the performance of Large Language Model (LLM) inference by enabling efficient multi-GPU computing.
Brian Slechta
7 min read
Has Summary
--
Palantir
Intermediate
The article discusses the importance of explainability in AI, particularly focusing on Large Language Models (LLMs) and the Chain-of-Thought (CoT) prompting technique.
Palantir
11 min read
Has Summary
--
Uber
Intermediate
The article discusses Uber's GenAI Gateway, a unified platform designed to streamline the integration of Large Language Models (LLMs) across various teams within the company.
Tse-Chi Wang, Roopansh Bansal
15 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the launch of Continuum AI by Edgeless Systems, a generative AI framework that ensures data privacy through confidential computing and NVIDIA H100 GPUs.
Laura Martinez
6 min read
Has Summary
--
Palantir
Intermediate
The article discusses the Product Reliability Incident Management team at Palantir, detailing their proactive and reactive approaches to managing critical incidents across their platforms.
Palantir
10 min read
Has Summary
--
NVIDIA
Advanced
This article explores the complexities of deploying trillion-parameter large language models (LLMs) in production environments, focusing on maximizing throughput and user interactivity.
Amr Elmeleegy
13 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the deployment of LoRA (Low-Rank Adaptation) fine-tuned models using NVIDIA NIM, highlighting the advantages of customizing large language models (LLMs) for specific tasks.
Shashank Verma
11 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article provides a comprehensive guide on deploying generative AI using NVIDIA NIM microservices, highlighting its ease of use for enterprise developers in both on-premises and cloud environmen...
--
NVIDIA
Advanced
The article discusses the integration of AI chatbots, particularly Gipi, with NVIDIA TensorRT-LLM and AI foundation models to enhance personalized learning experiences.
Nisanur Genc
5 min read
Has Summary
--
Slack
Intermediate
This article discusses Slack's transition from Enzyme to React Testing Library (RTL) for frontend testing, highlighting the challenges and solutions encountered during the conversion of over 15,000...
Sergii Gorbachov
18 min read
Includes Code
Has Summary
--