#

TensorFlow Programming Tutorials & Engineering Articles

606 TensorFlow tutorials, guides, and engineering insights from NVIDIA, Google, Uber, and more

TensorFlow Articles & Tutorials

Filter:
Google logo
Google
Advanced
LiteRT has evolved from its TensorFlow Lite foundation into a universal on-device AI inference framework, now offering production-ready GPU acceleration across six platforms and streamlined NPU int...
Lu Wang, Chintan Parikh, Jingjiang Li, Terry Heo
9 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Intermediate
Shopify open-sources Tangle, an ML experimentation platform built to solve six common failure modes in machine learning development.
Shopify Engineering
12 min read
Has Summary
--
Google logo
Google
Advanced
Metrax is a high-performance library designed for efficient and robust model evaluation metrics in JAX, addressing the need for standardized metrics during the migration from TensorFlow.
Yufeng Guo, Jiwon Shin, Jeff Carpenter
5 min read
Includes Code
Has Summary
--
Pinterest logo
Pinterest
Advanced
The article reflects on a decade of AI platform development at Pinterest, detailing the evolution from fragmented machine learning stacks to a unified AI platform that supports various models.
Uber logo
Uber
Advanced
This article discusses how Uber has integrated explainability into its machine learning platform, Michelangelo, using Integrated Gradients (IG) to provide interpretable attributions for deep learni...
Hugh Chen, Eric Wang, Gaoyuan Huang, Howard Yu, Jia Li, Sally Lee
14 min read
Has Summary
--
Google logo
Google
Intermediate
The article introduces Coral NPU, a full-stack, open-source platform designed to enhance Edge AI capabilities on low-power devices.
Billy Rutledge
8 min read
Has Summary
--
Google logo
Google
Advanced
The article discusses building high-performance data pipelines using Grain, a data loading library for JAX, and ArrayRecord, an efficient file format.
Jiyang Kang, Shivaji Dutta, Ihor Indyk, Felix Chern
10 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the relevance of the Common Vulnerabilities and Exposures (CVE) system in relation to AI models, arguing that CVEs should be focused on the frameworks and applications that ut...
Rich Harang
7 min read
Has Summary
--
Cloudflare logo
Cloudflare
Intermediate
The article discusses the development of AI face cropping technology by Cloudflare, which automatically crops images around detected faces.
Deanna Lam
14 min read
Includes Code
Has Summary
--
Google logo
Google
Advanced
This article provides a comprehensive guide on how to train a GPT-2 model using JAX on TPU, highlighting the ease of leveraging Google TPUs for free.
NVIDIA logo
NVIDIA
Intermediate
The article discusses the impending shortage of healthcare workers and how AI-enabled robotic systems, powered by NVIDIA Isaac for Healthcare, can address these challenges.
Ansley Dunn
6 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses the introduction of cuda-cccl, a Python library that provides high-level building blocks for NVIDIA CUDA kernel fusion, enabling developers to write efficient algorithms witho...
Ashwin Srinath
5 min read
Includes Code
Has Summary
--
Google logo
Google
Intermediate
The article discusses how to use KerasHub for loading model weights from SafeTensors into Keras, enabling flexible end-to-end machine learning workflows across different frameworks like JAX, PyTorc...
Yufeng Guo, Divyashree Sreepathihalli, Monica Song
8 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Intermediate
The article discusses the advancements in React Native graphics through the integration of WebGPU and Skia, highlighting how these technologies enhance performance and enable new capabilities for d...
William Candillon
8 min read
Has Summary
--
Google logo
Google
Advanced
The article introduces Keras Recommenders, a new library designed to simplify the creation of state-of-the-art recommendation systems using Keras with JAX, TensorFlow, or PyTorch.
Yufeng Guo, Monica Song
3 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses how to accelerate Deep Learning (DL) and Large Language Model (LLM) inference using Apache Spark in cloud environments.
NVIDIA logo
NVIDIA
Advanced
The article discusses optimizing transformer-based diffusion models for video generation using NVIDIA TensorRT, highlighting significant reductions in latency and total cost of ownership (TCO) achi...
Maximilian Müller
7 min read
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
A new AI-powered tool developed by researchers at the University of Florida and medical centers aims to improve the diagnosis of Parkinson's disease using standard MRI scans.
Michelle Horton
3 min read
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
NVIDIA has open-sourced the KAI Scheduler, a Kubernetes-native GPU scheduling solution under the Apache 2. 0 license, originally developed for the Run:ai platform.
Ronen Dar
9 min read
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
NVIDIA Dynamo is a newly released low-latency distributed inference framework designed to enhance the deployment of generative AI and reasoning models in large-scale environments.
Amr Elmeleegy
12 min read
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
NVIDIA has announced world-record inference performance for the DeepSeek-R1 model using the Blackwell architecture, achieving over 250 tokens per second per user and a maximum throughput of over 30...
NVIDIA logo
NVIDIA
Advanced
The article discusses an advanced deep-learning model designed to automate X-ray analysis for spinal health diagnostics, enhancing speed and accuracy in assessing conditions like scoliosis and kyph...
Michelle Horton
4 min read
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses key insights from the NVIDIA 6G Developer Day 2024, highlighting the integration of AI into 6G infrastructure and the significance of AI-RAN.
Emeka Obiodu
10 min read
Has Summary
--
LinkedIn logo
LinkedIn
Advanced
The article discusses Liger-Kernel, an open-source library designed to enhance GPU efficiency for training large language models (LLMs).
Pin-Lun (Byron) Hsu
10 min read
Includes Code
Has Summary
--
Google logo
Google
Intermediate
The article announces that Francois Chollet, the creator of Keras, is leaving Google to pursue new opportunities.
Bill Jia, Xavi Amatriain
2 min read
Has Summary
--
Google logo
Google
Advanced
The Web AI Summit 2024, hosted by Google on October 18, 2024, focused on client-side AI for developers, showcasing how machine learning models can operate offline in web browsers.
NVIDIA logo
NVIDIA
Intermediate
The article discusses the advancements in AI agents facilitated by NVIDIA AI Enterprise, emphasizing enhanced security, streamlined deployment, and management of AI pipelines.
NVIDIA logo
NVIDIA
Advanced
The article discusses how to scale Large Language Models (LLMs) using NVIDIA Triton and NVIDIA TensorRT-LLM in a Kubernetes environment.
NVIDIA logo
NVIDIA
Advanced
The article discusses the Brain-Machine Interactive Neuromodulation Research Tool (BMINT), which utilizes closed-loop neuromodulation techniques to treat brain diseases like epilepsy and Parkinson'...
Shouyan Wang
4 min read
Has Summary
--
Pinterest logo
Pinterest
Advanced
This article discusses the implementation of Ray Batch Inference at Pinterest, highlighting its advantages over previous solutions like Apache Spark and Torch Dataloader.
Pinterest Engineering
11 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses the development and deployment of real-time neural receivers (NRX) in 5G New Radio (5G NR) systems, highlighting their potential to enhance wireless communication through AI-d...
Sebastian Cammerer
10 min read
Has Summary
--
Google logo
Google
Intermediate
LiteRT, formerly known as TensorFlow Lite, is a high-performance runtime for on-device AI that now supports models from multiple frameworks including PyTorch, JAX, and Keras.
Google AI Edge team
4 min read
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the release of NVIDIA TAO 5. 5, a framework that simplifies AI model development and deployment.
Monika Jhuria
12 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the impressive performance of the NVIDIA Triton Inference Server in the MLPerf Inference v4.
Google logo
Google
Intermediate
The article discusses optimizing Large Language Model (LLM) inference at the edge using TensorFlow Lite (TFLite) and XNNPack.
Quentin Khan, Linkun Chen
6 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses NVIDIA's fVDB, a deep-learning framework designed to build spatial intelligence from real-world 3D data.
Ken Museth
6 min read
Has Summary
--
Google logo
Google
Intermediate
The article discusses the release of the Gemma 2 model with 27 billion parameters, highlighting its capabilities in Keras and integration with JAX for efficient model training.
Martin Görner
5 min read
Includes Code
Has Summary
--
Google logo
Google
Intermediate
Model Explorer is a powerful graph visualization tool designed to simplify the development and optimization of machine learning models for edge devices.
Kristen Wright, Eric Yang
6 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses NVIDIA Metropolis, a platform for real-time vision AI that streamlines deployment through microservices and workflows.
NVIDIA logo
NVIDIA
Intermediate
The article discusses how to build a zero-copy AI sensor processing pipeline using OpenCV within the NVIDIA Holoscan SDK.
Meiran Peng
7 min read
Includes Code
Has Summary
--
Google logo
Google
Advanced
The article introduces the AI Edge Torch Generative API, designed to enable developers to create high-performance LLMs in PyTorch for deployment on edge devices using the TensorFlow Lite runtime.
Cormac Brick, Haoliang Zhang
10 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the enhancements made in NVIDIA's cuDNN 9 library, focusing on the acceleration of Transformers through the implementation of Scaled Dot Product Attention (SDPA).
Matthew Nicely
11 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses how Snap's ML engineering team enhanced the apparel shopping experience using AI, specifically through the Screenshop service integrated into Snapchat.
Google logo
Google
Beginner
Google AI Edge Torch provides a seamless integration from PyTorch to TensorFlow Lite (TFLite), enhancing model coverage and CPU performance for mobile devices.
Cormac Brick, Advait Jain, Haoliang Zhang
5 min read
Includes Code
Has Summary
--
Google logo
Google
Intermediate
The article recaps the Google I/O 2024 event, highlighting advancements in AI technologies aimed at making AI accessible for developers.
NVIDIA logo
NVIDIA
Advanced
The article provides a comprehensive guide on building a Retrieval-Augmented Generation (RAG) pipeline using NVIDIA AI LangChain AI Endpoints.
Google logo
Google
Intermediate
This article discusses how to publish Keras models on Kaggle and Hugging Face, highlighting the ease of sharing fine-tuned models with the community.
Uber logo
Uber
Advanced
The article discusses Uber's evolution in machine learning (ML) through its centralized platform, Michelangelo, highlighting its transition from predictive to generative AI.
NVIDIA logo
NVIDIA
Intermediate
The article discusses NVIDIA AI Enterprise IGX, a software solution designed for mission-critical AI applications at the edge.
Suhas Hariharapura Sheshadri
5 min read
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the NVIDIA 6G Developer Program, which aims to accelerate the development of 6G technology by providing access to AI/ML tools, simulation environments, and software-defined ne...
Kuntal Chowdhury
9 min read
Has Summary
--