A conference talk covering browser-based AI APIs for web developers, focusing on the WebNN (Web Neural Network) standard and its ecosystem. The talk explains how modern browsers can run AI workloads directly on-device using CPU, GPU, and NPU hardware — without cloud dependencies. It covers the layered stack from low-level WebNN to ONNX Runtime Web to Transformers.js, showing how app developers can add AI features (image recognition, NLP, audio) with just a few lines of code. Key benefits highlighted include privacy (data stays on device), offline capability, low latency, and no API costs. Also covers Chrome/Edge built-in AI APIs for micro-interactions and practical setup steps including origin trials and driver requirements for NPU access.

45m watch time

Sort: