Netflix's live streaming operations grew from engineers manually running one show per month in 2023 to a dedicated operations organization handling 70+ events per month by early 2026. The post details the evolution through four operational phases: an all-hands engineering era, introduction of specialized Streaming Operations Engineers (SOEs) and Broadcast Operations Engineers (BOEs), a co-pilot control room model, and finally a Transmission Operations Center (TOC) fleet model enabling up to 10 concurrent events. The Live Command Center (LCC) provides end-to-end visibility across the entire pipeline, ingesting up to 38 million telemetry events per second. A tiered event classification system (Low-Profile, High-Profile, Big Bet) and a Live Operational Level (LOL) model determine staffing and standby requirements. Key lessons include the importance of standardized runbooks, separating planning from operations roles, a vendor-operator model for elastic workforce scaling, and integrated IP-based communications between the BOC and LCC. Netflix is now expanding internationally with a London Live Operations Center and planning to merge the LCC and BOC into a single facility.

17m read timeFrom netflixtechblog.com
Post cover image
Table of contents
Humble BeginningsThe Architecture of Live Operations

Sort: