Netflix built an internal post-training framework to scale LLM fine-tuning from experimentation to production. The framework abstracts infrastructure complexity across four dimensions: data (streaming, sequence packing, loss masking), model (sharding, LoRA, architecture support), compute (distributed job orchestration,
Table of contents
IntroductionA Model Developer’s Post-Training JourneyThe Netflix Post-Training FrameworkLearnings from Building the Post-Training FrameworkGet Netflix Technology Blog ’s stories in your inboxWrap upAcknowledgementsSort: