OpenAI's Codex coding agent can now orchestrate end-to-end machine learning experiments using Hugging Face Skills. The integration enables Codex to fine-tune models, monitor training with Trackio, evaluate checkpoints, generate reports, and deploy models—all through natural language instructions. The tutorial demonstrates fine-tuning a Qwen3-0.6B model on coding problems, with Codex handling dataset validation, hardware selection, job submission, progress tracking, and GGUF conversion for local deployment. The system supports SFT, DPO, and GRPO training methods for models from 0.5B to 7B parameters on Hugging Face's cloud infrastructure.
Table of contents
GOAL: End-to-end Machine Learning experimentsSetup and InstallYour first AI ExperimentWhat's NextResourcesSort: