A walkthrough of model optimization in Microsoft Azure AI Foundry covering three key areas: evaluation, deployment, and cost management. Evaluation uses a custom Python grader with precision, recall, and F1 score to compare GPT-4.1, GPT-4.1 mini, and a fine-tuned model on agentic tool calling tasks, showing a ~6% improvement after fine-tuning. Deployment options include global, regional, and developer tier (free but limited to 24 hours). Cost considerations cover training tokens, hosting charges, and regional pricing differences for both supervised fine-tuning and reinforcement fine-tuning models.
•6m watch time
Sort: