Daily Dose of DS publishes daily bite-sized articles, tutorials, and curated resources for data scientists and aspiring data professionals, covering data analysis, machine learning, and artificial intelligence. It tracks current trends, techniques, and tools in data science so readers can keep their skills up to date in a rapidly evolving field.

Daily Dose of Data Science | Avi Chawla | Substack

A practical guide to evaluating AI agents beyond simple task completion using the DeepEval open-source framework. Covers six agentic metrics split into two layers: full-trace metrics (PlanQualityMetric, PlanAdherenceMetric, TaskCompletionMetric, StepEfficiencyMetric) and component-level metrics (ToolCorrectnessMetric, ArgumentCorrectnessMetric). Also demonstrates how to use DeepEval's ConversationSimulator to auto-generate multi-turn test cases from scenario definitions, and how to apply conversational metrics like ConversationCompletenessMetric and TurnRelevancyMetric. Code examples show how to instrument agents with @observe decorators and run evaluations in a structured pipeline.
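
To make the component-level layer more concrete, here is a minimal sketch in plain Python of what tool-level and argument-level checks can look like. It does not use DeepEval and is not its API or scoring logic: the `ToolCall` dataclass, the `tool_correctness` and `argument_correctness` functions, and the exact-match scoring rules are all hypothetical simplifications chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ToolCall:
    """One tool invocation recorded from an agent run (hypothetical structure)."""
    name: str
    args: dict = field(default_factory=dict)


def tool_correctness(called, expected):
    """Fraction of expected tool names that the agent actually called.

    Assumption: order and extra calls are ignored; only tool names are compared.
    """
    expected_names = {call.name for call in expected}
    if not expected_names:
        return 1.0
    called_names = {call.name for call in called}
    return len(expected_names & called_names) / len(expected_names)


def argument_correctness(called, expected):
    """Fraction of expected calls matched by a called tool with identical args.

    Assumption: arguments must match exactly, which is stricter than a
    judgment-based check would be.
    """
    if not expected:
        return 1.0
    matched = sum(
        1
        for exp in expected
        if any(c.name == exp.name and c.args == exp.args for c in called)
    )
    return matched / len(expected)


# Illustrative trace: the agent picked the right tools but passed one wrong argument.
expected_calls = [
    ToolCall("search_flights", {"origin": "SFO", "dest": "JFK"}),
    ToolCall("get_weather", {"city": "Boston"}),
]
called_calls = [
    ToolCall("search_flights", {"origin": "SFO", "dest": "JFK"}),
    ToolCall("get_weather", {"city": "NYC"}),
]

tool_score = tool_correctness(called_calls, expected_calls)
arg_score = argument_correctness(called_calls, expected_calls)
print(f"tool correctness: {tool_score}, argument correctness: {arg_score}")
```

In this made-up trace the tool score is perfect while the argument score is not, which is the kind of failure a task-completion check alone can hide and the reason for scoring the two separately.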

Six Key Metrics for AI Agent Evaluation

InsForge: The first backend built for AI coding agents, not human dashboards