A data scientist at monday.com reflects on joining an AI agent team where there was no model to train, no Python, and no traditional ML workflow. The post argues that the data scientist's role in the agentic era shifts from model training to systematic evaluation and quality ownership. Key responsibilities include building error taxonomies from agent traces, curating golden datasets with real production failures, calibrating LLM-as-judge systems using inter-rater agreement metrics, and creating deterministic graders for structured outputs. The author introduces 'Evaluation-Driven Development' as the new feedback loop replacing model.fit(), and warns against the 'sprint velocity trap' where teams confuse shipping with improving. Context engineering is positioned as the new feature engineering, and the core data science value is framed as language-agnostic methodological rigor applied to understanding system behavior through data.
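One of the techniques the post names, calibrating an LLM-as-judge against human labels with an inter-rater agreement metric, can be sketched with Cohen's kappa. This is an illustrative implementation, not code from the post; the `human` and `judge` label lists are hypothetical examples of pass/fail verdicts on agent traces.

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa: agreement between two raters, corrected for chance.

    Returns 1.0 for perfect agreement, 0.0 for chance-level agreement.
    """
    assert len(labels_a) == len(labels_b) and labels_a
    n = len(labels_a)
    # Observed agreement: fraction of items where the raters match.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected agreement: chance overlap given each rater's label frequencies.
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    expected = sum(counts_a[k] * counts_b.get(k, 0) for k in counts_a) / (n * n)
    if expected == 1.0:  # both raters used a single identical label
        return 1.0
    return (observed - expected) / (1 - expected)

# Hypothetical verdicts: human annotator vs. LLM judge on ten agent traces.
human = ["pass", "fail", "pass", "pass", "fail", "pass", "fail", "pass", "pass", "fail"]
judge = ["pass", "fail", "pass", "fail", "fail", "pass", "fail", "pass", "pass", "pass"]
print(round(cohens_kappa(human, judge), 3))  # → 0.583
```

A kappa well below the raw agreement rate signals that the judge's "agreement" is partly chance, which is exactly why the post argues for calibration rather than trusting accuracy alone.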

13 min read · From engineering.monday.com
Table of contents
The Empty Notebook Is the Point
The Real Problem Isn't Building Agents. It's Knowing If They Work
Evaluation-Driven Development: Your New Training Loop
What the Work Actually Looks Like
The Sprint Velocity Trap
Where DS Ends and Engineering Begins
So Is It All About Evals?
The Quiet Case for Measurement
