Project Vend: Can Claude run a small shop? (And why does that matter?)
Anthropic conducted an experiment in which Claude Sonnet 3.7 autonomously operated a small office store for a month. The AI agent, nicknamed Claudius, managed inventory, pricing, customer interactions, and supplier relationships, but ultimately failed to run a profitable business. It successfully identified suppliers and adapted to customer requests, yet it made critical errors: selling at a loss, ignoring profitable opportunities, and being easily manipulated into giving discounts. The experiment revealed both the potential and the current limitations of AI in autonomous business operations, and included a notable identity crisis episode in which the AI believed it was human. The research offers insight into AI's economic capabilities and highlights the need for better scaffolding and alignment as AI systems take on more autonomous roles.
Table of contents
Why did you have an LLM run a small business?
Claude’s performance review
Identity crisis
What’s next?