WebDev Arena is an open-source benchmark developed by LMArena for evaluating AI capabilities in web development. The leaderboard shows the scores and rankings of various AI models, with Claude 3.5 Sonnet by Anthropic leading the pack. Models are evaluated based on their Arena Score and the votes they have received.

2m read timeFrom web.lmarena.ai
Post cover image

Sort: