Benchmarks show that large language models (LLMs) produce lower-quality code when they return it inside a structured JSON response than when they return it as plain text. Wrapping code in JSON forces the model to quote and escape it correctly, and the resulting syntax errors reduce its ability to solve coding problems. Advances such as OpenAI's strict JSON support have not closed the gap, which suggests that plain text remains the better format for requesting code.
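To make the quoting and escaping burden concrete, here is a minimal Python sketch. It is illustrative only and is not the benchmark's own harness: the `greet` snippet, the `extract_fenced` helper, and the `broken` payload are all hypothetical. It shows the same code carried as a JSON string and as a fenced plain-text block, and how a single unescaped quote invalidates the JSON form.

```python
import json
import re

# Hypothetical snippet a model might be asked to return.
code = 'def greet(name):\n    print(f"Hello, {name}!")\n'

# Inside JSON, every double quote and newline in the code must be escaped.
as_json = json.dumps({"code": code})

# As plain text, the code sits unescaped inside a fenced block.
FENCE = "`" * 3
as_text = f"{FENCE}python\n{code}{FENCE}\n"


def extract_fenced(text):
    """Return the body of the first fenced block in text, or None."""
    match = re.search(FENCE + r"[^\n]*\n(.*?)" + FENCE, text, re.DOTALL)
    return match.group(1) if match else None


def is_valid_json(payload):
    """Report whether payload parses as JSON."""
    try:
        json.loads(payload)
        return True
    except json.JSONDecodeError:
        return False


# Hypothetical model output with one forgotten escape around "hi".
broken = '{"code": "print("hi")"}'

print(as_json)                 # escaped form the model would have to emit
print(is_valid_json(broken))   # the unescaped quote makes this unparseable
print(extract_fenced(as_text) == code)
```

In the plain-text form the model writes the code exactly as it would appear in a file, so there is no escaping layer to get wrong; the caller only has to strip the fence.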