A New Yorker investigation reveals how Sam Altman's AI safety commitments at OpenAI fell short on hallucinations, deceptive alignment, and oversight.

The New Stack is a publication covering trends and technologies in cloud-native development, DevOps, and software delivery. Developers can learn about containerization, Kubernetes, and cloud computing, as well as explore topics such as microservices architecture, serverless computing, and continuous integration/continuous delivery (CI/CD) pipelines.

The New Stack

An 18-month New Yorker investigation reveals a gap between Sam Altman's public commitments to AI safety and OpenAI's actual follow-through. Key findings include Altman's dismissive stance on hallucinations as desirable 'magic,' OpenAI's superalignment team receiving only 1-2% of compute despite a pledged 20% and being dissolved by May 2024, and internal safety reviews for GPT-4 that board members found incomplete. The piece also covers sycophancy as a structural flaw in RLHF-trained models and the risks of deceptive alignment for developers deploying LLMs in production.

Sam Altman promised billions for AI safety. Here’s what OpenAI actually spent.