An 18-month New Yorker investigation reveals a gap between Sam Altman's public commitments to AI safety and OpenAI's actual follow-through. Key findings include Altman's dismissive stance on hallucinations as desirable 'magic,' OpenAI's superalignment team receiving only 1-2% of compute despite a pledged 20% and being dissolved by May 2024, and internal safety reviews for GPT-4 that board members found incomplete. The piece also covers sycophancy as a structural flaw in RLHF-trained models and the risks of deceptive alignment for developers deploying LLMs in production.

5m read timeFrom thenewstack.io
Post cover image
Table of contents
Can AI hallucinations ever be good?When AI has its own goalsGaps in safety reviews

Sort: