AI detection tools claim to identify AI-generated text but cannot provide definitive proof. Large language models learn from human text, making their output inherently human-like by design. While current models share a recognizable "house style" due to RLHF and safety tuning, detection tools achieve only ~90% accuracy at best, leading to significant false positives. These tools themselves use AI and can only provide probabilistic assessments. The billion-dollar detection industry, along with "humanizing" services, creates perverse incentives that harm legitimate writers who face unjust accusations. Students now self-censor their writing or document their process to defend against false positives.
Table of contents
Why AI detection is hardWhy AI detection tools might work anywayHow do AI detection tools work?Humanizing toolsFalse positives and social harmSort: