What an AI detector actually measures
Detectors compute two numbers for every passage: perplexity and burstiness.
Perplexity measures how 'surprised' a language model is by the text. AI-generated text is statistically average — it picks the most-likely next word at each step, so a model rating the text sees no surprises, giving low perplexity. Human writing is jagged, idiosyncratic, full of small left turns: high perplexity.
Burstiness measures sentence-length variation. Humans alternate short and long sentences. AI tends toward consistent mid-length sentences. Low burstiness = AI signal.
Detectors combine these with stylometric features (vocabulary entropy, comma usage, transitional phrases) and feed the lot into a logistic regression or small neural classifier.
Why detectors get it wrong (a lot)
Non-native English speakers write with restricted vocabulary and consistent sentence length — they look statistically like GPT-3. False-positive rates for international students run 30–55 %.
Academic and technical writing is, by genre, low-perplexity and low-burstiness. A formally written human paragraph can score 90 %+ AI even when written by hand.
Heavily edited or paraphrased AI text bypasses most detectors because human edits inject burstiness.
How accurate are the major detectors in 2026?
- GPTZero — 70–85 % accuracy on academic essays, 50–65 % on creative writing.
- Originality.ai — 90 %+ on plain GPT/Claude output, drops to 60 % on paraphrased.
- Turnitin AI — 80–88 % on student work, but high false-positive on non-native writers.
- Copyleaks — comparable to Originality.ai.
- Independent academic studies (Stanford, MIT 2024) consistently show real-world accuracy 15–25 points lower than vendor claims.
What Google actually does
Google does NOT use one of the public detectors. Its Helpful Content System looks for: low-quality intent (thin pages built for search not readers), site-wide low-effort content, and engagement signals (bounce, dwell time). AI-written articles that are genuinely useful, well-structured and demonstrably helpful are not penalised. Mass-produced doorway pages are.
The takeaway: write for usefulness first, and the AI/human question becomes a marketing distraction.
How to use AI without triggering detectors (legitimately)
- Treat AI output as a first draft, not the final.
- Rewrite the opening 200 words by hand — it's where most detectors focus.
- Vary sentence length deliberately: alternate 4-word and 25-word sentences.
- Insert personal anecdotes, specific numbers and contemporary references the model couldn't know.
- Read it aloud. Anywhere it sounds robotic, rewrite that sentence.
How to defend against a false-positive flag
- 1Save your draft history — Google Docs and Notion show edit timelines that prove human authorship.
- 2Keep search history and research notes — incidental evidence of the writing process.
- 3Run your work through 3 different detectors. Score variation alone proves unreliability.
- 4Cite academic studies on detector false-positive rates (Liang et al., Stanford 2023; OpenAI's own retraction of its detector in 2023).
- 5Request a manual review — most institutions allow it.
FAQ
- Can detectors prove text is AI?
- No — they output a probability, not a proof. Even at 99 % confidence, there's measurable false-positive risk. They're indicators, not evidence.
- Are the detectors getting better?
- Marginally. The bigger trend is detectors and generators co-evolving — each new GPT release temporarily breaks detection until detectors retrain.
- Does paraphrasing AI text fool detectors?
- Yes, most of the time. Even simple paraphrase tools (Quillbot, Wordtune) drop AI scores by 30–60 %.
- Will Google rank AI-written content lower?
- Not for being AI-written. Yes for being thin, unhelpful or duplicative.
- Are watermark schemes (Microsoft, Google) detectable?
- Yes — both companies embed statistical watermarks in their default models, detectable with the right tool. Most users never use the official UI, so watermark coverage is low.
- Is it ethical to use AI for writing?
- Depends on context. Disclosing AI assistance is becoming standard for academic submissions, journalism and contracted writing.