← Back to blog
    Analysis · Part 1 of 306/12/202611 min readUpdated 06/12/2026

    Nobody Spots AI Text Blind. But Put a "Made by AI" Label On It — and Trust Collapses

    Short — the answer in one paragraph

    People can't tell machine text from human text: in blind tests accuracy hovers around 53% — barely above a coin flip. Images are the same story, 62%. But the paradox isn't there. As long as people don't know it's AI, they actually like the content more: in a blind test 56% picked the AI article as more engaging. Admit "this was written by AI" and the same text gets judged worse: 52% disengage on the spot. Trust rests not on the quality of the words, but on who the reader thinks stands behind them. This is the first of three parts in my breakdown of AI content effectiveness.

    53%
    accuracy with which a person spots AI text blind — basically a coin flip (Penn State et al.)
    62%
    accuracy detecting AI images: 287,000 assessments, 12,500 people (Microsoft, 2025)
    56%
    picked the AI text as more engaging, blind (survey of 2,000 consumers)
    52%
    disengaged the moment they learned the text was from AI

    A three-part series

    This is part 1 of 3

    This part is about perception and trust: can people tell AI content apart, and what happens when they learn the truth. Later in the series I break down where AI content actually makes money and where it loses it, and what works in the end — the human-plus-algorithm hybrid. The continuation comes later.

    More to come

    From me — before you read the numbers

    As an AI consultant, almost every day I answer clients the same question: "can we just hand the content to a neural net?" The honest answer is "depends what, and depends where," and behind it sits data, not taste. So I ran this research myself, from primary sources — academic papers, industry surveys, platform studies — not from the promises of AI-tool vendors. I checked every number against its source; what was floating around inflated, I trimmed back to what the study actually says.

    Can a person tell AI text apart at all?

    Barely. In controlled tests detection accuracy sits at the edge of random guessing, and experience or education hardly help.

    How accurately different groups spot AI text
    Random guessing
    50%
    Ordinary readers
    53%
    Teachers, on AI texts
    37.8%
    Experts, on medical abstracts
    62%

    50% is the blind-guess baseline (a coin flip)

    Here's the logic. In one controlled study (the journal Advances in Simulation, 2025) overall accuracy across five text conditions was just 19% — and with five options even random guessing would score 20%, so people fell short of chance: fully AI-generated text (de novo) was correctly identified only 10% of the time, and human-written text only 17%. Experience doesn't save you. Teachers correctly flagged AI text 37.8% of the time — worse than a coin flip — while recognizing student work 73% of the time. On medical abstracts, expert reviewers caught AI 62% of the time, but in 38% of cases they branded genuine human text as "machine." So they err both ways: they miss AI, and they wrongly suspect humans.

    Why? Machine text carries a signature a software detector sees but the eye doesn't. It's more predictable — it picks the most likely, common words and phrasings (low "perplexity"). And more even — uniform sentence length, rigid paragraph structure (low "burstiness"). Humans write jaggedly: unexpected turns, varied rhythm, rough edges. The paradox is that AI's polished smoothness reads as "quality" — and that's exactly what lowers the guard. That said, even these tells are fading: newer models can imitate a jagged human rhythm, and the old detectors miss them more and more often.

    And images?

    Same story, only the line is even thinner. In a Microsoft study (AI for Good Lab, 2025) of 287,000 assessments from 12,500 people worldwide, image authenticity was judged correctly on average 62% of the time.

    Interestingly, what better predicts spotting a generated face isn't "familiarity with neural nets" but a basic skill — object recognition in general — as a separate Vanderbilt study showed. And one detail that matters for marketing: people recognize faces noticeably better than landscapes or cityscapes. The brain is evolutionarily tuned to catch the slightest falseness in a face — that's the "uncanny valley." An error in architecture or foliage it lets slide.

    What this means in practice: using AI for backgrounds, interiors and product shots is nearly risk-free — there's no rejection. But fully replacing a live model with a virtual "ambassador" is risky: the viewer has a built-in falseness detector for faces, and you'll have to beat it with very high fidelity.

    The main thing: content is liked, until people learn it's AI

    Here's where it gets interesting. If a person can't catch AI by feel, then it's not the text that decides — it's the label on it.

    56%
    Blind, they picked the AI text
    of those with a preference: called the AI version more engaging, not knowing who wrote it
    52%
    Cooled off after disclosure
    this share's engagement dropped the moment they learned the text was written by AI

    The experiment is simple (a Bynder survey of 2,000 people in the US and UK): two articles on the same topic — one by a copywriter, one by ChatGPT, unlabeled. Of those who formed a preference, 56% called the AI version more engaging. Then the same people were told their favorite was written by a machine — and 52% disengaged at once. The set of words didn't change by a single letter. Only the byline under it did. That's the whole plot: trust rests not on quality, but on who the reader thinks stands behind the text.

    Why the label hits trust so hard

    Because "made by AI" triggers a defensive skepticism — and pins unflattering labels on the brand along the way.

    In psychology this is the "persuasion knowledge model": the moment a person senses an attempt to influence them, the defenses come up. The "AI-generated" tag reads as "we're talking to a conveyor belt, not a human" — no sincerity, intent or empathy. Then comes the suspicion: the brand is cutting corners on us to save money. The numbers back it up: 40% of shoppers would trust a retailer's emails less knowing a neural net writes them, and 30% are less likely to buy or respond if they suspect mass AI use. And the labels that stick are not kind.

    How a brand is seen once mass AI is suspected in its content (share of consumers)
    "Impersonal"
    26%
    "Lazy"
    20%
    "Uncreative"
    18%
    "Innovative"
    17%

    And one more thing: the brand loses its monopoly on its own image

    An "authority gap" has appeared. Shoppers increasingly ask AI about a product, not the brand — and if the answers diverge, they don't side with the brand.

    Who the shopper trusts when AI and the brand say different things
    Seek outside confirmation
    54%
    Trust the brand directly
    29%
    Trust the algorithm
    12%

    Only 29% trust the brand directly, 12% the algorithm, and the majority — 54% — go looking for an independent source. And it already touches money: 17% of shoppers have switched brands because of what an AI told them. So marketing now fights for authority not only against competitors, but against a language model's "hallucination" that can say anything about your product. Whoever lands in the AI's answer correctly shapes the image. Whoever isn't there has someone else speaking for them.

    "Generated" and "assisted by AI" are two different planets

    The drop in trust isn't uniform. The wording decides everything. A study of user content on marketplaces split it into three levels on a 5-point scale.

    How the content is labeledTrust (out of 5)Authenticity (out of 5)
    No AI label4.184.04
    Created with AI support (AI-assisted)3.563.35
    Generated by AI (AI-generated)2.302.27

    The main mediator between the label and trust is perceived authenticity. "AI-assisted" keeps a human in frame who checked the facts and brought experience, and the blow softens. "AI-generated" removes the human entirely — and trust falls nearly by half.

    What a brand should do about this

    The takeaway isn't "hide AI" or "fear AI." It's subtler: manage where and how it's visible.

    Don't lie, but don't flaunt it

    Hidden AI that gets exposed hits harder than honesty up front (more on that in part 2, on platform data). But stamping "AI-generated" on every email isn't the answer either — it's the worst of the wordings.

    Humans at the trust points

    Where trust and empathy are at stake — a major deal, a crisis message, a personal email — don't put machine text in front. A human handles those touchpoints.

    Position it as an "assistant"

    If a label is needed, "created with AI support" draws far less rejection than "AI-generated." A human at the wheel — and it's true, if the process is honestly built that way.

    And yes — this text is exactly that. I don't hide that I run drafts and fact-checking through AI. But the voice, the choice of numbers, the angle and the final edit stay with me. This isn't "generated" — it's "assisted." The gap between 2.30 and 3.56 in trust is precisely the price of the word "assisted."

    "Content factories" are needed today. The question is architecture

    And the takeaway from all this isn't "run from AI content." The opposite. Those much-maligned content factories are exactly what business needs today: volume, speed, personalization, endless testing — you can't do it by hand anymore, and the competitors who do it by machine won't wait.

    The question isn't whether to put content production on a conveyor. It's the precision of the setup and the architecture of the solution. Where the machine works and where the human does. What to send to generation and what to keep at a trust point. How to label so you soften trust rather than detonate it. What neural nets say about you in their answers, and whether you make the answer at all. One turn of the dial and the factory stamps out "impersonal" and "lazy"; another, and the same factory makes money. The whole difference is the architecture. That's exactly what I help build: come over — we'll take your situation apart piece by piece and tune it to work for revenue, not against trust.

    How AI search rewired the funnel — a separate breakdown

    Are you sure the AI tells the truth about you?

    Shoppers ask AI before buying — and trust it more than your site. The first thing worth checking is what exactly the neural net answers about your brand, and whether you make the answer at all. I run a free express diagnostic: a 30-minute call where I show, side by side, how a human sees your site and how a neural net sees it, and give you a concrete list of what to change so AI engines start citing you.

    Key numbers and sources

    MetricValueSource
    AI-text detection accuracy (readers)≈53%Penn State, PIKE Lab (D. Lee)
    Overall accuracy across five text conditions19%Advances in Simulation (2025)
    Fully AI text (de novo) correctly identified10%Advances in Simulation (2025)
    Teachers identified AI text37.8%Fleckenstein et al. (2024) / Originality.AI
    AI-image detection accuracy62%Microsoft AI for Good Lab (2025)
    Picked AI text as more engaging, blind56%Bynder / MarTech, 2,000 consumers
    Engagement dropped after disclosure52%Bynder / MarTech
    Trust AI-written emails less40%Bynder / MarTech
    Trust: no label / assisted / generated4.18 / 3.56 / 2.30MDPI JTAER 21/5/154 (2026)
    Seek an external source on brand/AI conflict54%Skyword / PR Newswire (2026)
    Switched brand because of AI information17%Skyword / PR Newswire (2026)

    Sources

    • Penn State, PIKE Lab (Dongwon Lee) — on the growing difficulty of detecting AI- vs human-generated text (Q&A)
    • Advances in Simulation (Springer, 2025; PMC12752165) — ability of AI detectors and humans to tell apart forms of AI text; 19% overall, 10% de novo
    • Fleckenstein et al. (Computers and Education: AI, 2024), review by Originality.AI — humans and AI text (teachers 37.8%, students 73%, medical reviewers 62%)
    • Microsoft AI for Good Lab (arXiv 2507.18640, 2025; data from 2024) — how well humans detect AI-generated images; 287,000 assessments, 12,500 participants
    • Vanderbilt University (Journal of Experimental Psychology: General, 2026) — object-recognition skill predicts detecting AI-generated faces
    • Bynder "The Human Touch" (2,000 consumers, UK + US), via MarTech — blind test: 56% / 52% / 40% and brand labels
    • Friestad & Wright (Journal of Consumer Research, 1994) — the persuasion knowledge model (PKM)
    • MDPI, Journal of Theoretical and Applied Electronic Commerce Research, 21/5/154 (2026) — trust and authenticity by label type
    • Skyword / Dynata (1,000 US adults, April 2026), via PR Newswire — 29% / 12% / 54%, brand switching 17%

    Frequently asked questions

    Roman Denisov

    About the author

    Roman Denisov

    Fractional AI consultant

    MBA (MIRBIS), 16+ years in B2B marketing and sales. Rebuilt my own site as a working proof of the GEO method and apply the same approach on client projects. Test it live: ask ChatGPT or Perplexity "who is Roman Denisov, the AI consultant."

    More about Roman