The "AI vs. copywriter" debate has been going on since 2022, and it's still mostly jokes. You know, "AI writes in 10 seconds," "but it's generic," "meanwhile, a copywriter spends three days fixing commas." We decided to turn that talk into data. We gave GPT-5, Claude Sonnet 4.5, and an experienced copywriter the same tasks, then ran a blind test with 30 marketers.

Below, you'll find the full experiment report: tasks, methodology, category results, and — most importantly — conclusions for anyone deciding whether to hire a copywriter in 2026 or build their process around AI.

Experiment Setup

Participants:

  • GPT-5 — OpenAI's flagship model as of May 2026, available in the Quantium chat.
  • Claude Sonnet 4.5 — Anthropic's flagship, a benchmark leader for text, also in Quantium.
  • Anna (Copywriter) — 8 years experience, worked with Tinkoff, Yandex.Eda, Skyeng. Doesn't use AI. Fee: ₽5000 for a landing page hero.

Tasks (all participants got the same brief):

  • Task 1. Hero section for a fitness app landing page, targeting 50+. Headline, subhead, CTA.
  • Task 2. 5-email onboarding sequence for a small business financial management SaaS.
  • Task 3. Review article: "How to Choose a CRM for a Real Estate Agency" — 1500 words with practical tips.
  • Task 4. Sales post for a premium shoe brand's Telegram channel — 250 words with an offer and CTA.
  • Task 5. Meta tags (title and description) for 10 category pages of an electronics online store.

Blind Test: 30 marketers (from middle to head of marketing) evaluated all options against 5 criteria — brief relevance, readability, emotional resonance, conversion potential, and originality — on a scale of 1 to 10. Author names were hidden. Texts were presented randomly.

Task Results

Task 1. Landing Page Hero

AuthorAvg. ScoreTimeWins
GPT-57.230 secTop 1 for 14 of 30
Claude 4.56.845 secTop 1 for 7 of 30
Anna (Copywriter)6.590 minTop 1 for 9 of 30

Unexpected result: GPT-5 won on average score. Experts pointed to a sharper, conversion-focused headline ("Power that returns. At any age."). Anna offered a more nuanced, emotional option ("Age is a number. Power is a habit.") which 9 experts liked, but didn't land with the rest.

Task 2. Email Sequence

AuthorAvg ScoreTimeFeatures
Claude 4.57.52 minBest sequence structure
GPT-57.31.5 minStrong subject lines
Anna7.14 hoursBest tone of voice

The experiment's closest results. Claude won thanks to its systematic approach: each email logically handles one task, and transitions between emails flow smoothly. Anna lost by just 0.4 points, but experts' comments noted her sequence "sounds like a real human" the most.

Task 3. 1500-word Review Article

AuthorAvg ScoreTimeWeaknesses
Anna7.86 hoursWeak SEO structure
Claude 4.57.43 minLacked practical specifics
GPT-57.02 minGeneric advice in places

The first clear human win. Experts commented: "You can tell the author knows the industry inside out," "there are insights you won't find on Google's first page." Claude was close in structure and readability but lacked specifics. GPT-5's text was the most "generic," without a strong authorial voice.

Task 4. Sales Post for Telegram

AuthorAvg ScoreTimeFeatures
Anna8.140 minBest feel for a premium audience
GPT-57.01 minStrong CTA, weak style
Claude 4.56.71.5 minToo "literary" for premium

The clearest human win, and by the largest margin. In the premium segment, tonal nuances are crucial: phrase rhythm, verb choice, pauses. Experts noted AI versions "sell," but they don't "seduce." That's the subtle difference a marketer with luxury audience experience hears.

Task 5. Meta Tags for 10 Categories

AuthorAvg ScoreTimeFeatures
GPT-58.43 minPerfect title length
Claude 4.58.23 minStrong descriptions
Anna6.92.5 hoursDidn't always hit length requirements

A complete AI win. Meta tags are a technical task with clear rules (title length 50-60 characters, description 150-160). AI nails these parameters on the first try; humans spend time counting characters and rewriting. Plus, AI generates 10 options at once, while a human needs to focus.

Summary Table

TaskWinnerAI TimeHuman Time
Hero Landing PageGPT-5 (7.2)30 sec90 min
Email SequenceClaude (7.5)2 min4 hours
Review ArticleAnna (7.8)3 min6 hours
Telegram PostAnna (8.1)1 min40 min
Meta TagsGPT-5 (8.4)3 min2.5 hours

Score: AI — 3, Human — 2. That's just the surface, though. The qualitative takeaways are what really count.

Where AI Wins

  • Speed. We're talking 100-200x faster. No human can compete on any task.
  • Rule-based technical tasks. Meta tags, fixed-length headlines, platform-specific formatting — AI nails it every time.
  • Conversion hooks. Oddly enough, AI is great at generating CTAs. It's trained on thousands of sales texts. Anna's were more subtle, but less "loud."
  • Hypotheses. AI spits out 5 headline versions in 2 minutes. Invaluable for A/B tests.
  • Long-form structure. Give it a clear brief, and AI builds an article outline better than a human under deadline pressure.

Where AI Loses

  • Brand voice nuances. It's about "ear" — AI doesn't yet distinguish between "premium" and "luxury," or "cozy" and "family-friendly."
  • Emotional precision. Finding words that truly resonate with a reader is a rare skill, and AI often misses the mark.
  • Cultural context. Local humor, current event references, regional specifics — AI only knows them superficially.
  • Personal experience and insights. Someone with 5 years in an industry knows things that aren't in training data.
  • Solutions in uncertainty. "What if we rethink the whole approach and do the opposite?" — that's intuition, not statistics.

Hybrid Model: What We Recommend

The experiment's main takeaway: the optimal process isn't AI *or* human, but AI *plus* human, in the right sequence. Here's the specific workflow that proved itself in our newsroom:

Step 1. Brief. A human crafts a clear brief: audience, problem, desired action, tone of voice, essential elements.
Step 2. AI Draft Generation. GPT-5 or Claude generates 3-5 versions based on the brief.
Step 3. Selection & Assembly. A human picks the best parts from the options and assembles the final draft.
Step 4. Fine-Tuning. A human edits it — adjusting the tone, adding nuances, fact-checking, and removing AI markers (common clichés, templated phrases).
Step 5. AI Check. AI runs the final text through for errors, readability, and SEO parameters.

This process delivers better results than either side working alone: human-level quality, 3-4x faster. A landing page hero section takes 20 minutes with this workflow. Anna would spend 90 minutes. GPT-5 would take 30 seconds (but the quality would be different).

How to Spot AI Text from Human: 2026 Markers

In our blind test, 30 experts guessed the author correctly 62% of the time on average. That's still significantly better than chance, but not perfect. Here are the markers an experienced editor uses to identify AI style in May 2026:

  • "Not only X, but also Y." Claude's favorite construction. It appears 3-5 times more often than in human-written texts.
  • Triple Listings. AI loves the "A, B, and C" rhythm, even when two elements would suffice. It's a tokenizer's statistical quirk.
  • Em Dashes Everywhere. GPT models use em dashes where a human would use a comma or a period. It's a distinct artifact.
  • "Critical," "Crucial," "Essential." AI's vocabulary is packed with superlatives, often without specific justification.
  • Final Synthesis Paragraphs. AI almost always ends text with a summarizing "So, we've looked at..." — a pattern from training on school essays.
  • Confident Passive Constructions. "It can be noted that...", "It's important to understand that..." — AI hides behind impersonal phrasing.

The 2026 paradox: the best AI editors are AIs themselves. If you run an AI draft through the same or another AI with the prompt "remove AI markers, make the text more human," the perceived quality improves by 15-25%. But final human editing still gets the highest scores.

AI vs. Copywriter Cost: The Real Economics

It's tempting to only look at AI generation costs, but that's misleading. You need to factor in the total cost: AI + editing time + task setup time.

ScenarioPure AICopywriterAI + Editor
Landing Page Hero$0.05₽5000$0.05 + 20 min editor
5-Email Sequence$0.15₽15000$0.15 + 45 min editor
1500-Word Article$0.30₽12000$0.30 + 90 min editor
10 Meta Tags$0.10₽5000$0.10 + 10 min editor

When you scale, the economics shift heavily toward a hybrid model. Say a marketing agency generates 50 articles a month for various clients: without AI — 50 × ₽12000 = ₽600 000. With a hybrid — 50 × ($0.30 + 1.5h editor at ₽1500/hour) ≈ ₽115 000. That's over 80% savings, with comparable quality.

But here's the critical detail: the hybrid doesn't work without a skilled editor. If an AI draft gets published without edits, quality tanks to an AI level. That's not saving money; that's creating problems.

Which Formats AI Does Best

Beyond the five experiment tasks, we tested AI copywriting on a dozen other formats. Here's a breakdown of where AI performs reliably and where it needs heavy editing:

AI Works Great (Minimal Editing)

  • SEO meta tags (title, description, H1).
  • Alt tags for images.
  • Product descriptions for online stores, based on templates.
  • FAQ blocks with common questions.
  • Text translation between languages.
  • Summarizing long documents.
  • Email subject lines — especially variations for A/B testing.

AI Works Decently (Moderate Editing)

  • Landing page hero sections and CTA blocks.
  • Onboarding email sequences.
  • Social media posts based on a brief.
  • SEO articles on standard topics.
  • Scripts for video clips and podcasts.

AI Needs Heavy Editing or Isn't Suitable

  • Texts requiring unique expertise (opinions, research, case studies).
  • Corporate communications in crisis situations.
  • Texts for the premium segment and luxury brands.
  • Journalism requiring fact-checking and source verification.
  • Legal and medical texts.
  • Personal stories, narratives, op-eds.

SEO Nuances: Will Google Penalize AI Content?

Businesses' main fear when switching to AI copywriting is losing search rankings. The logic is simple: "Google can detect AI texts and penalizes them." What does Google say, and what's practice showing from 2024-2026?

Google's Official Stance (Search Liaison, February 2023, confirmed in Helpful Content System Update 2024): "We don't differentiate between AI-generated and human content. We evaluate usefulness to the user. AI content can be helpful, or it can be garbage — just like human content."

What does this mean in practice:

  • Thin AI content with obvious patterns gets penalized by the Helpful Content System (HCU) or manual review.
  • Carefully edited AI content with unique expertise ranks just like human-written content.
  • AI content farms churning out hundreds of articles daily consistently drop in HCU updates.

According to major SEO services (Ahrefs, Semrush, Sistrix), big Google updates in 2024-2025 hit "mass-produced" content regardless of its origin. Specifically, an 800-word page with a templated structure has an equally low chance of ranking, whether an AI or a junior copywriter wrote it.

The practical takeaway: you can use AI for SEO content, but with two conditions. First, mandatory editing that adds unique expertise (examples, case studies, screenshots, data). Second, don't churn out hundreds of identical articles — that's a pattern Google recognizes well.

Forecast: What's Next for the Profession

Copywriters won't disappear, but 60-70% of routine tasks will go to AI by the end of 2027. Who'll stay:

  • Content strategists. Those who can define tasks, create briefs, and choose a brand's tone of voice.
  • Senior copywriters for the premium segment. Where nuance matters more than volume.
  • Domain experts writing in their field. Doctors, lawyers, engineers, and practical marketers with expertise and personal experience.
  • AI editors. A new role: someone who refines AI drafts for publication.

Who's out: mid-level generalists writing high volumes of templated content. "10 tips on how to…" SEO articles, product descriptions, basic email newsletters — AI will handle almost all of it.

AI Copywriting in Quantium

Quantium's chat offers GPT-5, Claude Sonnet 4.5, Gemini 2.5 Pro, and DeepSeek V3 — all four flagship models being tested for AI copywriting in 2026. Costs: 1 credit for a standard request, 3-5 credits for complex ones with reasoning. With 3000 credits on the Basic plan, that's 600-3000 texts per month.

To optimize your workflow, we recommend our auto-tasks feature. It lets you set up regular generations (like a morning post to a channel from a template) without manually accessing the bot.

Related materials: GPT vs Gemini vs Grok Comparison, Quantium Marketer Case Study, Auto-Tasks for Content, ChatGPT in Telegram.

Q
Quantium Editorial 30+ AI models in one Telegram bot

Try Quantium for Free

20 credits per month on the free plan. 30+ AI models in one Telegram bot.

Open Bot →

Read Also