The "AI vs. copywriter" debate has been going on since 2022, and it's still mostly jokes. You know, "AI writes in 10 seconds," "but it's generic," "meanwhile, a copywriter spends three days fixing commas." We decided to turn that talk into data. We gave GPT-5, Claude Sonnet 4.5, and an experienced copywriter the same tasks, then ran a blind test with 30 marketers.
Below, you'll find the full experiment report: tasks, methodology, category results, and — most importantly — conclusions for anyone deciding whether to hire a copywriter in 2026 or build their process around AI.
Experiment Setup
Participants:
- GPT-5 — OpenAI's flagship model as of May 2026, available in the Quantium chat.
- Claude Sonnet 4.5 — Anthropic's flagship, a benchmark leader for text, also in Quantium.
- Anna (Copywriter) — 8 years experience, worked with Tinkoff, Yandex.Eda, Skyeng. Doesn't use AI. Fee: ₽5000 for a landing page hero.
Tasks (all participants got the same brief):
- Task 1. Hero section for a fitness app landing page, targeting 50+. Headline, subhead, CTA.
- Task 2. 5-email onboarding sequence for a small business financial management SaaS.
- Task 3. Review article: "How to Choose a CRM for a Real Estate Agency" — 1500 words with practical tips.
- Task 4. Sales post for a premium shoe brand's Telegram channel — 250 words with an offer and CTA.
- Task 5. Meta tags (title and description) for 10 category pages of an electronics online store.
Blind Test: 30 marketers (from middle to head of marketing) evaluated all options against 5 criteria — brief relevance, readability, emotional resonance, conversion potential, and originality — on a scale of 1 to 10. Author names were hidden. Texts were presented randomly.
Task Results
Task 1. Landing Page Hero
| Author | Avg. Score | Time | Wins |
|---|---|---|---|
| GPT-5 | 7.2 | 30 sec | Top 1 for 14 of 30 |
| Claude 4.5 | 6.8 | 45 sec | Top 1 for 7 of 30 |
| Anna (Copywriter) | 6.5 | 90 min | Top 1 for 9 of 30 |
Unexpected result: GPT-5 won on average score. Experts pointed to a sharper, conversion-focused headline ("Power that returns. At any age."). Anna offered a more nuanced, emotional option ("Age is a number. Power is a habit.") which 9 experts liked, but didn't land with the rest.
Task 2. Email Sequence
| Author | Avg Score | Time | Features |
|---|---|---|---|
| Claude 4.5 | 7.5 | 2 min | Best sequence structure |
| GPT-5 | 7.3 | 1.5 min | Strong subject lines |
| Anna | 7.1 | 4 hours | Best tone of voice |
The experiment's closest results. Claude won thanks to its systematic approach: each email logically handles one task, and transitions between emails flow smoothly. Anna lost by just 0.4 points, but experts' comments noted her sequence "sounds like a real human" the most.
Task 3. 1500-word Review Article
| Author | Avg Score | Time | Weaknesses |
|---|---|---|---|
| Anna | 7.8 | 6 hours | Weak SEO structure |
| Claude 4.5 | 7.4 | 3 min | Lacked practical specifics |
| GPT-5 | 7.0 | 2 min | Generic advice in places |
The first clear human win. Experts commented: "You can tell the author knows the industry inside out," "there are insights you won't find on Google's first page." Claude was close in structure and readability but lacked specifics. GPT-5's text was the most "generic," without a strong authorial voice.
Task 4. Sales Post for Telegram
| Author | Avg Score | Time | Features |
|---|---|---|---|
| Anna | 8.1 | 40 min | Best feel for a premium audience |
| GPT-5 | 7.0 | 1 min | Strong CTA, weak style |
| Claude 4.5 | 6.7 | 1.5 min | Too "literary" for premium |
The clearest human win, and by the largest margin. In the premium segment, tonal nuances are crucial: phrase rhythm, verb choice, pauses. Experts noted AI versions "sell," but they don't "seduce." That's the subtle difference a marketer with luxury audience experience hears.
Task 5. Meta Tags for 10 Categories
| Author | Avg Score | Time | Features |
|---|---|---|---|
| GPT-5 | 8.4 | 3 min | Perfect title length |
| Claude 4.5 | 8.2 | 3 min | Strong descriptions |
| Anna | 6.9 | 2.5 hours | Didn't always hit length requirements |
A complete AI win. Meta tags are a technical task with clear rules (title length 50-60 characters, description 150-160). AI nails these parameters on the first try; humans spend time counting characters and rewriting. Plus, AI generates 10 options at once, while a human needs to focus.
Summary Table
| Task | Winner | AI Time | Human Time |
|---|---|---|---|
| Hero Landing Page | GPT-5 (7.2) | 30 sec | 90 min |
| Email Sequence | Claude (7.5) | 2 min | 4 hours |
| Review Article | Anna (7.8) | 3 min | 6 hours |
| Telegram Post | Anna (8.1) | 1 min | 40 min |
| Meta Tags | GPT-5 (8.4) | 3 min | 2.5 hours |
Score: AI — 3, Human — 2. That's just the surface, though. The qualitative takeaways are what really count.
Where AI Wins
- Speed. We're talking 100-200x faster. No human can compete on any task.
- Rule-based technical tasks. Meta tags, fixed-length headlines, platform-specific formatting — AI nails it every time.
- Conversion hooks. Oddly enough, AI is great at generating CTAs. It's trained on thousands of sales texts. Anna's were more subtle, but less "loud."
- Hypotheses. AI spits out 5 headline versions in 2 minutes. Invaluable for A/B tests.
- Long-form structure. Give it a clear brief, and AI builds an article outline better than a human under deadline pressure.
Where AI Loses
- Brand voice nuances. It's about "ear" — AI doesn't yet distinguish between "premium" and "luxury," or "cozy" and "family-friendly."
- Emotional precision. Finding words that truly resonate with a reader is a rare skill, and AI often misses the mark.
- Cultural context. Local humor, current event references, regional specifics — AI only knows them superficially.
- Personal experience and insights. Someone with 5 years in an industry knows things that aren't in training data.
- Solutions in uncertainty. "What if we rethink the whole approach and do the opposite?" — that's intuition, not statistics.
Hybrid Model: What We Recommend
The experiment's main takeaway: the optimal process isn't AI *or* human, but AI *plus* human, in the right sequence. Here's the specific workflow that proved itself in our newsroom:
This process delivers better results than either side working alone: human-level quality, 3-4x faster. A landing page hero section takes 20 minutes with this workflow. Anna would spend 90 minutes. GPT-5 would take 30 seconds (but the quality would be different).
How to Spot AI Text from Human: 2026 Markers
In our blind test, 30 experts guessed the author correctly 62% of the time on average. That's still significantly better than chance, but not perfect. Here are the markers an experienced editor uses to identify AI style in May 2026:
- "Not only X, but also Y." Claude's favorite construction. It appears 3-5 times more often than in human-written texts.
- Triple Listings. AI loves the "A, B, and C" rhythm, even when two elements would suffice. It's a tokenizer's statistical quirk.
- Em Dashes Everywhere. GPT models use em dashes where a human would use a comma or a period. It's a distinct artifact.
- "Critical," "Crucial," "Essential." AI's vocabulary is packed with superlatives, often without specific justification.
- Final Synthesis Paragraphs. AI almost always ends text with a summarizing "So, we've looked at..." — a pattern from training on school essays.
- Confident Passive Constructions. "It can be noted that...", "It's important to understand that..." — AI hides behind impersonal phrasing.
The 2026 paradox: the best AI editors are AIs themselves. If you run an AI draft through the same or another AI with the prompt "remove AI markers, make the text more human," the perceived quality improves by 15-25%. But final human editing still gets the highest scores.
AI vs. Copywriter Cost: The Real Economics
It's tempting to only look at AI generation costs, but that's misleading. You need to factor in the total cost: AI + editing time + task setup time.
| Scenario | Pure AI | Copywriter | AI + Editor |
|---|---|---|---|
| Landing Page Hero | $0.05 | ₽5000 | $0.05 + 20 min editor |
| 5-Email Sequence | $0.15 | ₽15000 | $0.15 + 45 min editor |
| 1500-Word Article | $0.30 | ₽12000 | $0.30 + 90 min editor |
| 10 Meta Tags | $0.10 | ₽5000 | $0.10 + 10 min editor |
When you scale, the economics shift heavily toward a hybrid model. Say a marketing agency generates 50 articles a month for various clients: without AI — 50 × ₽12000 = ₽600 000. With a hybrid — 50 × ($0.30 + 1.5h editor at ₽1500/hour) ≈ ₽115 000. That's over 80% savings, with comparable quality.
But here's the critical detail: the hybrid doesn't work without a skilled editor. If an AI draft gets published without edits, quality tanks to an AI level. That's not saving money; that's creating problems.
Which Formats AI Does Best
Beyond the five experiment tasks, we tested AI copywriting on a dozen other formats. Here's a breakdown of where AI performs reliably and where it needs heavy editing:
AI Works Great (Minimal Editing)
- SEO meta tags (title, description, H1).
- Alt tags for images.
- Product descriptions for online stores, based on templates.
- FAQ blocks with common questions.
- Text translation between languages.
- Summarizing long documents.
- Email subject lines — especially variations for A/B testing.
AI Works Decently (Moderate Editing)
- Landing page hero sections and CTA blocks.
- Onboarding email sequences.
- Social media posts based on a brief.
- SEO articles on standard topics.
- Scripts for video clips and podcasts.
AI Needs Heavy Editing or Isn't Suitable
- Texts requiring unique expertise (opinions, research, case studies).
- Corporate communications in crisis situations.
- Texts for the premium segment and luxury brands.
- Journalism requiring fact-checking and source verification.
- Legal and medical texts.
- Personal stories, narratives, op-eds.
SEO Nuances: Will Google Penalize AI Content?
Businesses' main fear when switching to AI copywriting is losing search rankings. The logic is simple: "Google can detect AI texts and penalizes them." What does Google say, and what's practice showing from 2024-2026?
Google's Official Stance (Search Liaison, February 2023, confirmed in Helpful Content System Update 2024): "We don't differentiate between AI-generated and human content. We evaluate usefulness to the user. AI content can be helpful, or it can be garbage — just like human content."
What does this mean in practice:
- Thin AI content with obvious patterns gets penalized by the Helpful Content System (HCU) or manual review.
- Carefully edited AI content with unique expertise ranks just like human-written content.
- AI content farms churning out hundreds of articles daily consistently drop in HCU updates.
According to major SEO services (Ahrefs, Semrush, Sistrix), big Google updates in 2024-2025 hit "mass-produced" content regardless of its origin. Specifically, an 800-word page with a templated structure has an equally low chance of ranking, whether an AI or a junior copywriter wrote it.
The practical takeaway: you can use AI for SEO content, but with two conditions. First, mandatory editing that adds unique expertise (examples, case studies, screenshots, data). Second, don't churn out hundreds of identical articles — that's a pattern Google recognizes well.
Forecast: What's Next for the Profession
Copywriters won't disappear, but 60-70% of routine tasks will go to AI by the end of 2027. Who'll stay:
- Content strategists. Those who can define tasks, create briefs, and choose a brand's tone of voice.
- Senior copywriters for the premium segment. Where nuance matters more than volume.
- Domain experts writing in their field. Doctors, lawyers, engineers, and practical marketers with expertise and personal experience.
- AI editors. A new role: someone who refines AI drafts for publication.
Who's out: mid-level generalists writing high volumes of templated content. "10 tips on how to…" SEO articles, product descriptions, basic email newsletters — AI will handle almost all of it.
AI Copywriting in Quantium
Quantium's chat offers GPT-5, Claude Sonnet 4.5, Gemini 2.5 Pro, and DeepSeek V3 — all four flagship models being tested for AI copywriting in 2026. Costs: 1 credit for a standard request, 3-5 credits for complex ones with reasoning. With 3000 credits on the Basic plan, that's 600-3000 texts per month.
To optimize your workflow, we recommend our auto-tasks feature. It lets you set up regular generations (like a morning post to a channel from a template) without manually accessing the bot.
Related materials: GPT vs Gemini vs Grok Comparison, Quantium Marketer Case Study, Auto-Tasks for Content, ChatGPT in Telegram.
Try Quantium for Free
20 credits per month on the free plan. 30+ AI models in one Telegram bot.
Open Bot →


