The "AI vs. copywriter" debate has been going on since 2022, mostly as jokes: "AI writes in 10 seconds," "but it's generic," "meanwhile, a copywriter spends three days fixing commas." We decided to turn that conversation into data. We gave GPT-5, Claude Sonnet 4.5, and an experienced copywriter the same tasks, then ran a blind test with 30 marketers.
Below, you'll find the full experiment report: tasks, methodology, results for each category, and most importantly, takeaways for anyone deciding whether to hire a copywriter in 2026 or build their process around AI.
Experiment Setup
Participants:
- GPT-5 — OpenAI's flagship model as of May 2026, available in the Quantium chat.
- Claude Sonnet 4.5 — Anthropic's flagship, a text benchmark leader, also in Quantium.
- Copywriter Anna — 8 years of experience, worked with Tinkoff, Yandex.Eda, Skyeng. Doesn't use AI for work. Fee: ₽5000 per landing page hero section.
Tasks (all participants received the exact same brief):
- Task 1. Hero section for a fitness app landing page targeting 50+. Headline, subhead, CTA.
- Task 2. 5-email onboarding sequence for a small business financial management SaaS.
- Task 3. Review article "How to Choose a CRM for a Real Estate Agency" — 1500 words with practical tips.
- Task 4. Sales post for a premium footwear brand's Telegram channel — 250 words with an offer and CTA.
- Task 5. Meta tags (title and description) for 10 category pages on an electronics online store.
Blind Test: 30 marketers (from middle to head of marketing) rated all options on 5 criteria (brief relevance, readability, emotional resonance, conversion potential, originality) from 1 to 10. Author names were hidden. Texts were presented in random order.
Results by Task
Task 1. Landing Page Hero
| Author | Average Score | Time | Wins |
|---|---|---|---|
| GPT-5 | 7.2 | 30 sec | Top-1 for 14 of 30 |
| Claude 4.5 | 6.8 | 45 sec | Top-1 for 7 of 30 |
| Anna (copywriter) | 6.5 | 90 min | Top-1 for 9 of 30 |
Unexpectedly, GPT-5 won on average score. Experts noted its sharper, conversion-focused headline ("Strength Returns. At Any Age."). Anna offered a more nuanced, emotional option ("Age is just a number. Strength is a habit.") which nine experts liked but didn't land with the rest.
Task 2. Email Sequence
| Author | Average Score | Time | Highlights |
|---|---|---|---|
| Claude 4.5 | 7.5 | 2 min | Best sequence structure |
| GPT-5 | 7.3 | 1.5 min | Strong subject lines |
| Anna | 7.1 | 4 hours | Best tone of voice |
This was the experiment's tightest result. Claude won thanks to its systematic approach: each email logically completes a single task, and transitions between emails flowed smoothly. Anna lost by just 0.4 points, but her sequence got the most "sounds like a real person" comments from experts.
Task 3. 1500-word Review Article
| Author | Average Score | Time | Weaknesses |
|---|---|---|---|
| Anna | 7.8 | 6 hours | Weak SEO structure |
| Claude 4.5 | 7.4 | 3 min | Lacked practical specifics |
| GPT-5 | 7.0 | 2 min | Generic advice at times |
The first clear human win. Experts commented: "You can tell the author knows the industry inside out," "there are insights not found on Google's first page." Claude was close in structure and readability but lacked specifics. GPT-5 offered the most "general" text, without a strong authorial voice.
Task 4. Telegram Sales Post
| Author | Average Score | Time | Highlights |
|---|---|---|---|
| Anna | 8.1 | 40 min | Best feel for premium audience |
| GPT-5 | 7.0 | 1 min | Strong CTA, weak style |
| Claude 4.5 | 6.7 | 1.5 min | Too "literary" for premium |
The clearest human victory, and the biggest lead. In the premium segment, tonal nuances are critical: sentence rhythm, verb choice, pauses. Experts noted the AI versions "sold" but didn't "seduce." That's the subtle difference an experienced luxury marketer hears.
Task 5. Meta Tags for 10 Categories
| Author | Average Score | Time | Highlights |
|---|---|---|---|
| GPT-5 | 8.4 | 3 min | Perfect title length adherence |
| Claude 4.5 | 8.2 | 3 min | Strong descriptions |
| Anna | 6.9 | 2.5 hours | Didn't always hit length requirements |
A complete AI victory. Meta tags are a technical task with clear rules (title length 50-60 characters, description 150-160). AI hits these parameters on the first try; a human spends time counting characters and rewriting. Plus, AI generates 10 options at once, while a human needs to focus.
Summary Table
| Task | Winner | AI Time | Human Time |
|---|---|---|---|
| Landing Page Hero | GPT-5 (7.2) | 30 sec | 90 min |
| Email Sequence | Claude (7.5) | 2 min | 4 hours |
| Review Article | Anna (7.8) | 3 min | 6 hours |
| Telegram Post | Anna (8.1) | 1 min | 40 min |
| Meta Tags | GPT-5 (8.4) | 3 min | 2.5 hours |
Score: AI — 3, Human — 2. But that's just the surface. The qualitative takeaways are what matter.
Where AI Wins
- Speed. 100-200x. A human can't compete on any task.
- Technical tasks with rules. Meta tags, fixed-length headlines, platform formatting — AI nails it.
- Conversion hooks. Paradoxically, AI generates great CTAs — it's trained on thousands of sales texts. Anna was more subtle, but less "loud."
- Hypotheses. AI spits out 5 headline versions in 2 minutes. Invaluable for A/B testing.
- Long-form text structure. Give it a clear brief, and AI outlines an article better than a human under deadline pressure.
Where AI Loses
- Brand voice nuances. It's about "ear" — AI doesn't yet distinguish "premium" from "luxury," or "cozy" from "family-friendly."
- Emotional precision. Words that resonate exactly where a reader feels them — that's a rare skill, and AI often misses.
- Cultural context. Local humor, timely references, regional specifics — AI's knowledge is superficial.
- Personal experience and insights. Someone who's worked in an industry for 5 years knows things not found in training data.
- "Not only X, but also Y." Claude's favorite phrase. It shows up 3-5 times more often than in human writing.
- Triple lists. AI loves the "A, B, and C" rhythm, even when two items would do. It's a statistical quirk of tokenizers.
- Em dashes everywhere. GPT models use em dashes where a human would use a comma or a period. It's a signature artifact.
- "Critical," "crucial," "essential." AI's vocabulary is packed with superlatives, often without specific reasoning.
- Concluding synthesis paragraphs. AI almost always ends text with a summarizing "So, we've covered..." — a pattern from learning on school essays.
- Confident passive constructions. "It can be noted that...", "It's worth understanding that..." — AI hides behind impersonal phrasing.
- SEO meta tags (title, description, H1).
- Alt tags for images.
- Templated product descriptions for e-commerce stores.
- FAQ blocks with common questions.
- Translating text between languages.
- Summarizing long documents.
- Email subject lines — especially variations for A/B testing.
- Landing page hero sections and CTA blocks.
- Onboarding email sequences.
- Social media posts from a brief.
- SEO articles on common topics.
- Scripts for videos and podcasts.
- Texts requiring unique expertise (opinions, research, case studies).
- Corporate communications in crisis situations.
- Content for premium and luxury brands.
- Journalism that needs fact-checking and source verification.
- Legal and medical texts.
- Personal stories, narratives, opinion columns.
- Thin AI content with an obvious template — gets penalized via the Helpful Content System (HCU) or a manual filter.
- Carefully edited AI content with unique expertise — ranks equally with human content.
- AI content farms churning out hundreds of articles daily — consistently drop in HCU updates.
- Content strategists. Those who can set tasks, create briefs, and choose a brand's tone of voice.
- Senior copywriters for the premium segment. Where nuance matters more than volume.
- Domain experts writing in their field. A doctor, lawyer, engineer, or practical marketer with expertise and personal experience.
- AI editors. A new role — someone who preps AI drafts for publication.
The 2026 paradox: the best AI editors are AIs themselves. Run an AI draft through the same or another AI with the prompt "remove AI markers, make the text more human," and perception improves by 15-25%. Still, a final human edit always gets the highest scores.
AI vs. Copywriter Cost: The Real Economics
It's tempting to look only at AI generation costs, but that's misleading. You need to factor in the total cost: AI + editing time + task setup time.
| Scenario | AI Only | Copywriter | AI + Editor |
|---|---|---|---|
| Landing page hero | $0.05 | ₽5000 | $0.05 + 20 min editor |
| 5-email sequence | $0.15 | ₽15000 | $0.15 + 45 min editor |
| 1500-word article | $0.30 | ₽12000 | $0.30 + 90 min editor |
| 10 meta tags | $0.10 | ₽5000 | $0.10 + 10 min editor |
For high volume, the economics shift heavily toward the hybrid model. Take a marketing agency generating 50 articles a month for various clients: without AI, that's 50 × ₽12000 = ₽600,000. With a hybrid approach, it's 50 × ($0.30 + 1.5h editor at ₽1500/hour) ≈ ₽115,000. That's over 80% savings, with comparable quality.
But here's the crucial detail: the hybrid model doesn't work without a skilled editor. If an AI draft gets published unedited, quality drops to AI-level. That's not savings; it's a problem.
Which Formats AI Does Best
Beyond the five experimental tasks, we tested AI copywriting on a dozen other formats. Here's a summary of where AI performs well and where it needs heavy editing:
AI Works Great (Minimal Editing)
AI Works Decently (Moderate Editing)
AI Needs Heavy Editing or Isn't Suitable
SEO Nuances: Will Google Penalize AI Content?
Businesses worry most about losing search rankings when switching to AI copywriting. The fear is simple: "Google can spot AI text and will penalize it." What does Google say, and what's the practice shown from 2024-2026?
Google's Official Stance (Search Liaison, February 2023, confirmed in Helpful Content System Update 2024): "We don't differentiate between AI-generated and human content. We evaluate usefulness to the user. AI content can be helpful, it can be garbage — just like human content."
What this means in practice:
According to major SEO services (Ahrefs, Semrush, Sistrix), big Google updates in 2024-2025 hit "mass-produced" content regardless of its origin. Specifically: an 800-word page with a templated structure has an equally low chance of ranking, whether an AI or a junior copywriter wrote it.
Practical takeaway: you can use AI for SEO content, but with two conditions. First, mandatory editing that adds unique expertise (examples, case studies, screenshots, data). Second, don't churn out hundreds of identical articles — that's a pattern Google easily recognizes.
Forecast: What's Next for the Profession
Copywriters won't vanish, but 60-70% of routine tasks will go to AI by the end of 2027. What's left:
Who's out: mid-level generalists writing high volumes of templated content. "10 tips for..." SEO articles, product descriptions, basic email newsletters — AI will handle almost all of that.
AI Copywriting in Quantium
Quantium's chat offers GPT-5, Claude Sonnet 4.5, Gemini 2.5 Pro, and DeepSeek V3 — all four flagship models being tested for AI copywriting in 2026. Cost: 1 credit for a standard query, 3-5 credits for complex ones with reasoning. On the Basic plan with 3000 credits, that's 600-3000 texts per month.
To optimize your workflow, we recommend our auto-tasks feature. It lets you set up regular generations (like a morning channel post from a template) without manually going into the bot.
Related materials: GPT vs Gemini vs Grok Comparison, Marketer's Case Study in Quantium, Auto-Tasks for Content, ChatGPT in Telegram.
Try Quantium Free
20 credits a month on the free plan. 30+ AI models in one Telegram bot.
Open Bot →

