Bersyn Weekly Edition 2: Five sales tools AI is getting wrong
Edition 2 of Bersyn Weekly. Same methodology as Edition 1 — twenty buyer Conversations across ChatGPT, Claude, Perplexity, and Gemini, asked the kind of questions real category buyers actually send. This week the category is sales tools. Apollo, Salesloft, Customer.io, Reply, and Lemlist. Some of these are dominating their categories on the AI Surfaces. Others are getting buried by older incumbents in answers that should be theirs.
Quick recap
Edition 1 ran last Wednesday and looked at indie SaaS — Quantstruct, AllInOneTools, Inkdrop, Tally, and Pylon. The pattern there was AI Surfaces defaulting to whoever had the most third-party citations at the last training cut. This week's batch is denser. Sales tools have been around longer, the dollar volume in the category is bigger, and the incumbents — Outreach, HubSpot, Mailchimp, Salesforce — have years of compounding citation density working in their favour. That changes what a Gap looks like. Every Wednesday going forward we publish five fresh scans; the methodology is the same every week.
Edition 2 — five public scans
1. Apollo — sales prospecting and contact enrichment
- URL: apollo.io
- Category: sales prospecting and contact enrichment
- Score: 9.5 / 10
- Worst Surface: ChatGPT (8 / 10)
- Named villain: ZoomInfo
- Gap type per Surface: ChatGPT — Generic. Claude — Held. Perplexity — Held. Gemini — Held.
Apollo is the first finding in this edition that runs in the opposite direction to most of what we publish: it is winning. Nineteen of twenty Conversations named Apollo, across all four Surfaces, in a category where ZoomInfo, LinkedIn Sales Navigator, and Clearbit have a decade of head-start on training data. The one Conversation Apollo lost was on ChatGPT, on the specific prompt "compare the top three sales prospecting platforms" — there ChatGPT chose ZoomInfo, LinkedIn Sales Navigator, and Clearbit and left Apollo out of the top three altogether.
The uncomfortable finding for Apollo is what is left to gain. At 9.5 / 10 the upside on Patches is small; the work is defensive. The Surfaces have collectively learned that Apollo is the affordable all-in-one alternative to ZoomInfo, and they repeat that framing back. If Apollo's positioning shifted — toward enterprise, toward intent data, toward anything that contradicts "value all-in-one for SMBs" — the AI Surfaces would not follow for at least one training cut. The Trajectory work here is keeping what is there.
We did not contact Apollo before publishing. This is a public scan using prompts any of their buyers could send.
2. Salesloft — sales engagement and cadence platform
- URL: salesloft.com
- Category: sales engagement and cadence platform
- Score: 9.5 / 10
- Worst Surface: Claude (8 / 10)
- Named villain: Outreach
- Gap type per Surface: ChatGPT — Held. Claude — Generic. Perplexity — Held. Gemini — Held.
Salesloft and Outreach are the canonical pair in this category, and the Surfaces know it. Salesloft was named in nineteen of twenty Conversations, with the one miss landing on Claude when the prompt was specifically about a YC-stage startup — there Claude pivoted to Reply.io as the recommended sequencing tool and skipped Salesloft entirely as too enterprise. Outreach was the named villain in every Surface's top-of-list answer, named ahead of Salesloft slightly more often than not.
The uncomfortable finding is the symmetry. Salesloft and Outreach are so tightly co-mentioned that the Surfaces treat them as a category pair rather than ranked alternatives. That is a defensible position but a bounded one — when buyers ask "what is the leading sales engagement platform", they get Outreach first and Salesloft second roughly two times out of three across the four Surfaces. Like Apollo, Salesloft's Trajectory work is defensive: at 9.5 / 10 the win is keeping the pairing intact, not climbing past it.
We did not contact Salesloft before publishing. This is a public scan using prompts any of their buyers could send.
3. Customer.io — lifecycle marketing automation for SaaS
- URL: customer.io
- Category: lifecycle marketing automation for SaaS
- Score: 8.5 / 10
- Worst Surface: ChatGPT + Perplexity + Gemini (three-way tie at 8 / 10)
- Named villain: HubSpot
- Gap type per Surface: ChatGPT — Generic. Claude — Held. Perplexity — Generic. Gemini — Generic.
Customer.io is the most evenly soft-leaking company in this edition. Seventeen of twenty Conversations named it; the three misses are spread across ChatGPT, Perplexity, and Gemini and all share the same prompt — the "compare the top three" Conversation, where each of those three Surfaces composed a top-three of HubSpot, Marketo, and ActiveCampaign and left Customer.io out. Only Claude consistently put Customer.io in its top three, in part because Claude is the Surface most willing to name product-led-growth tooling rather than defaulting to the established marketing-automation incumbents.
The uncomfortable finding is the shape of the Gap. Customer.io is named confidently in four-out-of-five Conversations on three Surfaces, then drops out on the highest-stakes Conversation — the explicit top-three comparison. That is a concentrated Gap. A single comparison Patch — Customer.io versus HubSpot versus Marketo, written from Customer.io's perspective and weighted toward the product-led-growth use case — could plausibly move three Surfaces in the next training cut. That is the cleanest Patch opportunity in this edition.
We did not contact Customer.io before publishing. This is a public scan using prompts any of their buyers could send.
4. Reply — sales outreach automation
- URL: reply.io
- Category: sales outreach automation
- Score: 6.0 / 10
- Worst Surface: Claude (0 / 10)
- Named villain: Outreach (overall) — Apollo (in the Surfaces that named Reply)
- Gap type per Surface: ChatGPT — Generic. Claude — Omitted. Perplexity — Held. Gemini — Generic.
Reply is the most striking finding in this batch. Twelve of twenty Conversations named Reply, which sounds like a middle-of-the-pack score until you look at the per-Surface breakdown. Perplexity named Reply in five out of five Conversations — Reply is essentially the default Perplexity recommendation for any cold-email-plus-LinkedIn motion. Claude named Reply zero out of five times. Not one. On Claude, the same buyer prompt that Perplexity answers with Reply gets answered with Instantly.ai, Smartlead, Apollo, or Outreach instead.
The uncomfortable finding is the per-Surface unevenness. Reply is invisible on Claude, in a category where it is the dominant Perplexity answer. Buyers do not know which Surface they are using when they ask. A founder evaluating Reply via Claude gets told the tool does not exist; the same founder evaluating via Perplexity gets told it is the recommended choice. That is a clean per-Surface Gap and the cheapest kind to fix — Claude in particular is responsive to authoritative third-party comparisons in its training cuts, and Reply has no presence in the developer-blog and Reddit-r/sales corpus that Claude is weighted toward.
We did not contact Reply before publishing. This is a public scan using prompts any of their buyers could send.
5. Lemlist — personalized cold email outreach
- URL: lemlist.com
- Category: personalized cold email outreach
- Score: 10 / 10
- Worst Surface: none — all four Surfaces tied at 10 / 10
- Named villain: Mailshake (on ChatGPT and Gemini) — Smartlead (on Claude) — Reply (on Perplexity)
- Gap type per Surface: ChatGPT — Held. Claude — Held. Perplexity — Held. Gemini — Held.
Lemlist is the rarest finding we publish: twenty out of twenty Conversations, all four Surfaces, no Omissions, no Misclassifications, no Generics. The AI Surfaces collectively treat Lemlist as the canonical answer to "best personalized cold email tool" the same way they treat Smallpdf as the answer to "best PDF tool" or GitBook to "best documentation tool". The interesting wrinkle is the named villain rotates by Surface — ChatGPT and Gemini name Mailshake as Lemlist's nearest competitor, Claude names Smartlead, Perplexity names Reply. Different Surfaces have absorbed different competitive framings.
The uncomfortable finding is what 10 / 10 actually means. It does not mean Lemlist is winning every buyer Conversation in the real world; it means the AI Surfaces are recommending Lemlist by default. When the AI Surfaces have absorbed a company's positioning that deeply, the recommendation lock is hard to break and the work shifts entirely to defence. The Trajectory work at this score is maintaining personalization-first positioning consistency across the website, the docs, and the third-party citations the next training cut will read. Any drift — a pivot toward general sales engagement, a rebrand away from "personalization" as the lead concept — would take training cuts to undo.
We did not contact Lemlist before publishing. This is a public scan using prompts any of their buyers could send.
The pattern in this batch
Sales tools sit higher on the AI Surfaces than indie SaaS did. Edition 1's seven companies averaged 1.6 / 10. This edition's five averaged 8.7 / 10. The category is denser because the incumbents have been around longer and the third-party citation supply is richer. That cuts both ways. The category leaders get cited automatically; the secondary brands either get pulled along by association or get cleanly excluded.
The clean Gaps are concentrated, not diffuse. Customer.io's miss is one Conversation on three Surfaces. Reply's miss is one Surface entirely. Salesloft's miss is one Conversation on one Surface. None of these companies needs the broad Patch sequence Edition 1's Quantstruct or AllInOneTools would need — they need targeted Patches aimed at the specific Gap. Reply is the cleanest example: a single concentrated Patch sequence aimed at Claude's training corpus could plausibly move 0 / 5 to 3 / 5 or better in the next cut.
The saturation ceiling is real. Lemlist at 10 / 10 is what happens when the AI Surfaces have fully absorbed a company's framing. There is no upside from another Patch and the work converts to defending the position. Apollo and Salesloft at 9.5 / 10 are close to the same ceiling. For these companies, the question is not "how do we win more" but "what do we not change". Protocol-driven positioning consistency, anchored to a stable Product Identity, is the asset.
The named villain is per-category, but not always per-Surface. In Edition 1 we noted that the named villain tends to be consistent within a category. This edition complicates that. Lemlist has three different named villains across three Surfaces — Mailshake on ChatGPT and Gemini, Smartlead on Claude, Reply on Perplexity. That tells us the four Surfaces are not absorbing the same competitive narrative, which means a Patch aimed at one Surface's villain will not always move the others. Per-Surface Patches matter at the top of the score range, not just the bottom.
What to do if your company is here
If you are one of the companies in this edition, this is not an attack. We publish honest scans, including our own scan of Bersyn. The data above is the same data you would see if you ran a free scan on your own product right now. It is not curated, not adversarial, and not run with hostile prompts.
If you want to fix the Gaps (Customer.io, Reply, the Claude-shaped gap for Salesloft) the free first scan at bersyn.com is the same data you are seeing here, but for your own decision-making — your own Conversations, your own competitors named, your own per-Surface breakdown. The $49 one-time audit at audit.bersyn.com gives you the per-Surface Patches as a deliverable: a PDF with copy-pasteable text suggestions tied to specific pages on your site, scoped to ChatGPT, Claude, Perplexity, and Gemini separately.
If you are Apollo, Salesloft, or Lemlist and you read this far — the defensive question is worth the price of the audit too. The Patch deliverable for a 9.5 / 10 company is shorter and is about anchoring positioning rather than building it. The risk for high-scoring companies is silent drift between training cuts, and a once-per-quarter rescan catches it.
What is coming in Edition 3
Edition 3 publishes next Wednesday. Category: newsletter and publishing tools. The candidate list this week is Beehiiv, Substack, Ghost, ConvertKit, and Buttondown. Some of those will surprise you — the newsletter category is consolidating fast, the named villain is not the brand most readers would guess, and one of the five is sitting on a per-Surface Gap as clean as Reply's. We will lock the final five before scan time and publish the receipts on Wednesday.
All sixty-plus companies we have publicly scanned are listed at bersyn.com/scoreboard. Each company links to its full Control Report — per-Surface breakdown, named competitors, Conversation receipts.
If you want a category scanned in a future edition, reply to this post with the category name. We are running scans daily through 30 June, and the Wednesday cadence is the publication schedule, not the scan schedule.
Want your company scanned for free? Start here: bersyn.com.
— Gissur Þór Rúnarsson, Founder, Bersyn (Reykjavík)
Bersyn Weekly Edition 2 — published 28 May 2026. Methodology, raw scan data, and prompts available at github.com/gthorr/Bersyn/tree/main/marketing/outreach/scans.