Chatbot Resolution Rate vs Deflection Rate

A chatbot vendor showed you a number. Maybe it was 90% deflection, maybe 80%, maybe a clean round "we handle most of your support." It looked great on the slide, so you nodded. Then, before you signed, a small voice asked: most of what, exactly?

That voice is right. The difference between chatbot resolution rate vs deflection rate is the single most useful thing to understand before you pay for an AI chatbot, and it is the one thing most vendor pages are careful not to spell out.

Here is the short version. Deflection rate counts how many conversations the bot handled without passing them to a human. Resolution rate counts how many of those conversations actually solved the customer's problem. Those are not the same thing, and on a real dashboard they can sit 50 points apart.

For a new Shopify store, the gap is not academic. A "deflected" conversation can be a shopper who got their answer and bought something. It can also be a shopper who got a wrong answer, gave up, and closed the tab. Both look identical in a deflection number, and only one of them keeps your store.

This post defines both metrics in plain terms, shows why a high deflection rate can hide a bad customer experience, gives you the four numbers worth tracking instead, and ends with the one question that tells you whether a vendor's percentage is real. No jargon you have to look up.

Resolution rate, deflection rate, and the sneaky third one

Three words get used as if they mean the same thing. They do not, and getting them straight is most of the battle.

Deflection rate is the share of conversations the bot handled without escalating to a human, regardless of what happened to the customer. If someone chatted with the bot and did not then open a ticket or email you, that counts as deflected. A shopper who gave up in frustration and a shopper who got a perfect answer are both "deflected."

Containment rate is a slightly higher bar. It counts conversations that ended inside the bot without the customer asking for a human. It excludes the people who hit "talk to an agent," but it still does not prove the customer got what they needed. A bot can contain a conversation by giving a vague answer that simply stops the customer from escalating.

Resolution rate is the real one. It counts conversations where the customer's actual problem was solved. The order status was found. The return was started. The sizing question was answered correctly. This is the number tied to whether people come back and buy. Zendesk's own documentation draws the same line: a true automated resolution is one a system verifies actually closed the loop, not one it assumes because the chat ended.

A simple way to picture it

The three numbers stack. Every resolved conversation is contained, and every contained conversation is deflected. But the reverse is not true, and that is exactly where the trouble lives. A high deflection number can be padded with conversations that were never resolved and never even properly contained.

Diagram showing deflection rate, containment rate, and resolution rate as nested layers

Why a high deflection rate can hide a bad customer experience

Here is the uncomfortable part. A chatbot with a 90% deflection rate can have a resolution rate closer to 40%, according to a 2026 analysis of AI support deployments. The dashboard shows the 90. The 40 is the part you feel three months later.

The reason has a name: false deflection. A conversation gets marked as deflected the moment the customer stops talking to the bot, even if they stopped because the bot was useless. When one support team audited real deployments, it found that 15% to 25% of "deflected" tickets had been deflected with wrong or incomplete answers. The customer left. Their problem did not.

Why this is worse in ecommerce

In a B2B help desk, a frustrated customer files another ticket, so you see the re-contact. In a Shopify store, a frustrated customer just leaves. They abandon the cart. They do not write in to tell you the bot failed them. The conversation gets counted as a win while the sale walks out the door.

That silent exit costs more than one order. A shopper who had a bad support experience is not going to leave you a glowing review, and reviews are part of how the next shopper decides whether to trust you. Weak support quietly feeds fewer reviews and weaker trust signals, which feeds fewer sales. None of it shows up in a deflection number.

This matters right now because nearly everyone is rushing in. A 2026 Gartner survey found 91% of service leaders are under pressure to add AI, and more than 80% of organizations expect to cut support headcount within 18 months. When that many teams lean on automation that fast, the metric you trust is the difference between a bot that helps and a bot that quietly costs you customers.

Flow showing one chat counted as deflected whether resolved, escalated, or abandoned

The four metrics worth tracking instead

If deflection is the number not to trust on its own, what should you actually watch? Four metrics, and none of them is hard to understand.

1. Confirmed resolution rate

The percentage of conversations where the customer's problem was actually solved, confirmed by the customer or by a check of the transcript, not assumed because the chat ended. This is the closest thing to a single honest score.

2. CSAT (customer satisfaction)

A one-tap thumbs up or down, or a 1 to 5 rating, at the end of the chat. Aim for 80% or higher. A high containment rate sitting next to a low CSAT is the clearest warning that your bot is ending conversations by wearing people out, not by helping them.

3. Escalation and handoff quality

When the bot cannot help, does it hand the customer to you cleanly, with the conversation history attached? Or does it dead-end them in a loop, typing "agent, agent, agent" with no way out? A bot tuned only for deflection will avoid handing off, because every handoff dents the number. That is the opposite of what you want.

4. Re-contact rate within 24 to 48 hours

Of the customers the bot "resolved," how many came back about the same thing within a day or two? A low re-contact rate is the best proof a resolution was real. A high one means the bot closed conversations it never actually finished.

Metric	What it tells you	A healthy signal
Confirmed resolution rate	Problems actually solved, not just chats ended	Trending upward over time
CSAT	Whether customers liked the experience	80% or higher
Escalation and handoff quality	How cleanly the bot passes you a customer it cannot help	Fast, with chat history attached
Re-contact rate (24 to 48 hours)	Whether a "resolved" issue stayed resolved	Low and stable

Simple dashboard showing resolution rate, CSAT, handoff quality, and re-contact rate

What a good number actually looks like for a small store

Now the question every owner actually wants answered: what is a good number?

Honest answer first. It depends on your catalog, your return policy, and how messy your questions are. A store with clear shipping rules and a tight catalog will see higher numbers than one selling custom or made-to-order goods. Anyone who quotes you a single guaranteed percentage without asking about your store is selling you the number, not the result.

That said, here are real ranges to anchor on. Older rule-based bots tend to resolve only 10% to 30% of what they touch. Modern AI agents that can take a real action, like looking up an actual order, resolve far more. The direction of travel is steep: Gartner expects agentic AI to autonomously resolve about 80% of common service issues by 2029. The point is not the headline figure. It is that the industry is finally being measured on resolving problems, which is the number you should care about too.

For CSAT, 80% and up is the bar, and ecommerce satisfaction scores tend to run a little above that. For re-contact, lower is better, full stop.

Why chasing 90% deflection is a trap

A 90% deflection target sounds like a goal. In practice, pushing deflection that high usually means the bot is refusing to hand off, which is exactly the behavior that frustrates the customers who need real help. The stores that win are not the ones with the highest deflection. They are the ones whose customers got what they came for and came back.

The one question to ask any chatbot vendor

If you remember one thing from this post, make it this question. Ask any chatbot vendor, before you sign:

"When you quote that percentage, what exactly are you counting, and can you show me the transcripts behind it?"

A vendor who is proud of their resolution will answer in one breath: here is how we define a solved conversation, here are real chats, here is the re-contact rate. A vendor leaning on deflection will get vague. They will talk about "engagement" and "automation" and steer away from showing you actual conversations. The dodge is the answer.

In twelve years watching this play out, the pattern holds. The number is only as honest as the definition under it, and the definition is only as honest as the transcripts.

Four follow-ups worth asking

If you want to go one level deeper, ask these. How do you define a "resolved" conversation, in plain words? What is your re-contact rate within 48 hours? When the bot cannot help, what exactly happens to that customer? And can I read ten real transcripts, including the ones that went badly?

The last one matters most. Anyone can show you the good chats. The bad ones tell you how the tool behaves when your customer is already annoyed. If a vendor will not show you transcripts, that is useful information on its own. You can always send me the details and I will tell you what the number probably means.

Who is actually reading the transcripts

Here is the thing every dashboard hides: a number means nothing if no one reads the conversations behind it. This is the real split in the chatbot market, and it has nothing to do with the logo on the software.

Self-serve software

You pay roughly $39 to $279 a month, you get a dashboard, and you are on your own. The tool will happily report a deflection rate. Whether that rate is real, whether the bot is quietly failing on your top three questions, whether your handoff is broken: that is all on you to catch, in time you do not have.

Managed service

You pay more, and part of what you pay for is a person who actually reads the transcripts, finds the conversations that went sideways, and fixes the bot's answers. That weekly review is the entire point. It is the difference between a number on a screen and a bot that gets better.

The cost gap is smaller than it looks once you count resolution. Industry estimates in 2026 put an AI-handled resolution at well under a dollar, against several dollars for one a human handles. That gap only pays off on conversations that were genuinely resolved. A cheap tool with a flattering deflection number can cost you more than a managed bot that solves real problems, once you add back the lost carts and the support you still end up doing yourself.

This is how the Studio Niza AI chatbot service is built: trained on your products, integrated with Shopify, and monitored every week so someone is reading the chats, not just screenshotting the dashboard. The honest math behind the $99 a month is that it includes that monitoring time. A bot you do not maintain drifts. A bot someone watches improves.

Wrapping up

Resolution rate vs deflection rate comes down to one idea: deflection counts conversations the bot got rid of, while resolution counts problems the bot solved. A vendor who leads with deflection is showing you the flattering number. The honest one is harder to fake and worth far more.

So before you pay for any chatbot, do three things. Ask what the quoted percentage actually counts. Ask to see real transcripts, including the bad ones. And once a bot is live, watch resolution, CSAT, handoff quality, and 48-hour re-contact instead of staring at deflection.

You do not need to track all four from day one. If you are just starting, watch confirmed resolution and CSAT, and skim a handful of transcripts each week. That alone puts you ahead of most stores, which never look past the headline number.

A chatbot can be one of the highest-leverage tools a small Shopify store owns. It answers shipping questions at 2am, recovers carts, and frees you from the same ten emails. But that only happens when it resolves, not just deflects. The number on the demo slide is not the goal. A customer who got what they came for, and came back, is.

When you'd rather not audit this yourself

The Studio Niza AI chatbot service is built on resolution, not deflection. Custom-trained on your products, integrated with Shopify, and monitored weekly so someone is actually reading the transcripts. Setup from $599, $99/month all-in.

See how it works →

Or email contact@studioniza.com if you have a specific question about your store. I read every one.

Frequently asked questions

If you're still unsure after reading these, just send the question.

What is a good resolution rate for an ecommerce chatbot? +

It depends on your catalog and how complex your questions are. Older rule-based bots often resolve only 10% to 30% of what they touch, while modern AI agents that can look up real order data resolve much more. Watch your own number trend upward over time rather than chasing someone else's benchmark.

What is the difference between containment rate and deflection rate? +

Deflection rate counts every conversation the bot handled without a human, including ones where the customer gave up. Containment rate is a slightly higher bar that excludes customers who asked for an agent, but it still does not confirm the problem was solved. Neither one proves resolution, which is why you track resolution and CSAT alongside them.

Can a chatbot have a high deflection rate and still be bad? +

Yes, and this is the core trap. A conversation counts as deflected the moment the customer stops talking to the bot, even if they left with no answer. A bot can post a 90% deflection rate while resolving closer to 40% of problems, so a high deflection rate alone tells you very little.

How do I check if my chatbot's deflection number is real? +

Read the transcripts and watch the re-contact rate. If a large share of resolved customers come back about the same issue within 24 to 48 hours, the resolutions were not real. Pairing the deflection number with CSAT and a weekly transcript review is the fastest way to catch a bot that is failing quietly.

What CSAT score should a Shopify chatbot aim for? +

Aim for 80% or higher. Ecommerce satisfaction scores tend to run in the low 80s, so anything well below that signals customers are leaving conversations unhappy. A high containment rate sitting next to a low CSAT is a clear sign the bot is ending chats by frustrating people rather than helping them.

Does a small Shopify store really need to track all of these metrics? +

No, not at the start. If you are early, watch confirmed resolution rate and CSAT, and skim a handful of real transcripts each week. You can add handoff quality and re-contact rate once you have steady chat volume.