Performance

Performance metrics

Performance metrics are about quality. Where the Overview tells you how much, this page tells you how well.

In this guide:

Latency: how fast does the bot reply?

Top section:

Screenshot: Performance charts showing latency distributions per model.

If p95 is above 5 seconds, your bot feels sluggish. Common fixes:

Three signals:

Improve rate — flags per 100 conversations from Improve responses. Higher = bot wrong more often.
CSAT correlation — average CSAT for bot-only vs. agent-handled conversations. Useful for justifying the bot’s value.
Refusal rate — % of replies that say “I don’t know.” Healthy is 5–15%; over 25% means you need more knowledge.

Resolution rate — % of conversations closed without escalation.
Escalation rate — % handed to humans.
Reopen rate — % of solved conversations the customer reopens within 7 days. High reopens = answers weren’t actually solving.

A great chatbot:

These are guidelines, not laws — your customer mix matters.

For escalated conversations:

First response time — bot escalation to first agent message. SLA target configurable per plan.
Full resolution time — escalation to closed.
SLA hit rate — % of conversations meeting the SLA. Color-coded green/yellow/red.

Configure SLAs in Settings → SLAs. Track misses to identify staffing gaps.

Each metric supports drill-down:

Look at p95, not just p50. Half your users have a worse experience than the median.
Improve rate trends over weeks. Don’t react to a daily spike.
Cross-reference accuracy with knowledge changes. A jump in improve-rate after a Guidelines edit means your edit hurt; revert.