Skip to content

Public Sector / Policing

AI-powered call classification for a UK police constabulary

Joint engagement with PwC to transcribe, redact, and classify police 101 (non-emergency) calls at scale — building a PII redaction pipeline, hierarchical taxonomy, and LLM-based classification system achieving 85% accuracy.

85%

Classification accuracy (top-3)

5 weeks

End-to-end delivery

2 layers

PII redaction pipeline

Auto

Self-healing taxonomy expansion

The challenge

A UK police constabulary needed to understand the thematic content of their 101 (non-emergency) call volume at scale. Calls were unstructured audio with no systematic classification, making it impossible to identify patterns, allocate resources effectively, or measure changes in public demand over time.

The data was highly sensitive — containing personal details, crime reports, and vulnerable caller information — requiring robust PII handling before any analysis could begin.

Our approach

We built a layered PII redaction pipeline combining Deepgram transcription-level redaction with AWS Comprehend as a secondary protection layer, aligned with metadata including call handler teams, call duration, and hashed caller IDs.

For classification, we developed a hierarchical taxonomy (sub-themes, themes, categories) and built an LLM-as-a-Judge classifier that could select from the established taxonomy or propose additions via a self-healing mechanism with schema-validated suggestions. To bootstrap classification at scale without exhaustive manual labelling, we generated representative call language for each theme and applied cosine similarity matching via vector embeddings.

The outcome

The system achieved 60% accuracy on primary theme classification and 85% including secondary classifications (top-3), with category-level accuracy significantly higher.

The bootstrapping approach was extended to detect crime scene preservation language patterns and call handler abuse indicators. Findings were presented to constabulary leadership. A thought leadership paper was co-authored with PwC and presented at the Transforming Police Public Contact conference.

More case studies

HR Technology

Production AI pipeline processing thousands of conversations weekly

Designed and deployed a 6-stage AI insight pipeline that processes 1,000+ structured conversations per week, generating actionable workforce intelligence for enterprise clients.

Hospitality / SaaS

Unified data platform and AI chatbot for restaurant operations

Built an end-to-end data platform with near real-time ingestion and an agentic AI chatbot that lets restaurant operators query financial and operational data in natural language.

Internal R&D

Pantheon: agentic CRM replacing multiple SaaS tools

Built a full end-to-end agentic CRM that consolidates contacts, opportunities, projects, timesheets, invoicing, and communications into a single platform — with AI agents that autonomously manage email, phone, and calendar workflows.

Internal R&D

Pitou: automated business development and opportunity sourcing

Built a browser automation platform that identifies, evaluates, scores, and applies to contract opportunities across multiple job boards — with CRM integration for end-to-end pipeline management.

Telecommunications

Analytics platform strategy for a Tier 1 telecoms operator

Fixed-scope discovery engagement to assess current analytics capabilities, design a target-state architecture, and deliver an implementation roadmap for a developer marketplace portal.

Enterprise / Retail

Global data unification for a Fortune 500 retailer

Led a global HR and recruitment data unification programme across tens of source systems and multiple regions, consolidating three departments and delivering $2.5M+ in annual savings. Delivered by our founding principal at a previous employer.

Have a similar challenge?

Tell us about your challenge. We will respond within one business day.

Facing Something Similar?