End-to-end Voice of the Customer analytics pipeline processing real insurance call centre transcripts through LLM extraction, with structured insights stored in ArangoDB and served via a live webhook dashboard.
Proof of Concept — Built in Days
Weekend POC demonstrating production-grade LLM extraction at scale
Voice of the Customer (VoC) analytics platform built as a proof of concept demonstrating how LLMs can extract structured business intelligence from insurance call centre transcripts at scale. The pipeline processes real PII-redacted call transcripts, analyses them through a carefully engineered Gemini 2.5 Flash extraction prompt, stores structured insights in ArangoDB, and serves a live dashboard through an N8N webhook — all with zero external hosting dependencies.
Insurance call centres generate thousands of hours of customer conversations daily. Buried within these calls are critical business signals: frustrated customers about to churn, recurring billing complaints, agents who need coaching support, and policy issues that drive repeated contact. Manual review doesn't scale, and traditional keyword-based analytics miss the nuance of human conversation — sarcasm, subtle frustration, sentiment shifts during a call, and context-dependent meaning.
The pipeline runs as a sequence of N8N workflow stages:
Processed the CallCenterEN dataset from Hugging Face — 92,000+ real-world PII-redacted call centre transcripts. Filtered and curated 500 high-quality insurance conversations from auto_insurance_customer_service_inbound, insurance_outbound, automotive_and_healthcare_insurance_inbound, and customer_service_general_inbound sources.
Personal Proof of Concept
Built as a weekend POC
Mar 2026
AI Engineer & Pipeline Architect
Benchmarked GPT-4.1 Mini vs Gemini 2.5 Flash on identical transcripts. Flash selected for superior nuanced sentiment detection — correctly identifying subtle frustration that GPT-4.1 Mini flattened to neutral.
Extraction prompt with calibrated sentiment examples at each scale level, good/bad key_issues examples, churn criteria, and edge case handling for IVR recordings and cut-off transcripts.
AQL UPSERT pattern prevents duplicates on pipeline rerun. Server-side aggregation queries keep dashboard logic inside the database where it belongs.
N8N Respond to Webhook node serves complete Chart.js HTML — no external hosting, no frontend framework, no CORS issues. Real-time aggregation from ArangoDB on every request.
Pre-LLM check against ArangoDB prevents redundant API calls and cost waste on reprocessing runs. Deterministic document key generation from transcript metadata.
Captures how customer mood evolves during a call — not just a single score. Tracks sentiment shifts with boolean detection and narrative description of the emotional arc.