An end-to-end intelligent recruitment candidate matching system that combines graph database technology with AI-powered CV extraction and dual-retrieval search strategies. Designed specifically for the Australian Defence and Government sector where precision matching and security clearance handling are critical.
Private Client Project
Australian Defence and Government Recruitment Sector — Nov 2025 to Feb 2026
This system was built for the Australian Defence and Government recruitment sector — a domain with strict requirements around security clearances (Baseline, NV1, NV2, TSPV), government experience verification, and domain-specific skill taxonomies that standard recruitment tools simply cannot handle. The result is a production-grade, end-to-end intelligent candidate matching platform combining graph database technology, AI-powered extraction, and a dual-retrieval search engine.
All scoring is deterministic JavaScript — no LLM involved in score calculation, ensuring reproducible and auditable results:
(AQL_score / max_score) × 60 + semantic_similarity × 40 + 5Word-Boundary-Aware Skill Matching
Short skill terms (≤5 characters) use regex word-boundary matching. "Java" matches "Java" but NOT "JavaScript"; "SQL" matches "SQL" and "SQL Server" but NOT "NoSQL". This eliminated the false positives that plagued previous keyword-based approaches.
Core vs Supporting Skill Architecture
Skills are split into two tiers. Core skills form the minimum match gate — candidates must demonstrate these to rank highly. Supporting skills contribute bonus points but cannot substitute for core requirements, preventing generic profiles from outscoring domain-critical specialists.
Dual-Purpose Summary Generation
Candidate summaries are generated with a dual purpose: optimised for human readability while also including equivalent job titles, named technologies, clearance level in prose, and seniority signals — making them effective for both semantic vector retrieval and recruiter review.
Multi-Stage Pipeline Validation (4 Stages)
The validator routes outputs to one of three paths: approve (pass through), revise (regenerate response), or pipeline_error (fix upstream data issue).
Candidates are classified across 13 domain taxonomies tailored to the Australian Defence and Government sector:
Private Client Project
Australian Defence and Government Recruitment Sector
Nov 2025 – Feb 2026
AI/ML Engineer & Solutions Architect
ArangoDB stores candidates, skills, organisations, and clearances as vertices with edges representing relationships — enabling complex traversal queries impossible with relational databases.
Runs AQL graph queries and semantic vector search simultaneously then merges results — capturing candidates that single-method search misses, with a +5pt bonus for dual-match candidates.
All candidate scoring is calculated in deterministic JavaScript Code nodes — not LLM-generated — ensuring 100% reproducible, auditable, and mathematically verified results across all JD types.
Regex word-boundary patterns on short skill terms eliminate false positives — "Java" never matches "JavaScript", "SQL" never matches "NoSQL" — critical precision for technical role matching.
Every LLM response passes four validation stages: data integrity, grounding check, completeness check, and accuracy check — with automated routing to approve, revise, or escalate.
ArangoDB on localhost-only binding, Nginx reverse proxy with SSL/TLS termination, Let's Encrypt auto-renewal, scoped DB permissions, and Fail2ban — production security for Defence sector compliance.
CV Entities Extraction Pipeline
AI-powered extraction of structured fields from unstructured CV documents
ArangoDB Graph Visualisation
Graph showing candidate, skill, organisation, and clearance vertices with relationship edges
Microsoft Teams Chatbot Interface
Recruiter-facing chatbot for submitting candidate search queries directly in Teams
Webhook Chatbot
Alternative webhook-based interface for programmatic search query submission