PrivAgent is an efficient AI agent architecture for real-time privacy risk monitoring in sensitive environments. Developed in collaboration with the NHS (UK National Health Service) Sandbox and submitted to ACL, it solves the critical challenge of ensuring AI agents comply with privacy regulations like HIPAA and GDPR at every action — in real time, not as an afterthought.

Key Features

Rule Condensation — An LLM compiler compresses verbose legal texts (HIPAA's 144 articles, GDPR's 99 articles) into ~30-token typed checkers, achieving 10–50x compression.

Two-Pass Short-Circuit Detection — Phase 1 concurrently checks forbidden rules (immediate reject on violation); Phase 2 checks warning rules (all must pass to proceed).

SFT + GRPO Training — Specialized training on Qwen3-8B raises lenient accuracy from 47.5% to 88.8%, with an 8B model outperforming the 70B baseline across precision, speed, and efficiency.

Sub-Second Compliance — Prefix-cache-aware scheduling with sglang RadixAttention achieves 0.48–0.51s per request (vs. 10–14s for the 70B baseline).

Results

  • Accuracy improved by +4.0–9.5% (Medical) and +23.9–24.8% (CARES-18K) over 70B baseline
  • 21–27x faster on the same hardware
  • 46–54% token reduction
  • Latency: 0.48–0.51s/req (baseline: 10–14s)

Collaborators

NHS (UK National Health Service) Sandbox