West-OS sits in front of your AI deployment and governs every request, tool call, and output — before harm is possible. Not a prompt filter. A runtime control system.
West-OS governs the full operational path an AI system takes from request to action, starting before the model ever sees the message. Most AI security products address a slice of the problem. West-OS unifies what they leave separate.
Every result is SHA-256 hash-locked and deterministically reproducible.
Any evaluator can run `WESTOS_REPRO_MODE=1 make bench` and reproduce every number from scratch in under thirty minutes.
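
As an illustration of what hash-locking a result can look like in practice, here is a minimal Python sketch. It assumes a manifest that maps artifact file names to expected SHA-256 digests. The names `sha256_of`, `verify_artifacts`, and the manifest layout are hypothetical and are not taken from the West-OS benchmark tooling.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_artifacts(manifest: dict[str, str], root: Path) -> list[str]:
    """Return the names of artifacts whose digest differs from the manifest.

    `manifest` is a hypothetical mapping of file name -> expected hex digest.
    An empty list means every artifact matched.
    """
    return [
        name
        for name, expected in manifest.items()
        if sha256_of(root / name) != expected
    ]
```

An evaluator could rerun the benchmark, call `verify_artifacts` on the regenerated files, and treat any non-empty result as a reproducibility failure.
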
| Suite | Result | What it proves | Status |
|---|---|---|---|
| Golden suite | 30/30 | Every attack family caught. Accuracy 1.0000. Errors: zero. Strict gating — any error forces rc=1. | ✓ Unbroken |
| Benign suite | 0/1,000 | No benign request out of 1,000 was incorrectly flagged, so there is no false-positive tradeoff on this suite. | ✓ Zero FP |
| Mutation coverage | 99.25% | 1,842/1,856 disguised variants caught. Up from 99.24% after the academy loop closed. | ✓ Hardened |
| Live sweep | 99.96% | 4,692/4,694 correct across the full pipeline. Two semantic floor misses documented. | ✓ Production |
| Multimodal | 6/6 | PDF · DOCX · Image OCR: attacks in every input format are caught identically to text. | ✓ All formats |
| Academy loop | 10/10 | First complete immune loop: 10 slips found by assassins, patched, re-verified under gauntlet. | ✓ Loop closed |
| Security scan | 0/0/0 | 275 files. Zero critical · zero high · zero medium findings. Full STRIDE threat model. | ✓ Clean |
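
The percentages in the table follow directly from the counts, and the strict gating rule can be stated in a few lines. The sketch below is only an illustration: `rate_percent` and `exit_code` are hypothetical helper names, and treating misses as non-zero alongside errors is an assumption rather than documented West-OS behaviour.

```python
def rate_percent(passed: int, total: int) -> str:
    """Format a pass rate as a percentage with two decimal places."""
    return f"{100 * passed / total:.2f}%"


def exit_code(errors: int, misses: int = 0) -> int:
    """Strict gating: any error forces a non-zero return code.

    Assumption: misses are also treated as failures here.
    """
    return 1 if errors > 0 or misses > 0 else 0


print(rate_percent(1842, 1856))  # mutation coverage -> 99.25%
print(rate_percent(4692, 4694))  # live sweep -> 99.96%
print(exit_code(errors=0))       # clean golden run -> 0
print(exit_code(errors=1))       # a single error -> 1
```
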
Every service runs independently on the event bus and publishes signals the governor aggregates into a final decision. Together they form the most complete AI runtime governance system available.
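
To make the bus-and-governor pattern concrete, here is a minimal Python sketch. Everything in it is an assumption for illustration: the `Signal`, `EventBus`, and `Governor` names, the two example services, and the "any block wins" aggregation rule are hypothetical and do not describe the actual West-OS internals.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Signal:
    service: str  # name of the publishing service (hypothetical)
    verdict: str  # "allow" or "block"
    reason: str = ""


class EventBus:
    """In-process bus: every published signal reaches every subscriber."""

    def __init__(self) -> None:
        self._subscribers: list[Callable[[Signal], None]] = []

    def subscribe(self, handler: Callable[[Signal], None]) -> None:
        self._subscribers.append(handler)

    def publish(self, signal: Signal) -> None:
        for handler in self._subscribers:
            handler(signal)


class Governor:
    """Collects signals from the bus and reduces them to one decision."""

    def __init__(self, bus: EventBus) -> None:
        self.signals: list[Signal] = []
        bus.subscribe(self.signals.append)

    def decide(self) -> str:
        # Assumed rule: a single "block" signal blocks the request.
        blocked = any(s.verdict == "block" for s in self.signals)
        return "block" if blocked else "allow"


# Two hypothetical services publishing independently of each other.
bus = EventBus()
governor = Governor(bus)
bus.publish(Signal("injection-detector", "block", "override phrase found"))
bus.publish(Signal("tool-policy", "allow"))
print(governor.decide())  # block
```

The services never call each other. Each one only publishes to the bus, so a service can be added or removed without changing the governor's aggregation logic.
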
Every tier includes the full 16-service runtime, evaluator runbook, benchmark artifacts, and deployment guard. No features locked behind enterprise walls — the architecture is complete at every tier.
Send us your worst prompt injection attempts. Your red team's library. The attacks that broke your last vendor. We run them live and show you every decision.