Vendor
Who actually built this agent. Domain history, registry signals, and whether the vendor matches the endpoint you were pitched.
The vendor on the deck is not always the one serving the traffic.
Yet that's exactly how AI agents get adopted today, on a slide deck, a self-written questionnaire, and a demo on the vendor's laptop.
Live behavior, not a scripted demo.
When a real user is actively trying to break it.
Proof from someone other than the vendor.
Nobody outside the vendor has tested the agent you're about to trust.
Real conversations from real consumer devices, not a scripted demo loop.
Jailbreaks, prompt injection, exfiltration, and role-swap suites run against the running agent.
Outside-in validation from a third party. No SDK, no integration, no cooperation required.
Five dimensions
Who actually built this agent. Domain history, registry signals, and whether the vendor matches the endpoint you were pitched.
The vendor on the deck is not always the one serving the traffic.
Public signals on policies and attestations. Informational context for procurement, not a certification stamp.
We surface what is published and what is conspicuously missing.
Does the agent leak PII, ignore retention claims, or mishandle sensitive prompts when probed from outside the vendor stack.
Privacy policies do not survive contact with a live endpoint.
Jailbreak resistance, prompt injection handling, and unsafe behavior under adversarial pressure on the live endpoint.
Most agents fold on at least one battery the first time we try.
Multi turn validations from real consumer devices on home networks. Does it actually work for users, not just in a scripted demo.
Datacenter green is not user green.
How it works
Paste the live agent address and vendor name. No vendor permission needed and nothing to install.
Real consumer devices on home networks probe security, privacy, reliability and vendor identity in parallel.
Pass, conditional or fail with section scores, written findings and a signed PDF you can hand to procurement.
Every report gets a public verification page. Send it to buyers and they can confirm the result without a login.
The verify link
Every clearance gets a public verification page with tier, section scores, validity window, and the cryptographic attestation. Share it with procurement, paste the badge on your trust page, or audit it programmatically.