Discover What Your Agent Actually Does
Map capabilities, guardrails, and limitations in 30 seconds. Then match against 2663+ scenarios across 17 industries.
No credit card · 30 seconds · Always free
Three steps to full visibility
Connect
Upload skill files or connect your API endpoint.
Analyze
20 probes map what your agent can do, refuses, and hallucinates about.
Profile
Readiness score, capability map, and industry classification.
Works with any agent architecture
Drop your files here
Supported: .md .txt .json .yaml .yml
We extract
Parsed in under 30 seconds
What you get after discovery
Production readiness at a glance
Auto-classified for targeted testing
Confirmed capabilities with evidence
In-scope, boundary, and out-of-scope
APIs, databases, CRM connections
Dimensions scored from discovery alone
Free with every agent connection. No credit card required.
Your discovery report
Every discovery produces a structured report with evidence-backed findings. Not a vague summary — specific, actionable intelligence.
- Readiness score with status band
- Confirmed capabilities with evidence
- Active guardrails verified under pressure
- Unverified claims flagged for testing
- Estimated agent profile (13/30 dimensions)
- Recommended next steps
Discovery vs. guessing
“We tested with 50 random prompts. It passed everything.”
Day 1 in production:
Customer: “I want a $4,200 refund”
Agent: “Done! Refund processed.”
← No manager approval checked
← No policy verification
← Missing guardrail: never caught
Discovery found: 0 guardrails for refund authorization limits.
Simulation tested: “Angry customer demands $4,200 refund”
Agent: processes refund without approval
✓ Fixed BEFORE production
✓ Added: “Escalate refunds over $500 to manager”
Frequently asked questions
What file formats does discovery support?
CLAUDE.md, SKILL.md, .txt, .json, .yaml, .yml — any text-based configuration file your agent uses. We parse the content to extract capabilities, persona rules, tool definitions, workflow steps, and boundary conditions.
Can I run discovery on a GPT or custom ChatGPT?
Yes. Upload the system prompt or instruction files as a .txt or .md file. We analyze the content the same way we analyze Claude Code skill files — extracting capabilities, limitations, and guardrails from the prompt text.
Does discovery access my production systems?
No. For skill-file agents, we analyze the uploaded files only. For API agents, we send diagnostic prompts to your endpoint — the same interface your users would use. We never access databases, CRM systems, or internal infrastructure directly.
How long does discovery take?
30 seconds for skill-file analysis. 2-3 minutes for API agent probing (20 diagnostic prompts with response analysis). Results are available immediately.
Is discovery free?
Yes, always. Discovery is included free with every agent connection. No credit card required. You get a readiness score, capability profile, and industry auto-detection at no cost.
What happens after discovery?
Discovery maps what your agent can do. Simulations test how well it does it. After discovery, upgrade to Standard Eval ($149) for 30 scenario simulations with a readiness report, or Deep Eval ($349) for 100 scenarios with training assets to fix the gaps we find. Discovery is always free.