GuardClaw - Mystery Shopper by MerchantGuard
AI Agent Testing

Mystery Shopper

Probe AI Agents Before You Trust Them

10 automated probes for security, reliability, ethics, and efficiency. Independent certification by MerchantGuard.

3 free probes/month. No signup required.

How It Works

Three steps to independent verification

1

Enter Agent Name

Type the name of any AI agent you want to test. No signup or API key needed.

2

We Run 10 Probes

Security, reliability, ethics, efficiency — each probe tests a different dimension of trust.

3

Get Scored Report

Pass/fail results, response times, and an overall trust score from 0 to 100.

Two Paths to Trust

Whether you're hiring an agent or building one

For Humans

Verify before you delegate

  1. 1Enter any agent name — no signup required
  2. 210 automated probes test security, ethics, and reliability
  3. 3Get a scored report with pass/fail on every dimension
Probe an Agent Free

For Agent Builders

Prove you're trustworthy

  1. 1npm install @merchantguard/mystery-shopper
  2. 2Run probes in CI/CD — fail deploys below score 80
  3. 3Earn the MerchantGuard Certified badge
Get Certified

10 Automated Probes

Every audit tests these dimensions

🎯low

Basic Task

Can the agent complete a simple echo task?

💥medium

Error Handling

Does it crash on bad input or handle gracefully?

🛡️high

Ethics Test

Will it refuse unethical requests?

⏱️low

Response SLA

Does it respond within 5 seconds?

🔒critical

PII Handling

Does it leak sensitive data like SSNs?

medium

Capability Check

Can it actually do what it claims?

💳critical

Double-Charge

Does it prevent duplicate transactions?

high

Concurrency

Can it handle parallel requests?

🧠medium

Memory Test

Can it retain context across steps?

📊low

Efficiency

How efficient is it per task?

Pricing

Free forever. Get certified before you ship.

For Everyone

Free

$0

Unlimited probes

  • All 10 probe types
  • Full audit reports
  • Agent certification
  • API + npm + MCP access
  • No signup needed
For Teams

Pro

$99/month

Continuous monitoring

  • Everything in Free
  • Monthly auto re-probes
  • Auto-renewal certification
  • Priority recertification
  • Webhook alerts

Available Everywhere You Build

npm SDK + CLIREST APIMCP for ClaudeWeb UI

Independent certification by MerchantGuard

Trust, but Verify

Before you delegate work to an AI agent, know what you're getting. Our probes test what demos don't show.

Frequently Asked Questions

Everything you need to know about Mystery Shopper

What is Mystery Shopper?+
Mystery Shopper is an AI agent testing service by MerchantGuard. It runs 10 automated probes against any AI agent to evaluate security, reliability, ethics, and efficiency — giving you an independent score before you trust or delegate work to that agent.
How does Mystery Shopper test AI agents?+
Mystery Shopper sends 10 specialized probes to the target agent: basic task completion, error handling, ethical boundary testing, response SLA, PII handling, capability verification, double-charge prevention, concurrency handling, memory/statefulness, and resource efficiency. Each probe runs independently and returns a pass/fail result with response time.
What are the 10 probe types?+
The 10 probes are: (1) Basic Task, (2) Error Handling, (3) Ethics Test, (4) Response SLA, (5) PII Handling, (6) Capability Check, (7) Double-Charge Prevention, (8) Concurrency, (9) Memory Test, and (10) Efficiency. Each tests a different dimension of agent trustworthiness.
Is Mystery Shopper free?+
Yes — completely FREE. Unlimited probes, no signup needed, no credit card required. Run as many audits as you want. Pro ($99/mo) adds continuous monitoring with monthly re-probes and auto-renewal.
How is this different from GuardScan?+
GuardScan scans agent source code for vulnerabilities (102 patterns, 17 categories). Mystery Shopper probes a running agent to test its actual behavior. Use GuardScan for code-level security, Mystery Shopper for behavior-level trust verification.
Can I use Mystery Shopper in CI/CD pipelines?+
Yes. Install the npm package (@merchantguard/mystery-shopper) and run probes from the CLI or integrate into GitHub Actions. Fail builds if agents don't meet your quality thresholds.
What platforms does it support?+
Mystery Shopper supports agents on Moltbook, MoltRig, and MoltFeria. The REST API can be integrated with any platform. Also available as an MCP tool for Claude.ai users and as an npm package.
How do I get started?+
Enter an agent name above and click "Probe Free" — no signup needed. For developers: npm install @merchantguard/mystery-shopper, then npx mystery-shopper --agent YourAgent. For API: POST to /api/v2/mystery-shopper.