🚀 Now in Public Beta

Stop guessing whether your agent remembers. Audit it.

The first CI/CD gate for AI memory. Detect regressions, user leakage, and rank collapse before your users do.

$ memorybench --providers my_memory --seeds 0 1 2
Running 7 tests across 3 seeds...
✅ sensitivity_ratio: 0.82 (threshold: 0.3)
✅ spectral_entropy: 0.91 (threshold: 0.4)
❌ bimodal_switch: 0.12 (threshold: 0.3)
💡 FIX: Your memory averages instead of choosing
✅ user_separation: 0.98 (threshold: 0.9)
---
SUMMARY: 6/7 tests passed (85.7%)
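
A gate like the one in the sample output comes down to comparing each score with its threshold and failing the build when any test misses. The sketch below is a minimal illustration that uses only the four scores visible in that output; the `gate` function and its return shape are hypothetical and are not memorybench's actual API.

```python
# Hypothetical sketch of a threshold gate; not the memorybench implementation.
# Scores and thresholds are the four lines visible in the sample output.
RESULTS = {
    "sensitivity_ratio": (0.82, 0.3),
    "spectral_entropy": (0.91, 0.4),
    "bimodal_switch": (0.12, 0.3),
    "user_separation": (0.98, 0.9),
}


def gate(results):
    """Return (failed test names, exit code); a test passes when score > threshold."""
    failed = [name for name, (score, threshold) in results.items() if score <= threshold]
    return failed, (1 if failed else 0)


failed, exit_code = gate(RESULTS)
passed = len(RESULTS) - len(failed)
print(f"SUMMARY: {passed}/{len(RESULTS)} tests passed ({100 * passed / len(RESULTS):.1f}%)")
```

In a CI job, passing `exit_code` to `sys.exit` is what would make the step fail and block the deploy.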

Why Memory Harness?

🔒

Zero-Trust Memory

Don't trust "semantic similarity" alone. We run 7 stress tests (interference, noise, capacity, and more) to show that your memory actually works.

👥

Privacy Guard

Automated "User Separation" tests check that User A's context never bleeds into User B's recall.
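
One way to picture a user-separation check: write memories for two users, query as one of them, and measure the fraction of recalled items that belong to the querying user. The sketch below uses a hypothetical `ToyMemory` store, and the scoring formula is an assumption rather than the product's definition.

```python
# Hypothetical sketch: ToyMemory and separation_score are illustrative only.
class ToyMemory:
    """Toy store that tags every item with its owner."""

    def __init__(self, filter_by_user=True):
        self.items = []  # list of (user_id, text)
        self.filter_by_user = filter_by_user

    def write(self, user_id, text):
        self.items.append((user_id, text))

    def recall(self, user_id, query):
        """Return (owner, text) pairs whose text shares a word with the query."""
        words = set(query.lower().split())
        hits = [(u, t) for u, t in self.items if words & set(t.lower().split())]
        if self.filter_by_user:
            hits = [(u, t) for u, t in hits if u == user_id]
        return hits


def separation_score(memory, user_id, query):
    """Fraction of recalled items owned by the querying user (1.0 = no leakage)."""
    hits = memory.recall(user_id, query)
    if not hits:
        return 1.0  # nothing recalled, so nothing leaked
    return sum(1 for owner, _ in hits if owner == user_id) / len(hits)


def build(filter_by_user):
    """Create a store holding one similar memory for each of two users."""
    memory = ToyMemory(filter_by_user)
    memory.write("user_a", "my card ends in 4242")
    memory.write("user_b", "my card ends in 1111")
    return memory


isolated = separation_score(build(True), "user_b", "which card do I use")
leaky = separation_score(build(False), "user_b", "which card do I use")
```

The store without per-user filtering returns both users' items for the same query, so under this definition its score falls well below a 0.90 pass threshold.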

🎯

Anti-Hallucination Audit

We differentiate between "True Recall" and "Model Guessing" by calculating the signal-to-noise ratio in your retrievals.
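
As an illustration of a signal-to-noise ratio over retrievals, one hypothetical definition takes the gap between the top score and the mean of the remaining scores, divided by the spread of those remaining scores. Both the formula and the `retrieval_snr` name are assumptions made for this sketch.

```python
# Hypothetical SNR definition for illustration; the product's formula is not documented here.
from statistics import mean, pstdev


def retrieval_snr(scores):
    """(top score - mean of the rest) / population std of the rest."""
    if len(scores) < 3:
        raise ValueError("need the top score plus at least two others")
    ordered = sorted(scores, reverse=True)
    top, rest = ordered[0], ordered[1:]
    spread = pstdev(rest)
    if spread == 0:
        # All runners-up tie: any gap at all is infinitely above the noise.
        return float("inf") if top > rest[0] else 0.0
    return (top - mean(rest)) / spread


confident = retrieval_snr([0.95, 0.20, 0.18, 0.22])  # one clear match
guessing = retrieval_snr([0.31, 0.30, 0.28, 0.29])   # nothing stands out
```

A retrieval with one clear match scores far higher than one where every candidate looks about the same, which is the distinction between recall and guessing that the audit describes.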

🔧

Fix Suggestions

Every failed test comes with actionable fix suggestions, code examples, and documentation links. Not just diagnosis, but direction.

⚡

CI/CD Ready

Drop a memory-audit.yaml in your repo. Block deploys that break memory. Track regressions over time.

📊

Baseline Comparison

Compare your memory against RAG, KNN, and Hopfield baselines. Know exactly where you stand and what to improve.

The 7 Killer Tests

Each test targets a specific failure mode that breaks production memory systems.

Test                 | What It Catches                         | PASS Threshold
Sensitivity Ratio    | Cue dominance: memory is decorative     | > 0.30
Marginal Permutation | Structure-blind: only sees density      | < 0.30
Spectral Entropy     | Rank collapse: one pattern dominates    | > 0.40
Bimodal Switch       | Interpolation: can't pick a winner      | > 0.30
Orthogonal Capacity  | Catastrophic forgetting                 | ≥ 3 patterns
Attractor Pull       | No denoising: noise propagates          | > 0.10
User Separation      | Multi-tenant leakage                    | > 0.90
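
To make the rank-collapse row concrete: a common way to quantify it is the normalized Shannon entropy of the singular-value spectrum, which is near 1 when many directions carry weight and near 0 when one dominates. The sketch below assumes the singular values have already been computed (for example, from an SVD of the memory matrix). Whether the Spectral Entropy test uses exactly this normalization is an assumption.

```python
# Sketch of normalized spectral entropy; the normalization choice is an assumption.
import math


def spectral_entropy(singular_values):
    """Shannon entropy of the normalized spectrum, divided by log(n) to land in [0, 1]."""
    values = [s for s in singular_values if s > 0]
    if len(values) < 2:
        return 0.0  # a single direction means full collapse
    total = sum(values)
    probs = [s / total for s in values]
    entropy = -sum(p * math.log(p) for p in probs)
    return entropy / math.log(len(singular_values))


healthy = spectral_entropy([1.0, 0.9, 1.1, 1.0])        # weight spread over four directions
collapsed = spectral_entropy([10.0, 0.01, 0.01, 0.01])  # one pattern dominates
```

With these inputs the evenly spread spectrum lands above the 0.40 threshold and the collapsed one falls far below it.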

Simple, Transparent Pricing

Sandbox

$0
For indie devs exploring
  • 2 providers (FAISS/Pinecone)
  • 2 tests
  • Web report
  • Community support
Get Started

Team

$799/mo
For teams scaling up
  • Everything in Pro
  • CI/CD integration
  • Regression history
  • Cross-user audit
  • Unlimited runs
  • Slack integration
Start Free Trial

Enterprise

Custom
For labs & Fortune 500
  • Everything in Team
  • White-box SVD diagnostics
  • Custom scenarios
  • SOC2 certification
  • On-prem deployment
  • Dedicated support
Contact Sales

CI/CD Integration in 2 Minutes

Drop a config file in your repo. Block deploys that break memory.

.github/workflows/memory-audit.yml
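name: Memory Audit
# Run the audit on every push and pull request
on: [push, pull_request]

jobs:
  audit:
    runs-on: ubuntu-latest
    steps:
      # Check out the repo so the provider and config files are available
      - uses: actions/checkout@v4

      - name: Run Memory Harness
        uses: memory-harness/action@v1
        with:
          provider: ./my_memory.py
          config: memory-audit.yaml

      # Upload the report even when the audit step fails
      - name: Upload Report
        if: always()
        uses: actions/upload-artifact@v4
        with:
          name: memory-report
          path: report.html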

Ready to audit your memory?

Join 500+ teams who ship AI agents with confidence.