Documentation Index
Fetch the complete documentation index at: https://docs.oobo.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
We tested whether accumulated engineering context actually helps AI agents perform better. The answer: yes, significantly. Oobo captures context from every commit — not just the code diff, but WHY it was built, what was tried, and what patterns work. This memory becomes available to agents via MCP tools.Results
60% More Bugs Fixed
Agents with oobo memory resolve 60% more real-world bugs on SWE-bench tasks
75% Win Rate
When memory makes a difference, oobo wins 3 out of 4 contested cases
2.5x More Accurate
On codebase-specific questions, agents answer correctly 2.5x more often with context
Zero False Alarms
When oobo warns about a risky modification, it’s always right — 0% false positive rate
How We Tested
Setup:- Same agent (Claude Sonnet 4) with identical tools in both conditions
- Only difference: one gets oobo’s memory context, one doesn’t
- Real engineering experiences from 12 major open-source Python repositories
- Evaluated on tasks the agent has never seen (strict no-leakage protocol)
What This Means
For Engineering Teams
For Engineering Teams
Every commit your team makes becomes searchable intelligence. When an agent encounters something similar months later, oobo provides the context: which files to look at, what patterns worked, what broke last time.
For AI Agent Performance
For AI Agent Performance
Agents without memory waste iterations exploring dead ends. With oobo context, agents navigate directly to the relevant code — producing fixes they’d otherwise miss entirely. Near-zero cost overhead.
For Code Quality
For Code Quality
Zero false alarms on regression warnings. When oobo surfaces a risk, it’s always based on real past experience — not heuristics or guesses.
Methodology
| Parameter | Value |
|---|---|
| Dataset | SWE-bench (2294 tasks, 12 Python repositories) |
| Model | Claude Sonnet 4 |
| Embedding | OpenAI text-embedding-3-large |
| Search | pgvector cosine similarity |
| Protocol | Strict train/test split, no data leakage |