Monarchic LLC logo Monarchic LLC

RepoIntel MCP

Repository
Intelligence Bench

RepoIntel is a repository intelligence MCP runtime for indexing code, symbols, evidence records, generated wiki pages, and change-aware retrieval. This page reports the current RepoIntel-Bench run across localization, explanation, patch planning, patch generation, and review.

Current RepoIntel Run

Overall score

0.9979

Tasks

59

Zero-score tasks

0

Review

1.0000

Localization

0.9938

Explanation

1.0000

Patch plan

1.0000

Patch generation

1.0000

RepoIntel-Bench

Score by Task Type

Task Type Tasks Mean Score Interpretation
Review 5 1.0000 Seeded defect findings are detected with strong file and citation coverage.
Localization 20 0.9938 Source-aware evidence ranking recovers target files and high-value symbols across fixture and OSS repos.
Explanation 14 1.0000 Evidence-backed answers have strong file and citation support in the strict retrieval path.
Patch plan 10 1.0000 Plans are evidence-backed with high file and citation coverage.
Patch generation 10 1.0000 A constrained semantic Python-service patcher emits diffs that apply and pass public and hidden tests.

Score by Repository

fixture-small-api

51 tasks

0.9975

express

2 tasks

1.0000

flask

2 tasks

1.0000

fastapi

2 tasks

1.0000

requests

2 tasks

1.0000

What This Measures

The benchmark combines generated fixture tasks with pinned public open-source tasks from FastAPI, Express, Requests, and Flask. It scores file recall, symbol recall, citation quality, patch application, hidden tests, minimality, seeded review findings, and task-type specific metrics.

Current Ceiling

The strict public-corpus run is strong across review, explanation, localization, patch planning, and the current Python-service patch tasks. Patch generation should still be read narrowly: it is a constrained semantic patcher, not a broad autonomous coding system.

Sources and Scope

  • RepoIntel artifact: local RepoIntel-Bench public run with 59 tasks and evaluated patch outputs.
  • Latest verified score: 0.9979 from a pinned public-corpus RepoIntel-Bench run.
  • Runtime note: benchmark corpus and adapter state are stored on the large data disk; private local calibration repos, prompt-path hints, and fixture patch templates are excluded from this public score. Patch generation uses a constrained semantic Python-service patcher.

RepoIntel results are benchmark-scoped. The current score should be read as a RepoIntel-Bench adapter result, not a general claim about all repository-intelligence workloads. A clean fully indexed run can update this page when the benchmark environment is pinned for publication.

Product Link

Use RepoIntel

The product page connects this RepoIntel-Bench result to the hosted MCP plan, Developer Bundle inclusion, and checkout path.

Open Product