Top Priority — Checked First
Token Efficiency Assessment
Before examining code quality, agents, or structure, we assess how efficiently your team is spending tokens. Token spending is a primary concern in its own right — not a footnote to code review. Every unnecessary token is a future tax on every interaction: compounding cost, compounding latency, compounding context pollution. We look at prompt design, context window discipline, model selection relative to task complexity, and whether AI is being invoked where simpler tools would suffice. Wasteful token patterns are often the earliest and clearest signal that an AI-assisted practice has drifted from craft to cargo cult.
How AI Is Actually Being Used
Which agents, which models, which workflows, what proportion of the code is AI-authored, and what review discipline sits between generation and merge. We map the real practice, not the policy on paper.
Obvious Risk Surfaces
A targeted scan for the failure modes most common in AI-assisted codebases: hallucinated dependencies, over-permissioned agents, test suites that confirm rather than challenge, and security defaults the agent never thought to harden.
Agent and Tool Configuration
A quick read of the prompts, tools, and permissions granted to any agents in your system. Over-permissioning and runaway autonomy are the cheapest risks to spot and the most expensive to ignore.
Structural Smell Test
A whole-system glance for the telltale signs of incremental agent assistance accumulating without an architectural hand on the tiller: contradictory patterns, redundant abstractions, dead code, drifting conventions.
Tooling and Dashboard Reality Check
If you already own security or governance tooling, we sample whether it is configured to enforce anything meaningful or simply producing reassuring dashboards. A surprising amount of "we are covered" turns out to be unverified on inspection.
Verdict and Recommendation
The deliverable. A short written summary stating whether further work is warranted, which rung of the ladder we would recommend, and — just as importantly — which rungs we would not.