Gaze

What Gaze Does

Line coverage tells you which lines ran. It does not tell you whether your tests actually verified anything meaningful.

A function can have 90% line coverage and tests that assert on nothing contractually important — logging calls, goroutine lifecycle, internal stdout writes — while leaving the return values, error paths, and state mutations completely unverified. That function is dangerous to change, and traditional coverage metrics will not warn you.

Gaze fixes this by working from first principles:

Detect every observable side effect a function produces — return values, error returns, mutations, I/O, channel sends, and more.
Classify each effect as contractual (part of the function’s public obligation), incidental (an implementation detail), or ambiguous.
Measure whether your tests actually assert on the contractual effects — and flag the ones they do not.

Gaze requires no annotations, no test framework changes, and no restructuring of your code. It analyzes your existing Go packages as-is.

Key Metrics

Contract Coverage

The percentage of a function’s contractual side effects that at least one test assertion verifies.

ContractCoverage% = (contractual effects asserted on / total contractual effects) x 100

A function with 90% line coverage but 20% contract coverage has tests that run code without checking correctness. Gaze surfaces the specific effects that have no assertion — the exact gaps you need to close.

Range	Status
80% or higher	Good
50-79%	Warning
Below 50%	Bad

Over-Specification Score

The count and ratio of test assertions that target incidental effects — implementation details that are not part of the function’s contract.

OverSpec.Ratio = incidental assertions / total mapped assertions

Tests that assert on log output, goroutine lifecycle, or internal stdout will break during refactoring even when the function’s actual contract is preserved. Gaze identifies each over-specified assertion and explains why it is fragile.

Count	Status
0	Good
1-3	Warning
More than 3	Bad

GazeCRAP

A composite risk score that replaces line coverage with contract coverage in the CRAP (Change Risk Anti-Patterns) formula.

CRAP(m)     = complexity^2 x (1 - lineCoverage/100)^3 + complexity
GazeCRAP(m) = complexity^2 x (1 - contractCoverage)^3 + complexity

Functions are placed in a quadrant based on both scores:

	Low GazeCRAP	High GazeCRAP
Low CRAP	Safe	Simple but underspecified
High CRAP	Complex but tested	Dangerous

The Dangerous quadrant — complex functions whose tests do not verify their contracts — is the highest-priority target for remediation.

Using Gaze

Gaze integrates with OpenCode through the /gaze slash command. This is the primary way to use Gaze for test quality analysis.

Setup

First, install the Gaze binary:

brew install unbound-force/tap/gaze

Or via Go:

go install github.com/unbound-force/gaze/cmd/gaze@latest

Then scaffold the OpenCode integration in your Go project:

gaze init

This creates 8 files across 3 directories in .opencode/ that wire up the /gaze command, a quality reporting agent, and AI-assisted test generation:

Directory	Files
`.opencode/agents/`	`gaze-reporter.md`, `gaze-test-generator.md`, `reviewer-testing.md`
`.opencode/command/`	`gaze.md`, `gaze-fix.md`, `speckit.testreview.md`
`.opencode/references/`	`doc-scoring-model.md`, `example-report.md`

Notable scaffolded files:

reviewer-testing.md — The Tester Divisor persona for code review. When the review council runs, this agent evaluates test architecture, coverage strategy, assertion depth, and test isolation. It uses Gaze quality data (when available) to ground its findings in concrete CRAP scores and coverage numbers.
speckit.testreview.md — A read-only spec testability analysis command (/speckit.testreview). Evaluates whether a spec’s requirements are testable before implementation begins — checking that acceptance criteria are specific enough to write tests from.

The `/gaze` Command

Inside OpenCode, use /gaze to get AI-assisted quality reports:

/gaze                           # Full report for entire module
/gaze crap ./internal/store     # CRAP scores only
/gaze quality ./pkg/api         # Contract Coverage metrics only

CLI Usage

Gaze can also be run directly from the command line:

gaze report ./... --ai=opencode   # Full report using OpenCode adapter
gaze report ./... --ai=claude     # Full report using Claude adapter

Both --ai=opencode and --ai=claude are fully supported AI backends for gaze report.

CLI Flags

Flag	Commands	Description
`--include-unexported`	`analyze`, `quality`	Include unexported functions in the analysis. Auto-detected for `package main` — CLI tools and entry points are analyzed by default without needing this flag.
`--ai-mapper`	`quality`, `crap`	Enable AI-assisted assertion mapping as a 5th pass (confidence 50). Uses the configured AI adapter to evaluate structurally disconnected assertions.
`--max-gaze-crapload`	`report`	Fail if more than N functions exceed the GazeCRAP threshold. Similar to `--max-crapload` but uses contract coverage instead of line coverage.
`--coverprofile`	`report`	Pass a pre-generated `go test -coverprofile` to skip Gaze’s internal test run. See the Tester guide for the CI pattern.

The `/gaze fix` Command

The /gaze fix command provides AI-assisted test generation to close coverage gaps. It reads quality data — contract coverage gaps and fix strategy labels — and generates Go test functions using the gaze-test-generator agent.

/gaze fix                       # Fix coverage gaps for the module

The command is scaffolded by gaze init (as .opencode/command/gaze-fix.md) and works alongside the /gaze report command. Use /gaze to identify gaps, then /gaze fix to generate tests that close them.

Fix Strategy Labels

Each function in a Gaze report carries a remediation label that tells you how to improve it. These labels appear in both text and JSON output:

Label	Meaning
`add_tests`	Function has no tests — write tests
`add_assertions`	Tests exist but don’t assert on contractual effects
`decompose`	Function is too complex — split it
`decompose_and_test`	Both too complex and undertested

The /gaze fix command uses these labels to determine the appropriate test generation strategy for each function.

Sample Output

Here is what a full Gaze report looks like on a real Go project — gcal-organizer. This sample was generated with an earlier version of Gaze; current output may include additional metrics and formatting improvements.

Overall Health Assessment

Dimension	Grade	Details
CRAPload	🔴 D	40/137 functions (29.2%) above threshold
GazeCRAPload	🟢 A-	5/17 analyzed functions above threshold
Avg Line Coverage	🔴 D	26.2% — 101 of 137 functions have 0% coverage
Contract Coverage	🟡 C	52.9% avg across 17 quality-analyzed functions
Complexity	🟢 B+	Average 4.9, but 40 functions exceed threshold

Top 5 Worst CRAP Scores

Function	CRAP	Complexity	Coverage	File
(*Service).CreateDecisionsTab	650.0	25	0.0%	internal/docs/service.go:460
runBrowserScript	342.0	18	0.0%	cmd/gcal-organizer/assign_tasks.go:237
loadDotEnv	240.0	15	0.0%	cmd/gcal-organizer/main.go:382
(*Service).ListMeetingDocuments	210.0	14	0.0%	internal/drive/service.go:113
(*Organizer).printSummary	156.0	12	0.0%	internal/organizer/organizer.go:227

For a section-by-section walkthrough of the full report — including GazeCRAP quadrants, quality analysis, classification, and prioritized recommendations — see Gaze in Practice.

Architecture

Gaze is structured as a set of focused packages:

Package	Purpose
`analysis`	Side effect detection engine — the core analysis orchestrator
`taxonomy`	Side effect type system with stable IDs and tier classification
`classify`	Contractual classification engine
`config`	Configuration file handling (`.gaze.yaml`)
`loader`	Go package loading wrapper
`report`	JSON and text output formatters
`crap`	CRAP score computation and reporting
`quality`	Test quality assessment — contract coverage and assertion mapping
`docscan`	Documentation file scanner for enhanced classification
`aireport`	AI CLI adapter integration for `gaze report` AI pipeline
`scaffold`	OpenCode file scaffolding for `/gaze` command setup

The analysis engine detects side effects across three implemented tiers (P0, P1, P2), covering the most common and impactful effect types — from return values and error returns through mutations, I/O, channel operations, and more.

Analysis Engine Details

These details are primarily relevant for understanding Gaze’s output and accuracy characteristics:

SSA degradation diagnostics. When SSA (Static Single Assignment) construction fails for a package (e.g., due to upstream toolchain issues), Gaze degrades gracefully instead of failing. The CRAP output includes ssa_degraded_packages listing affected packages, and the text report shows an SSA diagnostics section. Metrics for degraded packages are based on AST-only analysis with reduced accuracy.
AST mutation fallback. When SSA analysis fails, Gaze falls back to AST-based mutation detection for side effect identification. This is transparent to users but may affect accuracy for complex data flow patterns. The degradation is surfaced via the SSA diagnostics above.
no_test_coverage classification. Contract coverage distinguishes between functions that have side effects but no test coverage (no_test_coverage) and functions that have no detectable effects. This separation ensures the fix strategy labels recommend “write tests” rather than “add assertions” for completely untested functions.
Constructor naming signal. Functions with New or NewXxx naming prefixes receive a classification signal that biases toward constructor patterns, improving contractual classification accuracy for factory functions.
Reduced GoDoc signal. GoDoc keywords contribute a reduced signal (+5) to non-matching effect types, preventing documentation-derived classification from overwhelming mechanical signals.
Container unwrap mapping. Assertion mapping handles field access and transformation chains (e.g., JSON unmarshal followed by field access followed by type assertion) at confidence 55, improving mapping accuracy for assertions that verify deeply nested return values.

Current Limitations

Gaze is actively developed. The current scope has known boundaries:

Direct function body only. Gaze analyzes the immediate function body. Transitive side effects (effects produced by called functions) are out of scope for v1.
P3-P4 side effects not yet detected. P0 through P2 are fully implemented — covering return values, error returns, mutations, I/O, channel operations, file system operations, database writes, goroutine spawns, panics, and context cancellation. P3-P4 side effects (stdout/stderr, environment mutations, mutex operations, reflection, unsafe) are not yet detected.
Assertion mapping accuracy is ~84.7% (83/98 mapped assertions, ratchet floor 84.0%). The target is 90%. Accuracy is primarily limited by helper function assertions and testify field-access patterns (tracked as GitHub Issue #6).
No CGo or unsafe analysis. Functions using cgo or unsafe.Pointer are not analyzed for their specific side effects.
Single package loading. The analyze and quality commands process one package at a time. The crap and report commands process the entire module (./...).

Learn More

GitHub Repository — source code, issues, and releases
Why Contract Coverage? — a deeper look at why line coverage is not enough and how contract coverage changes the way you think about test quality

Muti-Mind

Docs

Unbound Force

Title here

Gaze

What Gaze Does

Key Metrics

Contract Coverage

Over-Specification Score

GazeCRAP

Using Gaze

Setup

The `/gaze` Command

CLI Usage

CLI Flags

The `/gaze fix` Command

Fix Strategy Labels

Sample Output

Overall Health Assessment

Top 5 Worst CRAP Scores

Architecture

Analysis Engine Details

Current Limitations

Learn More

Gaze

What Gaze Does#

Key Metrics#

Contract Coverage#

Over-Specification Score#

GazeCRAP#

Using Gaze#

Setup#

The /gaze Command#

CLI Usage#

CLI Flags#

The /gaze fix Command#

Fix Strategy Labels#

Sample Output#

Overall Health Assessment#

Top 5 Worst CRAP Scores#

Architecture#

Analysis Engine Details#

Current Limitations#

Learn More#

What Gaze Does

Key Metrics

Contract Coverage

Over-Specification Score

GazeCRAP

Using Gaze

Setup

The `/gaze` Command

CLI Usage

CLI Flags

The `/gaze fix` Command

Fix Strategy Labels

Sample Output

Overall Health Assessment

Top 5 Worst CRAP Scores

Architecture

Analysis Engine Details

Current Limitations

Learn More