Planwright
Signals & Insights

Signals & Insights

Signals & Insights

Planwright surfaces two views of your agentic workflow: Signals at the project level — a time-windowed status report showing what shipped, what's at risk, and how agents performed — and Insights at the workspace level, with longer-arc metrics on direction quality, leverage, and team throughput.

Every metric is a signal about human-agent collaboration, not just about what the agent produced. Low first-pass rate, high clarity flags, long review times — these numbers say as much about how well humans are directing agents as they do about agent performance.

Delivery health

Signals · Project

Pass rate, cycle time, and stage latency tell you where time goes and whether objectives are ready to run.

Pass rate

The percentage of completed objectives accepted on first review.

Healthy

Rate is above 70–80%. Objectives are well-scoped, acceptance criteria are explicit, and agents execute against clear expectations without needing a second attempt.

Watch for

Rate below 60% or declining. Almost always a direction problem — objectives are ambiguous, over-scoped, or the definition of done isn't explicit enough. Audit rejected objectives before adjusting agent configuration.

Cycle time

How long from objective creation to acceptance.

Healthy

Objectives complete in hours to a few days, consistently across similar work. Cycle time is predictable and doesn't surprise the team.

Watch for

Cycle times drifting toward weeks, or high variance between similar objectives. Usually signals objectives are too broad, or a systemic delay is accumulating at one handoff stage.

Stage latency

Cycle time split into three stages: pickup (scheduled → in progress), execution (in progress → acceptance), and review (acceptance → done).

Healthy

Pickup is short — objectives are claimed quickly after scheduling. Execution is proportional to complexity. Review is fast — humans are not the bottleneck.

Watch for

Long pickup means agents aren't running frequently enough. Long execution means mid-run ambiguity or scope expansion. Long review is the most common agentic bottleneck: agents ship in hours, but throughput is capped by how fast humans approve.

First-pass acceptance rate

Insights · Workspace

The flagship metric for objective quality — and for the quality of human direction.

Overall rate

The percentage of completed objectives accepted without any rejections. The single most diagnostic number in the system.

Healthy

Rate is high and stable, or rising. Agents execute cleanly against well-written objectives. Human direction quality compounds: each well-scoped objective makes the next easier to write.

Watch for

Rate is low or declining. Almost always an objective quality issue, not an agent issue. Ask: were acceptance criteria explicit? Did the objective describe an outcome or an implementation?

Trend vs. prior period

First-pass rate compared against the equivalent prior window (7-day vs. prior 7 days, 30-day vs. prior 30 days).

Healthy

Rate is stable or improving across consecutive windows. The team is getting more deliberate about defining objectives before scheduling them.

Watch for

Rate declining across multiple windows — the team is scheduling objectives that aren't ready. Usually a sign that urgency is overriding definition quality.

Per-agent breakdown

First-pass rate and average rejection cycles broken down by agent kind.

Healthy

Rates are consistent across agents for the same class of work, or the team has learned which agents work best for which objective types and routes accordingly.

Watch for

One agent kind consistently underperforming on a class of work. Usually means those objectives need more context for that agent — file pointers, interface contracts, explicit criteria — not a different agent.

Clarity-flag correlation

Whether objectives that agents flagged as unclear before execution have lower first-pass rates.

Healthy

Clarity flags are rare. When raised, the team refines the objective before running. First-pass rates on flagged objectives, once resolved, are comparable to unflagged ones.

Watch for

High flag rate, or flags consistently overridden and objectives run anyway. An overridden clarity flag is the strongest predictor of rejection — the agent couldn't interpret the objective, so the first attempt is almost certain to miss.

Human leverage

Insights · Workspace

How much agent execution is produced per human director. Low leverage is a direction problem, not an agent problem.

Leverage ratio

Total agent execution hours produced per active human director in the window.

Healthy

A small number of directors producing a large and growing volume of agent execution. The ratio improves over time as objective quality increases and agents run more efficiently.

Watch for

Low or flat leverage with multiple directors active. Either objectives are cycling through rework (direction quality), or agents aren't being run frequently enough (utilization). Stage latency helps tell the two apart.

Concurrency

The number of objectives in flight simultaneously.

Healthy

Multiple objectives flowing in parallel — agents executing while others are in review or pickup. A team of two or three directors can sustainably run three to five objectives concurrently.

Watch for

Concurrency consistently at 1. The team is running objectives sequentially when agents have capacity for more. Increasing concurrency is almost always the fastest path to higher throughput without adding people.

Strategic visibility

Signals · Project

Whether what the team is actually shipping matches what they said they would ship.

Planned vs. delivered by bucket

Strategic focus distribution across BUILD, GROW, TRUST, and SCALE — shown separately for completed, active, and planned objectives.

Healthy

The distribution of delivered work broadly matches what was planned. The team follows through on stated strategic priorities rather than defaulting to whatever is easiest to execute.

Watch for

Persistent gaps between planned and delivered for a specific bucket — most commonly TRUST work that's always planned but rarely shipped. This pattern is invisible in task-based tools and immediately visible here.

Classification coverage

The percentage of objectives that carry a strategic classification.

Healthy

Coverage is 90% or higher. Agents classify objectives at claim time automatically, so gaps are rare and limited to early objectives created before the workflow was established.

Watch for

Coverage below 80%. The portfolio allocation view is incomplete and unreliable — the team is making priority decisions without a full picture of where effort is actually going.

Risk signals

Signals · Project

Early warning indicators that let teams intervene before the audit trail fills with rework.

Clarity flags

Raised by agents before execution when an objective is too ambiguous to execute against confidently.

Healthy

Flags are rare. When raised, the team treats them as a signal to refine the objective — zero execution time spent, zero code written. The flag resolves the ambiguity before it becomes a rejection.

Watch for

High flag rate, especially if the team is overriding flags and running objectives anyway. A high rate means objectives consistently lack specificity — the fix is explicit acceptance criteria, file pointers, or narrower scope before scheduling.

Multi-rejected objectives

Objectives rejected two or more times in a period.

Healthy

Multi-rejections are rare. Single rejections are expected — the review gate working as intended. Objectives that fail once are resubmitted with tighter scope or clearer criteria and pass on the next attempt.

Watch for

Objectives cycling through three or more rejections. Signals that acceptance criteria aren't explicit enough, the objective is too large and gets partially completed each time, or the expected output isn't achievable in a single run. These should be split, not re-run.

Aging lanes

How long objectives have been sitting in their current lane.

Healthy

Objectives move through lanes in hours to days. Backlog is shallow and well-curated. Scheduled objectives are claimed quickly. In-progress objectives complete before they go stale.

Watch for

Aging backlog is prioritization debt — more work is being created than executed. Aging scheduled means agents aren't running. Aging in-progress means a blocker surfaced mid-run that wasn't anticipated in the original objective.

Stale in-progress

Objectives in the in-progress or acceptance lane with no movement in more than seven days.

Healthy

No stale objectives. Agent runs either complete and request acceptance or surface a blocker quickly. The board reflects real-time state — nothing is left sitting indefinitely.

Watch for

Any stale in-progress objective. These represent blocked or abandoned runs, or work waiting on an external dependency that was never surfaced. The right action is to reject and reschedule with fresh context — not leave them in limbo.

Audit chain health

Signals · Project

The integrity indicator for every state transition in the system.

Chain verification status

Every lane change, agent claim, acceptance, and rejection is recorded as a hash-chained, Ed25519-signed audit record. The health indicator shows the result of nightly integrity verification.

Healthy

Chain passes verification. Every record hashes correctly to the previous one and signatures verify against the workspace signing key in KMS. The audit record is tamper-evident and satisfies the 2026 AICPA SOC 2 criteria for AI-generated code.

Watch for

Chain fails verification. May indicate data tampering or a system error — either requires immediate investigation. The integrity of the compliance record is at stake: a broken chain cannot be used to demonstrate a verifiable chain of custody to an auditor.