stonewall.dev
Back to Blog
AI product-management spec-driven-development cursor

Cursor Changed How We Code. Nothing Changed How We Spec.

Stonewall · · 12 min read

The 10x Engineer Building the Wrong Thing

Cursor made the 10x engineer real. Claude Code made it absurd. A single developer can now scaffold a full-stack app, write tests, deploy infrastructure, and iterate on feedback loops that used to take a team of five — all in an afternoon. If you're reading this, you've probably felt it. The cursor for product managers conversation started because engineers realized the bottleneck moved.

But here's the part that keeps nagging: a 10x engineer building the wrong feature is a 10x waste. And right now, the "what to build" question is still answered the same way it was in 2019 — a Google Doc, a gut feeling, and a meeting that could have been an email.

The bottleneck didn't disappear. It moved upstream.

What AI Coding Tools Actually Changed

Let's give credit where it's due. The shift in developer tooling over the past two years has been genuinely transformative.

Stack Overflow's 2025 developer survey found that 84% of developers use or plan to use AI coding tools, with 51% using them daily. That's not hype — that's infrastructure. Cursor, Claude Code, Copilot, and the rest have collectively done something remarkable: they compressed the distance between intent and implementation.

A senior engineer can now hold an entire feature in their head, describe it conversationally, and watch working code materialize. Junior developers are shipping at mid-level velocity. Solo founders are building products that look like they have a team of ten.

Vibe coding went from meme to methodology. The implementation layer got fast.

84% Devs using AI tools
51% Using them daily
$192.7B AI startup VC (Q3 2025)

And AI startups captured 52.5% of global venture capital through Q3 2025 — $192.7 billion — largely on the premise that this acceleration would compound. It has. But only on one side of the equation.

The Part Nobody Talks About

Here's what a typical product development cycle looks like in 2026:

Step 1: Someone writes a PRD in Notion or Google Docs. Maybe they use ChatPRD to generate a first draft — and to be fair, tools like ChatPRD have proven real demand here, with over 100,000 users paying $15/month to generate specs from chat. That's a signal worth paying attention to.

Step 2: The PRD gets reviewed in a meeting. Half the context is verbal. Decisions are made in Slack threads that nobody bookmarks.

Step 3: An engineer opens Cursor, reads the PRD once, and starts building. Within four hours, the implementation has diverged from the spec in ways nobody will notice for two weeks.

Step 4: The PRD sits in a docs folder, slowly rotting, while the codebase becomes the de facto source of truth.

Sound familiar?

The spec layer — the place where "what to build and why" gets decided — hasn't had its AI moment. We have AI that can write code from natural language. We don't have AI that can tell you whether the code you're about to write is the right code to write.

There's no tab-complete for product decisions. No copilot for prioritization. No agent that reads your codebase, understands your board, and says "this feature contradicts what you shipped last week."

If you're an engineer writing PRDs today, you already know the pain of speccing in a vacuum. The tools assume you're either a full-time PM with enterprise workflows or a developer who doesn't need specs at all. Neither is true.

YC Noticed

In their Spring 2026 Request for Startups, Y Combinator explicitly called for a "Cursor for Product Managers." Partner Andrew Miklas framed it directly: upload your customer data, ask "what should we build next?", and get feature outlines with reasoning.

The fact that YC put this in writing matters. It's an acknowledgment from the most influential startup accelerator in the world that the spec layer is a real, unsolved problem — not a nice-to-have, but a missing piece of the AI development stack.

YC Spring 2026 RFS: Partner Andrew Miklas explicitly called for a "Cursor for Product Managers" — upload customer data, ask "what should we build next?", get feature outlines with reasoning. The PM software market is valued at $5.1-30.7B and growing 7-10% annually through 2033.

The product management software market is valued somewhere between $5.1 billion and $30.7 billion depending on how you draw the boundaries, growing at 7-10% annually through 2033. The money is there. The problem is there. The timing is there.

But the framing needs adjustment.

Where the "Cursor for PMs" Framing Gets It Wrong

Calling this a "PM tool" is like calling GitHub a "developer tool." Technically true. Misses the point entirely.

Product managers are one user of this system. But the people who need it most are the engineers-turned-founders, the solo developers, the two-person teams where one person is simultaneously the PM, the designer, the engineer, and the support team. The people who are 10x-ing their code output but still spending their Mondays in planning purgatory.

The real need isn't a chat interface that generates PRD documents. It's a stack layer — something that sits between user signal and AI code execution, maintaining the connective tissue that currently exists only in someone's head.

GitHub's engineering blog put it well in September 2025:

"The specification becomes the source of truth and determines what gets built." GitHub Engineering, September 2025

Not the Jira ticket. Not the Slack thread. Not the meeting notes. The spec.

But the spec has to actually work for that to be true. And right now, specs are static documents that start dying the moment they're written. If you've ever come back to a PRD after a week of implementation, you know exactly why your current PRDs stop working after 48 hours.

What "Living Specs" Actually Means

The concept of a "living document" has been a PM cliche for a decade. Every team says their PRDs are living documents. Nobody's PRDs are actually living documents. They're documents that lived once and are now in various stages of decomposition.

A genuinely living spec would need to do things that no current tool does:

Read the codebase. Not just sit next to it — actually parse it. Understand the data model, the API surface, the existing patterns. When you say "add a notification system," it should know you already have a NotificationPreference entity and a half-built WebSocket layer.

Stay connected to the board. When a ticket moves to "done," the spec should know. When a ticket gets descoped, the spec should update. The board and the spec should be the same system, not two systems that someone manually keeps in sync.

Flag drift. When a pull request changes something that contradicts the spec — different API shape, missing edge case, architectural deviation — that should surface before the PR merges. Not in a retrospective three sprints later.

Update itself. When code ships, the spec should reflect what actually got built, not what was planned six weeks ago. The spec becomes a record of intent AND reality, not just intent.

This isn't theoretical. Each of these capabilities is technically feasible today. The reason nobody's built it is that it requires bridging three competencies that rarely exist in the same team: PM tooling, developer tooling, and AI infrastructure.

The Closed Loop Nobody's Built

Here's the architecture that's missing from the market:

Intent Spec Board Code Drift Check Spec Update feedback loop

Intent — You describe what you want to build. Not in a blank document, but in a dialog. The AI asks clarifying questions. It reads your codebase to understand what exists. It pushes back on scope. It surfaces edge cases you haven't considered.

Spec — The conversation produces a structured specification. Not a wall of text — a machine-readable document with clear acceptance criteria, technical constraints, and scoping decisions.

Board — The spec automatically generates work items. Not a manual copy-paste into Linear or Jira — a bidirectional link where the board IS the spec, decomposed into executable chunks.

Code — Engineers build against the spec. AI coding tools can reference it directly. The spec is context, not a separate tab you forgot to open.

Drift Detection — A GitHub App watches pull requests as they come in. It compares what's being built against what was specced. When reality diverges from intent, it flags it — before merge, not after.

Spec Update — When the drift is intentional (scope changed, better approach found), the spec updates. The loop closes.

Every existing tool handles one, maybe two of these steps. Linear is excellent at the board layer — over 18,000 paying customers would agree — but it tracks work, not intent. There's no spec connection. Notion AI gives you a powerful document layer with 100 million users, but it has no native code awareness. The AI features are horizontal, not purpose-built for the spec-to-code loop.

The gap isn't in any single step. It's in the connections between them.

The Consolidation Wave Is a Signal

If you've been paying attention to the PM tools space, you've noticed a pattern. Cycle got acquired by Atlassian and sunsetted in October 2025. Kraftful was absorbed by Amplitude in July 2025. Chisel merged into Pendo. Airfocus joined Lucid.

This isn't random consolidation. It's what happens when point solutions can't sustain themselves independently. A tool that only does user feedback analysis, or only does prioritization frameworks, or only does roadmapping — it's useful but not defensible. The incumbents absorb the feature and move on.

The surviving startups in this space will be the ones that own a complete loop, not a single node. The ones that are opinionated enough to say "here's how specs should work in an AI-native workflow" rather than "here's a flexible platform that can do anything."

What This Means for You

If you're an engineer who also wears the PM hat — and in 2026, that's most engineers at companies under 50 people — the tools are about to get radically better. The current state of "AI PRD tools" is roughly where AI coding tools were in early 2023: useful but primitive, and about to go through a rapid maturation cycle.

The practical implications:

Your specs should be closer to your code. Not literally in the codebase (though that's not crazy), but connected to it. If your AI coding tool can't reference your spec, you're leaving context on the table.

Your board should generate from your spec, not the other way around. Most teams create tickets first and rationalize them into a spec later. That's backwards. The spec is the reasoning; the board is the execution plan.

Drift is the silent killer. The gap between what was specced and what got built is where most product quality issues hide. If you're not tracking that gap, you're navigating blind. Every team we've talked to has this problem. Almost none of them have a system for catching it.

If you're building in this space, the window is open. YC's RFS is a green light. The consolidation wave cleared out weak players. AI infrastructure (embeddings, code parsing, streaming agents) is mature enough to build on. But the window won't stay open forever — the next 18 months will determine whether this becomes a startup category or an incumbent feature.

The Spec Layer Will Get Built

The question isn't whether someone builds the connective tissue between "what to build" and "how to build." The demand is too obvious, the gap too wide, the market too large. AI product management isn't a niche — it's the missing half of the AI development stack.

The question is whether it gets built by a team that thinks in code — that understands what it feels like to context-switch between a Notion doc and a terminal, to lose two hours because the spec didn't mention an edge case, to ship something that technically matches the ticket but misses the point — or whether it gets built by an incumbent that thinks in slide decks and enterprise sales cycles.

The spec layer is missing. We're building it.
Stonewall's AI PRD writer reads your codebase, asks scoping questions, and generates living specs. The board auto-generates. Drift detection is coming soon. Free to start.
Try the free AI PRD writer

Related Posts