← Writing

AI Did Not Make Engineering Cheaper. It Made It Bigger.

A hand-pressed linocut in brick red ink on cream paper: a small bookkeeper bent over a desk, writing in an open ledger, casting an enormous shadow that stretches across the floor several times larger than the figure. The cost the books never capture.

In budget meetings across the industry this year, the question is no longer whether AI coding tools are useful. That part is settled. The question is whether the engineering budget should now come down. One executive points to faster code generation as evidence that it should. Another asks why headcount has not already fallen. The honest answer from the people who run engineering is harder to fit into a spreadsheet: the code is arriving faster, but the review queue, the integration work, the security checks, and the production ownership have all grown.

If AI tools have made writing software dramatically faster, what should the engineering organization look like in a year? Most chief financial officers have a clear answer: smaller, and cheaper. The math behind that answer is straightforward. It is also incomplete in a way that matters.

Where the cost-out math is incomplete

The cost-out framing rests on a model of engineering economics in which writing code is the dominant share of the cost. In any organization that has been running production software for more than a few years, that model is wrong.

The engineering hours in such an organization are consumed by a long list of activities: specifications, design discussions, code review, integration with existing systems, security review, testing, deployment automation, on-call rotation, incident response, vendor management, audit support, technical debt remediation, internal documentation, and the human coordination around all of it. The writing of code itself was perhaps a third to a half of the function. Most of it was somewhere lower.

What AI tools have changed is concentrated in that visible portion. First-draft generation of functions, tests, configurations, scripts, and boilerplate is dramatically faster. Switching between languages and frameworks is faster. Translating well-formed specifications into running code is faster.

Almost everything else about shipping software is unchanged or harder. Code review now has to absorb a higher volume of generated work. Integration with existing systems still requires understanding them. Security review still requires judgment. Production incidents still happen. On-call still requires humans. The portion of engineering work that AI tools have reduced is not the portion that drives most of the cost.

The pattern that has emerged across 2024 and 2025 has been less clean than the original cost-out model promised. Organizations that cut roles tied to routine delivery have often found that review, integration, reliability, and domain-knowledge work became the new bottlenecks. Some rebuilt that capacity within a year, hiring back into adjacent roles the cost model had not counted. The CFO spreadsheet did not have a line item for reviewing capacity. That is part of why the math was incomplete.

The spreadsheet is not wrong because finance is wrong. It is wrong because it measures the visible part of engineering work and misses the capacity that makes software safe to ship. And as the bills are now coming due, it overcounts the savings on the part it did measure.

From production capacity to judgment capacity

The shift underneath all of this is worth naming clearly. AI tools have moved engineering from a production-capacity problem to a judgment-capacity problem. The skill that was scarce two years ago, the speed at which code could be produced, has become cheaper. The skills that were always valuable, but were not the binding constraints, have moved to the front.

The old and new bottlenecks look like this:

Old bottleneck New bottleneck
Writing boilerplate Reviewing generated code
Translating tickets into code Defining the right problem to solve
Knowing syntax and frameworks Understanding systems and constraints
Individual developer throughput Team integration and review capacity
Junior apprenticeship through repetition Deliberate apprenticeship through review, debugging, and ownership
The scarce resource moved from producing code to judging it.

Most engineering organizations were designed for the left-hand column. Most hiring practices, interview loops, performance metrics, and promotion criteria still optimize for the left-hand column. The right-hand column is where the new value lives. The transition from the left to the right is the work of the next two years.

The cost did not vanish. It changed shape.

There is a second shift underneath the first, and it is about cost rather than capacity. The cost-out case assumes that when writing gets faster, the money the organization spent on writing simply disappears. It does not. It changes shape. Work that used to sit on the payroll as a fixed salaried cost is moving onto the compute bill as a variable, metered one. A human engineer costs the same whether they write ten lines that day or ten thousand. An AI agent costs in proportion to what it does, and the heaviest, most productive use is the most expensive. For the first time, engineering output has a meter attached to it.

This is why the pricing of these tools has been unstable, and why it will stay unstable for a while. Flat-rate subscriptions were a bet that usage would stay modest. Usage did not stay modest. As teams move from autocomplete to chat to agents that run for minutes and spawn other agents, consumption per engineer climbs by an order of magnitude, and the flat fee stops covering it. The shift to usage-based billing is not vendors turning greedy. It is the cost surfacing where it was always going to surface, on a meter that rises with output. An organization reading its current subscription line as a stable number is reading a price that is mid-correction.

The right mental model is not a software license. It is cloud. A decade ago, compute moved from a fixed capital purchase to a metered utility, and a generation of engineering leaders learned, usually through one surprise invoice, that a variable cost with no governance is a cost with no ceiling. The discipline that grew up in response, cloud financial management, is about to repeat itself one layer higher. AI-assisted engineering is becoming a metered production input, and it will need the same handling: budgets, visibility by team, routing routine work to cheaper models and reserving expensive ones for the problems that justify them, and a hard line drawn between spend and the outcomes it produces.

The organizations that get hurt over the next year or two will be the ones that skip that discipline. The pattern is already visible in the earliest heavy adopters. Adoption races ahead, the bill races behind it, and when leadership goes looking for the return, it finds usage it cannot tie to a single shipped outcome. That is not a failure of the technology. It is a failure to govern a variable input as a variable input. The near-term future of engineering economics is not cheaper engineering. It is engineering whose cost has a throttle, and whose advantage goes to the leaders who can aim that throttle at outcomes instead of activity.

Where freed engineering capacity should go

The framing that produces better operating decisions starts from a different premise. AI made writing faster. The engineering function therefore has freed capacity. The leadership question is where to put it.

Four places make sense, in priority order.

The roadmap that was previously infeasible. Every engineering organization carries a list of work the business wants and the engineering function has not been able to commit to. Some of it is the work that quietly underpins competitive position, the kind of project that takes two years to pay back and has been deferred for three. Freed capacity is, before anything else, the chance to put that work onto the calendar. This is the growth play and the highest-return use of AI-amplified capacity.

The quality, reliability, and security work that was always underfunded. Every engineering organization also carries a list of things it knows it should be doing and has not had the capacity to do. Hardening services that run a little hotter than they should. Migrating off the database version the vendor stopped patching last year. Replacing the deployment pipeline that breaks once a quarter in ways nobody has time to root-cause. This work is unglamorous and high-yield. AI-augmented capacity is the chance to fund it.

The contractor and outsourcing spend. If the choice is between reducing FTE headcount and reducing contractor and outsourcing spend, reduce the contractor and outsourcing spend first. The institutional knowledge that lives in employees is more expensive to recreate than the throughput that lives in contractors. The cost-out math that was unfavorable to body-count outsourcing two years ago is now favorable to the executive who runs the renegotiation. Outcome-based contracts are negotiable now that were not. The opening is real and unusually wide.

The FTE headcount itself, last. And not by a uniform percentage applied across the engineering function. Where the work itself has actually shrunk, the headcount can shrink. Where the work has merely sped up, the headcount should stay or grow into the freed capacity from the first two priorities.

The blunt summary worth stating once, clearly: AI gives you more software for the same money. It does not give you the same software for less money, unless you are willing to accept that the software you ship will be worse.

Six moves a CXO can make this quarter

The reframing resolves at the operating model level into six concrete moves. None of them require a transformation program. All of them sit within the authority of the executive who would normally own AI strategy.

Measure freed capacity, not just reduced effort. Most organizations are measuring how much faster their developers code with AI tools. That is the wrong measurement. The measurement that matters is how much capacity the function has freed for work that was previously infeasible. Without that measurement, the freed capacity will be invisible to the executive team and will be redirected to whatever fills the calendar by default, which is usually maintenance.

Move capacity toward the strategic backlog, reliability, and security work. The decisions about where freed capacity goes are operating model decisions, not engineering decisions. They belong on the executive agenda. Treating the question as a head-of-engineering question delegates a strategic capacity choice to someone whose mandate is delivery, not portfolio prioritization.

Shift hiring from code production to review, design, and integration. Pause backfills of pure mid-level code-producer roles for two quarters and watch what does not happen. Often, the work that the role was producing is now produced by the remaining engineers with AI assistance. In the cases where the missing work matters, the backfill resumes, with a different specification. Invest the freed budget in senior review and system-design capacity. The hiring market for these skills is brutal, which means the gap is being filled, where it is being filled at all, by internal promotion rather than external hiring.

Protect junior hiring. Redesign the apprenticeship. This is the move most organizations get wrong. Cutting junior hiring solves the short-term cost problem and creates a seven-year capability problem. The senior engineers of 2033 are the junior engineers of 2026. If they are not hired now, or hired and not given a learning path, the senior bench in seven years will be thin. The traditional apprenticeship path, in which juniors learned through hours of writing boilerplate that became muscle memory and then judgment, no longer works because AI writes the boilerplate. The new path requires deliberate exposure to the parts of the work AI is bad at. Reading code at scale. Debugging production incidents, with senior support, but as the named investigator. Owning a small but real piece of a system end-to-end. Working with stakeholders. These exposures do not happen by accident in an AI-augmented team. They happen because a manager decided that they should.

Renegotiate outsourcing from body-count to outcomes. Body-count outsourcing contracts are the most exposed category of engineering spend in the AI moment. The model that paid for a hundred contractors to write routine APIs is now hard to defend, because the alternative is two or three internal engineers with AI tools doing the same work. The negotiating position with body-count vendors is the strongest it has been in a decade. The move is from body-count contracts to outcome contracts: from paying for engineer-months to paying for the delivery of specific systems, at specific quality bars, on specific dates. Vendors are nervous. The opening is real.

Govern AI like metered infrastructure, not a license. Usage-based pricing turns AI-assisted engineering into a variable cost that scales with adoption, which means it needs the cost discipline already applied to cloud. Set budgets and make spend visible by team. Route routine work to cheaper models and reserve frontier models for the problems that need them. Above all, instrument the link between spend and delivered outcomes, because the question that closes every one of these cost overruns is the same one: what did we get for it. An organization that cannot answer that is not measuring its AI investment, it is hoping.

The CFO conversation, one quarter versus three years

The version of the AI conversation where the engineering bill goes down and nothing else changes is the version finance has every reason to push for. It is also a one-quarter conversation. The version that treats AI as freed capacity to be directed deliberately is a three-year conversation. The executive’s responsibility is to keep both framings in the room and to insist on the longer one.

The cost of the shorter conversation is real. The products get worse. The incidents become more frequent. The institutional knowledge erodes. The competitive position quietly degrades over a horizon longer than the current planning cycle. None of it shows up in next year’s budget. All of it shows up in the years after that, in operating outcomes the executive team will not be able to trace back to the cost-out decision that produced them.

The longer conversation is harder to have because it requires defending capacity that finance can credibly argue is no longer needed. The defense is that the capacity is needed for different work than it used to be needed for, and that the different work is what will produce the operating results the executive team is committing to.

What the moment requires

AI changed the scarce resource in software engineering. Code production became faster. Engineering judgment became more valuable, and the cost of production moved onto a meter. The organizations that benefit most from this shift will be the ones that redirect capacity toward strategic backlog, reliability, security, senior review, redesigned apprenticeship, and outcome-based outsourcing, and that govern the new metered cost the way they govern any other production input. The organizations that treat AI as a pure cost-out lever will reduce the very judgment capacity they need to use AI well.

AI increases the amount of software an organization can responsibly attempt. It does this only if leaders preserve and redirect the human judgment that makes software safe to ship. The headcount question is the wrong unit of analysis. The capacity question is the right one. Get the question right, and most of the operating decisions that follow get easier. Get the question wrong, and the next twelve months produce incidents the lean organization cannot catch in review, followed by the realization that the math that started the cycle was wrong.


Sunil Prakash leads enterprise AI and platform architecture work in global financial services. He writes about production-grade AI systems, agent governance, and engineering operating models, and is the author of Agentic AI for Serious Engineers.