Will AI replace a software agency?

No. AI lowers the floor — almost anyone can get a prototype fast today. But it does not replace the last 30% of a product: edge cases, security, performance, maintainability and accountability in operation. That distance between a demo and a system you can rely on is exactly the work an agency is for. AI makes a slow agency redundant, not a good one.

What is the 70% problem with AI code?

The 70% problem, described by Addy Osmani, says AI gets you to roughly 70% of a working app very fast, but the last 30% — hardening, edge cases, security, maintainability — needs real engineering. The first impression is misleading because the demo looks finished while the hardest part is still missing.

Can't we just vibe-code our product ourselves with AI?

For a prototype or an internal tool that is often a good idea. For a product that customers and data are entrusted to, it gets risky: in an August 2025 survey, 16 of 18 CTOs reported production incidents caused by unreviewed AI-generated code. Without review, tests and architecture, the demo stays a demo.

Does AI make software development cheaper?

AI clearly lowers the cost of the first 70% and speeds up routine work. The last 30% — security, scaling, tests, maintainability — does not get cheaper, it gets more important. An AI-native agency passes the speed of the first 70% on to you and invests the time it saves into exactly the quality that makes a product reliable.

How do I tell an AI-native agency from a vibe-coder?

A disciplined agency uses AI visibly as an accelerator, but also shows you its review process, its tests, and its security and architecture decisions. It talks about the last 30%, not just the fast demo, and takes contractual responsibility for operation and maintenance.

Will AI make developers obsolete?

The data points to a shift rather than replacement: per the DORA 2025 report, around 90% of respondents use AI at work, and AI amplifies productivity — but only where tests, version control and fast feedback exist. The demand is for more judgment, not less. Routine work shifts to AI; responsibility for the outcome stays with people.

Contact

All posts

KIJune 29, 20267 min read

Will AI replace your software agency? Why the opposite is true

Since vibe coding, almost anyone can build a working demo in an afternoon. So the honest question is: why still hire an agency? The answer: AI lowers the floor — and raises the bar for production at the same time. An AI-native agency closes that gap faster than ever.

Hauke Rux

CEO, Project Manager

Share

7 min read

Since Andrej Karpathy coined the term “vibe coding” in February 2025, almost anyone can assemble a working demo in an afternoon — by voice prompt, without writing a single line. That is impressive, and it is the right way to start a prototype. The obvious conclusion is still often wrong: “If AI can do this, why do we still need an agency?”

The honest answer is uncomfortable for any slow agency and good news for everyone else: AI lowers the floor radically — and at the same time raises the bar for a production-ready product. The distance between a demo and a system you entrust with customers and data has not shrunk. It has become more visible. And that is exactly where senior engineering lives.

Key takeaways

AI lowers the floor: in a controlled GitHub study, developers completed a scoped task 55.8% faster with Copilot.
The 70% problem: AI gets you to 70% of an app fast; the last 30% — edge cases, security, maintainability — stays engineering work.
Trust is falling: 66% of developers are frustrated by “almost right” AI code, and for around 45% debugging AI code takes longer than writing it (Stack Overflow 2025).
Production reality: in an August 2025 survey, 16 of 18 CTOs reported production incidents caused by AI-generated code.
DORA 2025: AI amplifies a team (+21% tasks completed, +98% pull requests merged) — but without tests, version control and fast feedback, delivery stability drops.
Bottom line: AI removes the excuse for a slow agency, not the need for a good one.

What AI made easy — and what it didn't

AI radically sped up producing a first working version — but not building a product. That is not a contradiction, it is a clarification. In a controlled GitHub study, developers solved a scoped programming task 55.8% faster with Copilot (1h11 instead of 2h41). The DORA 2025 report confirms the pattern at scale: AI is associated with more tasks completed (+21%) and substantially more pull requests merged (+98%). Its central thesis is sober and important: AI amplifies what is already there.

That is exactly why the picture is more nuanced than the demo videos suggest. In a randomized METR study, experienced open-source developers working on their own, well-known repositories were actually 19% slower with AI — while believing they were faster. AI helps most where context is missing and least in code someone already knows deeply. The lesson for a product: speed is context-dependent, and the demo is the easy part.

The 70% problem

AI gets you to roughly 70% fast — the last 30% is the actual software engineering. Google engineer Addy Osmani described this as the 70% problem, and it matches what we see in projects every day. The first 70% — the visible UI, the happy path, the first features — comes together with AI surprisingly fast. The last 30% is invisible and expensive: error handling, edge cases, security, performance under load, accessibility, maintainability, and the question of whether the system is still changeable in two years.

Split bar: 70 percent of a working demo comes fast with AI, the lime-highlighted last 30 percent stands for edge cases, security, scaling and maintainability. — The 70% problem: the demo is done fast, the hard remainder is engineering. After Addy Osmani, 2025.

The catch: those 30% are not 30% of the effort — they are often the bulk of it. A demo that works in the living room is not the same as a system that survives 10,000 users, invalid input, traffic spikes and attackers. Treating the 70% as “almost done” means planning past the actual project.

From demo to production: the real distance

The distance between a convincing demo and a system you entrust to customers is exactly where senior engineering lives. And that distance now shows up in the numbers, too. Per the Stack Overflow Developer Survey 2025, 84% of developers use or plan to use AI tools — but trust in their accuracy has fallen: only about a third trust the accuracy, and 46% actively distrust it. The most telling figure: 66% are frustrated by “almost right” AI answers, and for around 45% debugging AI code takes longer than writing it themselves.

“Almost right” is more expensive in production than “obviously wrong”, because it slips past shallow tests. That shows up in incidents: in an August 2025 survey, 16 of 18 CTOs reported production incidents caused by AI-generated code. The DORA report names the mechanism behind it: AI adoption has a negative relationship with delivery stability — unless strong testing, clean version control and fast feedback absorb the added speed. Speed without discipline produces instability, not value.

Build it yourself with AI, or hire an AI-native agency?

The honest counter-question isn't “AI or agency” — it's “who reliably closes the last 30%”. The objection “we'll just build it ourselves with AI” is legitimate — and for a prototype, an internal tool or market validation it is often the right call. That is exactly what vibe coding is good for. It gets risky the moment the prototype is meant to become a product, one that customers, revenue and personal data are entrusted to.

Comparison: building it yourself with AI ships a demo fast but leaves the last 30 percent, review and tests open; an AI-native agency ships the demo just as fast and closes production, review, tests and maintenance — highlighted in lime. — Both paths start fast. The difference is who closes the last 30% with discipline.

Dimension	Build it yourself with AI	AI-native agency
First demo	hours to days	hours to days
Last 30%	open, ad hoc	systematically closed
Code review & tests	rare	standard
Security	mostly unchecked	review, SAST, hardening
Maintainability & handover	tech-debt risk	documented, handoverable
Accountability on incident	yours	contractually defined

Important: the left column is not “dumb” and the right one is not “magic”. Both start fast today because both use AI. The difference is not the speed of the first 70%, but the discipline of the last 30%. An AI-native agency passes the speed on to you — and invests the time it saves into review, tests and architecture instead of even more unreviewed code. What that transition from prototype to a reliable system looks like in practice, we describe in From AI prototype to production-ready system.

What you actually pay an agency for now

You no longer pay for typing code — you pay for the judgment that turns AI speed into a safe product. That is the real shift. A large part of an agency's value used to sit in the craft of writing code. Today it sits in the decisions around it: which architecture holds up in two years? Where does AI-generated code fail subtly? Which tests prove it actually works? Which data must never leave the building?

That work has not gotten smaller — it has gotten more valuable, because AI produces more code, faster, and therefore more surface that needs review, tests and security. A modern agency must therefore use AI excellently itself; refusing AI leaves speed and quality on the table. The case that disciplined AI use ships faster and better, we make in AI in software development: faster and better. The flip side — which new risks AI code brings and how to contain them — we cover in The risks of AI code generation. How we fold AI into our development without giving up control is exactly this third path: AI plus engineering rigor, together as the product.

Next steps

Three questions settle faster than any tool debate whether you build it yourself or need support:

Maturity: is this a prototype for validation — or a product that customers and data are entrusted to?
The last 30%: who reliably owns security, tests, scaling and maintenance once the demo stands?
Accountability: who is liable if AI-generated code causes an incident in production?

If the answers point toward “real product”, a conversation is worth it. We build fast with AI — and close the last 30% with the discipline that makes a product reliable. Take a look at our software development or book an intro call directly.

Frequently asked questions

Conclusion

AI makes starting easier and building faster — but it does not remove the last 30% of a product. That distance, between a convincing demo and a system you can rely on, is exactly where senior engineering lives. AI removes the excuse for a slow agency, not the need for a good one. If you want speed, pair AI with discipline — together they are the actual product.

Written by

Hauke Rux

CEO, Project Manager

Share

All posts

Keep reading

Let's talk about your project

Book a 30-minute discovery call. We'll review your goals, surface unknowns, and outline how we would run the engagement.

Schedule a call

Booking calendar (Cal.com)

This area embeds the external service Cal.com. By loading it you agree that a connection to Cal.com is established and data may be transferred to the USA.

Privacy policy