Back To The Future of Software: How to Survive The AI Apocalypse with Tests, Prompts and Specs
Abstract
AI writes 42% of committed code, yet 96% of developers don’t trust it. Every abstraction leap in software history — compilers, VMs, cloud, serverless — was deterministic. AI isn’t. We’ve been giving monkeys GPUs and hoping for Shakespeare. The industry responded with context engineering, skills, and spec-driven development — $322M+ in funding, 145k+ GitHub stars. But all of it optimizes the edges of a chasm: better inputs on one side, better outputs on the other, and stochastic monkeys in the middle. The Intent Integrity Chain bridges that gap: human intent in, human validates the spec, machine creates deterministic tests (locked), monkey writes code, machine validates against locked tests. No monkey ever validates its own work. We wrapped a non-deterministic process in a deterministic chain — the same trust mechanism that made every previous abstraction leap work. We’ll show the chain in action with iikit (intent-integrity-kit), a fork of GitHub Spec Kit that adds the missing piece: test verification. Live demo included.
Resources
The Kiro Incident
Data & Research
- Sonar State of Code Developer Survey, January 2026 (PDF)
- Sonar Press Release — 96% Don’t Trust AI Output
- Spec-Driven Development: From Code to Contract in the Age of AI Coding Assistants — arXiv
Industry Validation
- Spec-Driven Development — Thoughtworks Technology Radar
- Spec-Driven Development: Unpacking one of 2025’s key practices — Thoughtworks Blog
- Spec Driven Development: When Architecture Becomes Executable — InfoQ
- Diving Into Spec-Driven Development With GitHub Spec Kit — Microsoft Developer Blog
Talk-Material Sources (New for Arc of AI)
- Lint Rebellion — Claude Code disables 40+ lint rules instead of fixing code — @michael_chomsky
- Uber Budget Explosion — 5K engineers, annual AI budget burned by April — @aakashgupta
- The Great Siloing — Google’s AI adoption failure, the checkbox fallacy — Steve Yegge
- AI Cost Paradox — $280/day personal spend, same P&L crisis across 30 companies — Itamar Novick
- Claude Status vs Peppers — reliability meme, 258K views — @wongmjane
Supporting Sources
- AI Desperation — Anthropic emotion vectors, Claude cheats under pressure — Serge Bulaev
- Action-Before-Understanding Bias — Claude Code GitHub Issue #32508
- Month with Claude Code — documentation as productivity lever — Peng Qi
- Claude Code Source Leak — system prompt analysis — @aakashgupta
Spec-Driven Development Tools
Intent Integrity Kit (iikit)
- iikit on GitHub — tessl-labs/intent-integrity-kit
- iikit on Tessl Registry
- Install:
tessl install tessl-labs/intent-integrity-kit
The Dark Factory
- How StrongDM’s AI team build serious software without even looking at the code — Simon Willison
- The StrongDM Software Factory — StrongDM Blog
Referenced in the Talk
- The hottest new programming language is English — Andrej Karpathy
- The is_prime 95% accuracy trick — @shobhitic
- Grace Hopper and the first compiler — Computer History Museum