Blog | Alex Lavaee

Stop Building Subagents. Start Writing Skills.

Most teams overuse subagents when skills are the better primitive. The architectural case for progressive context disclosure, automatic project scoping, and portable expertise across 30+ AI coding tools.

Alex Lavaee, Norin Lavaee

March 17, 2026

Your Agent's Bottleneck Isn't the Model. It's the Context.

AI coding agents burn through hundreds of thousands of tokens grepping files and hallucinating APIs. A new class of context infrastructure tools is emerging to fix both problems — for your codebase and for external libraries.

Alex Lavaee, Norin Lavaee

March 10, 2026

Your AI Agent Writes Plausible Code. Plausible Is 20,000x Slower Than Correct.

A developer reimplemented SQLite in Rust with LLMs — 576,000 lines that compiled, passed tests, and ran 20,171x slower than the real thing. The bugs weren't syntactic. They were semantic. Here's why architecture, specs, test-driven contracts, and targeted review are the fix.

Alex Lavaee, Norin Lavaee

March 9, 2026

GPT-5.4: The Real Leap Isn't Coding

GPT-5.4's coding benchmarks barely moved. But computer use jumped from 47% to 75%, tool search cuts MCP token usage by 47%, and knowledge work hit 83% across 44 professions. Here's what actually matters for developers.

Alex Lavaee, Norin Lavaee

March 5, 2026

AI Agents Demand More Engineering Discipline, Not Less

Four industry leaders independently converged on the same conclusion: engineering discipline is the competitive moat when building with AI agents. Here's the day-one infrastructure that makes agent-generated code reliable.

Alex Lavaee, Norin Lavaee

March 4, 2026

How to Harness Coding Agents with the Right Infrastructure

A technical deep dive into harness engineering — the converging discipline across OpenAI, Anthropic, and independent practitioners that makes coding agents reliable on complex work.

Alex Lavaee, Norin Lavaee

March 3, 2026

Inside the Cloud VMs Powering Autonomous Coding Agents

A technical deep dive into the isolated VM infrastructure that lets AI coding agents operate for hours without human intervention — from Cursor's cloud agents and Firecracker microVMs to snapshot bootstrapping, computer use, and secrets management.

Alex Lavaee, Norin Lavaee

February 26, 2026

Designing the Multi-Agent Development Environment

The biggest constraint in multi-agent development isn't model capability. It's that nobody's built the orchestration, window management, and resource isolation layers end to end. A technical deep dive into what each tool does architecturally, where it breaks, and what the missing product looks like.

Alex Lavaee, Norin Lavaee

February 25, 2026

Junior Engineers Don't Need Protection from AI. They Need Agency.

The discourse assumes juniors need protection from AI tools. They don't. They need trust, a disciplined workflow, and room to build capability on their own terms.

Alex Lavaee, Norin Lavaee

February 24, 2026

If Your Claws Aren't Out, You're Already Falling Behind

Karpathy just named the layer most engineers are missing: Claws. Here's the data behind it, and how to start building it today.

Alex Lavaee, Norin Lavaee

February 23, 2026

Gemini 3.1 Pro, Opus 4.6, and Codex 5.3: A Technical Breakdown of Three Models, Three #1 Positions

Google just reclaimed #1 on SWE-Bench Verified with Gemini 3.1 Pro. But Codex still leads terminal work, and Claude still leads real-world preference. Here's what's technically different about each model—and what engineers are actually experiencing.

Alex Lavaee, Norin Lavaee

February 19, 2026

The New SDLC: A Practical Guide to Agentic Engineering

Coding is practically solved. The engineer's job is shifting from writing code to designing systems, writing specs, and orchestrating agents. Here's what the new software development lifecycle looks like and how to adopt it today.

Alex Lavaee, Norin Lavaee

February 18, 2026

Claude Sonnet 4.6: What Developers Actually Need to Know

Sonnet 4.6 scores within 1.2 points of Opus 4.6 on SWE-bench at roughly 60% of the cost. We break down the benchmarks, architecture changes, pricing math, developer reactions, and what it means for your agentic workflows.

Alex Lavaee, Norin Lavaee

February 17, 2026

Google DeepMind's Delegation Framework for Coding Agent Architecture

Google DeepMind's new paper formalizes delegation as more than task decomposition — it's a transfer of authority, accountability, and trust. Here's what that means for how we build coding agents, with concrete patterns you can apply today.

Alex Lavaee, Norin Lavaee

February 16, 2026

GLM-5 and the Open Model Convergence

GLM-5 hit 77.8% on SWE-bench Verified under an MIT license. The benchmark gap between open and closed models is closing fast. Here's what that means for how you architect your coding agent infrastructure—and what to do about it.

Alex Lavaee, Norin Lavaee

February 12, 2026

Codex Spark and the Two-Mode Future of Coding Agents

OpenAI's Codex Spark trades intelligence for speed at 1,000+ tokens/sec on Cerebras hardware. The real story isn't the model—it's the infrastructure overhaul and the emerging split between speed mode and depth mode in coding agents.

Alex Lavaee, Norin Lavaee

February 12, 2026

OpenAI's Agent-First Codebase Learnings

OpenAI shipped a million lines of code with zero human-written code. The engineering patterns they discovered—progressive disclosure, layered architecture, feedback loops—are patterns you can adopt today. Here's a practical breakdown.

Alex Lavaee, Norin Lavaee

February 11, 2026

Five Architectural Primitives Every Agent Swarm Rediscovers

Cursor ran thousands of agents to build a browser. Anthropic ran 16 to build a C compiler. Both independently converged on the same five design patterns. Here's the technical breakdown of why, and how you can apply them.

Alex Lavaee, Norin Lavaee

February 10, 2026

Building Self-Improving Coding Agents: How Factory's Signals Pipeline Closes the Feedback Loop

Factory's Signals system auto-resolves 73% of agent issues in under 4 hours using LLM judges, friction telemetry, and a closed-loop pipeline. Here's how it works and how you can adopt similar patterns in your own agent infrastructure.

Alex Lavaee

February 9, 2026

Opus 4.6, GPT-5.3 Codex, Agent Teams, and Fleet Mode: What Developers Actually Need to Know

Four major AI releases dropped within 24 hours. Here's a technical deep dive into Opus 4.6, GPT-5.3 Codex, Claude Code's agent teams, and Copilot CLI's Fleet Mode—and how to start using them effectively.

Alex Lavaee, Norin Lavaee

February 5, 2026

Codex macOS: Orchestration-First Agent Desktop

I spent a week exploring OpenAI's new Codex macOS app. Here's what I learned about its orchestration-first approach, how it differs from the Claude workflow I've grown attached to, and whether it's worth adding to your toolkit.

Alex Lavaee

February 4, 2026

Agent-Operated CI/CD: The Architecture Making AI Coding Agents Actually Work

A practical guide to wiring AI coding agents into your CI/CD pipeline with GitHub Actions. Includes working configurations for Copilot Autofix, OpenAI Codex, and Claude Code with proper guardrails.

Alex Lavaee, Norin Lavaee

February 3, 2026

Evolving Coding Agent Infrastructure: The Rise of the Meta-Framework Layer

How hooks, skills, and tool orchestration are transforming developer infrastructure. A deep dive into Claude Code's layered stack and why the most important code you write this year won't be features.

Alex Lavaee

February 2, 2026

Building AI Agents That Work at Any Scale

OpenAI built a data agent serving 3.5k users across 600 petabytes. The architectural patterns that made it work are the same ones that power a 3,000-line coding agent CLI.

Alex Lavaee

January 29, 2026

Atomic: Building Reliable AI Coding Agent Infrastructure

A technical guide to implementing procedural memory, specialized sub-agents, and autonomous ralph loops for AI coding assistants across platforms.

Alex Lavaee

January 28, 2026

Atomic: Automated Procedures and Memory for AI Coding Agents

Building on AI Coding Infrastructure, Atomic introduces a research-to-execution flywheel where specifications become lasting memory. Here's what we learned scaling multi-agent workflows.

Alex Lavaee, Norin Lavaee

December 8, 2025

How I Shipped 100k LOC in 2 Weeks with Coding Agents

Open-sourcing my developer workflow with AI agents: skills, sub-agents, and autonomous execution. A 5-minute setup that provides the missing infrastructure layer for AI coding tools.

Alex Lavaee

November 12, 2025

Continuous Self-Learning in AI Agents

An overview of two frameworks for memory and context management that enable continuous self-learning systems.

Alex Lavaee, Norin Lavaee

November 10, 2025

Context Engineering Navigator

An interactive cheat sheet covering context engineering techniques for LLMs including retrieval, processing, management, and dynamic assembly strategies.

Alex Lavaee, Norin Lavaee

September 19, 2025

Building Products with Agentic-Powered IDEs

How context engineering transforms AI-powered development tools from disappointing to transformative through smart prompting, MCP servers, and strategic tool integration.

Alex Lavaee, Norin Lavaee

July 23, 2025

Memorization, Generalization, and Reasoning

A deep dive into the concepts of memorization, generalization, and reasoning in large language models.

Alex Lavaee, Norin Lavaee

June 23, 2025