Writing
Systems thinking applied to AI infrastructure.
Architecture-first writing on production challenges: how real teams build durable agents, secure workflows, observable systems, and AI infrastructure that holds up under load.
Featured Research
Architecture problems I've actually solved.
These aren't tutorials. They're systems breakdowns of real challenges: stateful agent execution, tool governance, observability at scale, and production reliability patterns that matter when things fail at 3 AM.
LangGraph architecture · May 2026 · 7 min read
LangGraph v1 and Durable Agent Architecture: Why Enterprise AI Needs Checkpoints
Production agents crash. When they do, you need checkpoints. This is how real teams build resumable workflows that survive restarts, approval delays, and API failures without replaying unsafe work.
MCP security · May 2026 · 8 min read
MCP Security Architecture: Tool Permissions, Context Boundaries, and Enterprise Guardrails
MCP is elegant for tool integration, but it concentrates risk. Here's how to architect tool permissions, isolation boundaries, governance policies, and audit logs before you wire enterprise systems into agents.
AI observability · May 2026 · 7 min read
OpenTelemetry GenAI Observability: Tracing Agent Workflows Beyond Token Counts
Token counts lie. Agent failures are loud but opaque. This is how production teams use OpenTelemetry semantic conventions to trace multi-agent workflows, isolate failures, and debug without guessing.
Context engineering · May 2026 · 8 min read
Context Engineering for Enterprise RAG: From Prompt Windows to Retrieval Operating Systems
Longer context windows changed how RAG systems are architected. Here's how production teams layer system instructions, memory, evidence, and governance to build dependable retrieval systems.
FastAPI AI backends · May 2026 · 7 min read
FastAPI AI Backends for Background Reasoning: Queues, Polling, and Resumable Workflows
Long-running reasoning calls need backend architecture built for them. Your API shouldn't hold an HTTP connection hostage while the model thinks. Here's how to run reasoning as background work with queues, polling, and resumable workflows.
Medium & Distribution
Where ideas amplify.
Medium remains the fastest way to reach engineers at scale. I publish canonical research here first, then use other platforms to extend reach. RSS syndication and cross-posting to LinkedIn happen automatically.
AI infrastructure / April 2026
Desktop AI Supercomputing is Here: A Practical Look at NVIDIA DGX Spark for Startups
Multi-agent systems / May 2025
The Future of AI: Building Agent-to-Agent Communication Systems
LangGraph / January 2025
Building an AI-Powered Stock Analysis Pipeline with LangGraph, DeepSeek, and Ollama
AI agents / September 2024
Building a Real-Time AI Agent with LangChain, LangGraph, and Open Source LLMs using Ollama
What's Next
Building a living architecture journal.
Coming next: MDX support for live code examples, interactive diagrams, searchable tag filters, and RSS feeds. The goal is to make this a reference resource, not just another blog.
Work With Me
Building systems that matter? Let's talk.
Bring your hardest system constraint: retrieval quality, agent failure modes, latency, evaluation, deployment topology, or technical market education.