Production-Grade AI Home Lab Platform
Designed and built an end-to-end, 100% self-hosted local AI platform running on Apple Silicon (MacBook M1 Pro) to serve low-latency inference with zero external cloud dependencies.
Architecture decisions
- Integrated LangGraphJS for stateful multi-agent orchestration and tool control loops
- Configured Cloudflare Tunnels (WAF + Tunnel) for secure, zero-open-port ingress
- Engineered an OpenAI-compatible FastAPI gateway proxy routing queries to local Ollama and MLX model runtimes