Writing

Selected technical writing on ML system design, retrieval infrastructure, and Bayesian modeling. Each piece focuses on implementation choices, trade-offs, and failure modes.

Building a production-style Vector RAG backend

A production-oriented retrieval architecture for RAG applications, focused on index design, latency-recall trade-offs, citation traceability, and operational patterns for evaluation and monitoring.