~ali/blog
$ ls -lt *.md
2026-02-20
MCP, LLMs, Tooling, Architecture
Every AI system I've built has a graveyard of custom connectors: bespoke glue code written to bridge a model to a tool. Model Context Protocol is here to replace all of it. Here's what it is, why it matters, and what it changes about how we build AI systems.
2026-02-18
RAG, LLMs, Architecture, Production ML
I spent months building a production RAG system. Then I watched someone dump the entire corpus into a million-token context window and get the same answer in one shot. Here's an honest look at where RAG still wins — and where it's already losing.
2026-01-29
Edge AI, Quantization, Inference, MLOps
The race to 100B+ parameters is over for most applications. 7B quantized models running on-device are beating 70B hosted models on latency, cost, and privacy. A look at why smaller models are winning — and what that means for how we deploy ML.
2026-01-12
RLVR, Fine-tuning, Reasoning, LLMs
RLHF teaches models to sound good. RLVR teaches them to be correct. Reinforcement learning from verifiable rewards is how DeepSeek-R1 and similar models learned to actually reason — and it's changing how we think about post-training.
$