Engineering journal

Writing on production AI, retrieval, and the systems around it.

Field notes from building automation-heavy tools and AI products — what actually holds up in production, the trade-offs nobody mentions, and working code you can lift.

5 articles
✦ Latest

Amazon S3 Vectors for Retrieval at Scale

What Amazon S3 Vectors actually is, how it compares to dedicated vector databases, and a practical walkthrough of using it for retrieval pipelines.

April 21, 2025 · 7 min read · Read
Infrastructure
AI Engineering Mar 14, 2025

How Call Centres Use Voice AI to Automate Conversations

How modern Voice AI systems replace legacy IVR menus — the STT/LLM/TTS stack, real-world deployments, and a practical guide to building a production call centre agent.

11 min
AI Engineering Nov 12, 2024

What Are Temperature, Top-p, and Top-k in AI?

A clear breakdown of the three key sampling parameters in AI language models — what they control, how they interact, and when to reach for each.

7 min
Systems Oct 8, 2024

Instructor vs. OpenAI Structured JSON in Production

A hands-on comparison of Instructor and OpenAI's native structured output for production AI systems — where each shines and where each breaks.

12 min
Infrastructure Sep 3, 2024

A Practical Guide to Quantization

How quantization lets you run large language models on consumer hardware — the math behind it, and working code using BitsAndBytes 4-bit and 8-bit configs.

3 min
No articles in this category yet.