Cosine top-K is doing the right math but the wrong job. Here is the gotcha, the fix (MMR), and a second fix on top of that (geographic deduplication) that production systems actually need.
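The core MMR loop is small enough to sketch inline. A minimal numpy version (function name and the λ default are illustrative, not from the post):

```python
import numpy as np

def mmr(query_vec, doc_vecs, k=3, lam=0.7):
    """Maximal Marginal Relevance: balance query relevance against redundancy
    with already-selected documents. Returns indices into doc_vecs."""
    def cos(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

    candidates = list(range(len(doc_vecs)))
    selected = []
    while candidates and len(selected) < k:
        def score(i):
            relevance = cos(query_vec, doc_vecs[i])
            # Penalty: similarity to the closest already-selected doc.
            redundancy = max((cos(doc_vecs[i], doc_vecs[j]) for j in selected),
                             default=0.0)
            return lam * relevance - (1 - lam) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected
```

With `lam=1.0` this degenerates to plain cosine top-K; lowering `lam` trades relevance for diversity, which is what evicts near-duplicate chunks.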
Most agent evals measure the wrong thing. Output quality alone misses 80% of production failure modes. The five eval categories I now run on every agentic system, the LLM-as-judge gotchas, and how to ship a useful eval harness in a week.
Fixed-size chunking is fine for blog posts and dangerous for technical docs. Six chunking strategies, the failure modes I hit with each, and the hybrid approach I now reach for when retrieval quality matters.
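For reference, the naive fixed-size baseline the post argues against fits in three lines (character-based with overlap; parameter defaults are illustrative):

```python
def chunk_fixed(text: str, size: int = 200, overlap: int = 40) -> list[str]:
    """Naive fixed-size chunking with overlap: fine for uniform prose,
    risky for technical docs where it splits code blocks and tables mid-way."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]
```

The overlap is what keeps a sentence that straddles a boundary retrievable from at least one chunk; it does nothing for structure-aware failures.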
Tool naming, description writing, argument schemas, error envelopes, idempotency. Lessons from authoring two production MCP servers, with the patterns I now reach for and the mistakes I've stopped making.
Counter-narrative to 'agents everywhere'. Multi-agent systems pay a real latency, complexity, and observability cost. Here's the framework I use to decide whether to split, and three concrete cases where a single call beats N agents in a graph.

Part 1 of a 2-post series on streaming-data architectures. Lambda architecture solved the right problem in 2014: how to combine batch correctness with streaming's low latency. The cost was running two pipelines for the same logic. Here's why it was right then, and why most teams shouldn't pick it now.
Part 2 of a 2-post series. Kappa architecture is what you get when you ask 'what if we just did everything as a stream and replayed from the event log when we need to reprocess?' One pipeline, no diverged-codebase pain. Here's how it works, where it shines, and where it still bites.
After three years of building and refactoring medallion data lakes, here's the opinionated rule set that holds up: what bronze should and should not do, what makes silver actually queryable, and how to keep gold from drifting into chaos.
From batch to streaming, the gotchas. Watermarks that drop too aggressively, late events that get silently lost, stateful aggregations that grow without bound, and the four operational habits that keep streaming jobs healthy in production.
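The watermark gotcha in miniature: an event is dropped once it falls behind the high-water mark by more than the allowed delay. A plain-Python approximation of that rule (names and the tuple shape are illustrative, not any engine's API):

```python
def with_watermark(events, delay):
    """Partition (event_time, payload) tuples into kept vs. silently dropped,
    using the rule: drop anything older than (max event-time seen) - delay."""
    kept, dropped = [], []
    max_ts = float("-inf")
    for ts, payload in events:
        max_ts = max(max_ts, ts)          # the watermark only advances
        if ts >= max_ts - delay:
            kept.append((ts, payload))
        else:
            dropped.append((ts, payload))  # this is the silent-loss failure mode
    return kept, dropped
```

Note the asymmetry: one early fast-clock event advances the watermark for everyone, which is exactly how an aggressive delay setting starts eating late data.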
Why your Spark job has 200 partitions even when your data is only 5 GB. How to pick a target partition size, when to repartition vs coalesce, when AQE saves you, and the diagnostic loop I run on every slow job now.
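The partition-count arithmetic itself is one line. A sketch using the common ~128 MB rule of thumb (the target size is a widely used default, not a universal constant):

```python
import math

def target_partitions(total_bytes: int, target_mb: int = 128) -> int:
    """Rough partition count: total data size over a target partition size.
    Compare this to Spark's default of 200 shuffle partitions."""
    return max(1, math.ceil(total_bytes / (target_mb * 1024 * 1024)))
```

For the 5 GB example in the teaser this yields 40 partitions, a fifth of the 200 the default hands you.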
Architectural pattern for a multi-tenant data-quality platform across many teams. Why centralising contracts beats centralising data, and how the SDK + control plane + fact/dim store pattern works regardless of which tools you reach for.
Five MERGE patterns that solved real problems for me: idempotent upsert, soft delete, late-arriving data, deduplication on ingest, and the slowly-changing-dimension type-2 case. Plus the performance gotchas that bit me first.
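The semantics of the first pattern, the idempotent upsert, can be modelled in plain Python (a toy stand-in for `MERGE ... WHEN MATCHED THEN UPDATE, WHEN NOT MATCHED THEN INSERT`, not the actual Spark/Delta API):

```python
def merge_upsert(target: dict, updates: list[dict], key: str) -> dict:
    """Toy MERGE: target is keyed rows, updates is a batch of incoming rows.
    Matched keys are overwritten, unmatched keys are inserted."""
    merged = dict(target)
    for row in updates:
        merged[row[key]] = row
    return merged
```

The property that makes it idempotent: replaying the same batch is a no-op, so a retried ingest job can't double-apply.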
How to know your job is skewed (the Spark UI lies more than you think), and the three fix patterns I reach for in production: salt-and-aggregate, broadcast join, and AQE-driven dynamic shuffle.
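Salt-and-aggregate in miniature: fan one hot key out across N salted sub-keys, pre-aggregate, then strip the salt and combine. A single-process sketch of the two stages (the distributed version shuffles between them; names are illustrative):

```python
import random
from collections import defaultdict

def salted_sum(records, n_salts=4):
    """Two-stage sum over (key, value) pairs. Stage 1 spreads a hot key
    across n_salts partial sums; stage 2 merges the partials per key."""
    partial = defaultdict(int)
    for key, value in records:
        partial[(key, random.randrange(n_salts))] += value  # salted sub-key
    final = defaultdict(int)
    for (key, _salt), subtotal in partial.items():
        final[key] += subtotal                               # strip the salt
    return dict(final)
```

The result is identical to a direct groupBy-sum; only the intermediate cardinality changes, which is the whole point for skew.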
Part 1 of a 3-post series. Decorators are easy to misuse and spectacular when they fit. Here are the four shapes I reach for in production AI/data work, with code: retry, timing, instrumentation, and feature-flagging.
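The retry shape, as a minimal sketch with exponential backoff (defaults and the caught-exception tuple are illustrative; production versions usually add jitter and logging):

```python
import functools
import time

def retry(attempts=3, base_delay=0.1, exceptions=(Exception,)):
    """Retry a flaky call, doubling the delay after each failure."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            for attempt in range(1, attempts + 1):
                try:
                    return fn(*args, **kwargs)
                except exceptions:
                    if attempt == attempts:
                        raise  # out of attempts: surface the real error
                    time.sleep(base_delay * 2 ** (attempt - 1))
        return wrapper
    return decorator
```

`functools.wraps` is the easy-to-forget line: without it the decorated function loses its name and docstring, which breaks introspection and some frameworks.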
Part 2 of a 3-post series. Context managers earn their keep when you have a resource that must be set up and torn down, and `with open()` is barely scratching the surface. Five patterns: timing, transactions, async, ExitStack, and suppressing exceptions.
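The timing pattern is the smallest of the five; a sketch with `contextlib.contextmanager` (the label/print interface is illustrative, real code would hand this to a logger or metrics client):

```python
import time
from contextlib import contextmanager

@contextmanager
def timed(label):
    """Time the enclosed block; reports in a finally so it fires even
    when the block raises, and the exception still propagates."""
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed = time.perf_counter() - start
        print(f"{label}: {elapsed:.3f}s")
```

The `try/finally` around the `yield` is the load-bearing part: it's what makes teardown unconditional.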
Final part of the series. Generators turn an O(N) memory problem into an O(1) memory problem and let you compose pipelines that read like sentences. Five patterns: streaming files, paginated APIs, itertools chains, batching, and async generators.
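The batching pattern as a generator, sketched with `itertools.islice` (Python 3.12 ships `itertools.batched` with essentially these semantics; this hand-rolled version yields lists):

```python
from itertools import islice

def batched(iterable, size):
    """Yield successive lists of up to `size` items. Memory stays O(size)
    regardless of how long the input iterable is."""
    it = iter(iterable)
    while batch := list(islice(it, size)):
        yield batch
```

Because it consumes the source lazily, this composes with the other patterns: you can batch a streaming file reader or a paginated-API generator without ever materialising the whole input.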
Three Python data libraries, three personalities. pandas is the API everyone knows. Polars is faster and stricter. DuckDB is the database masquerading as a library. Here's the decision framework I use, with the cases where each is genuinely the right pick.
Three tools that replaced six in my Python workflow. uv subsumes pip + venv + pip-tools + build + twine. ruff subsumes flake8 + black + isort + pyupgrade. pyright (or basedpyright) makes type checking instant. Here's how to migrate.
Pydantic v2 isn't just dataclasses with validation; it's a contract layer for any boundary where untrusted data crosses into your code: HTTP, JSON files, LLM tool outputs, queue messages. Here are the patterns I reach for.
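The boundary pattern in its simplest form, assuming Pydantic v2 is installed (the `ToolCall` model and its fields are a made-up example, not from the post):

```python
from pydantic import BaseModel, ValidationError

class ToolCall(BaseModel):
    """Contract for an LLM tool-call payload crossing into our code."""
    name: str
    temperature: float = 0.0

# Wire data arrives as JSON with stringly-typed numbers; default (lax)
# mode coerces "0.7" to the float 0.7 at the boundary.
call = ToolCall.model_validate_json('{"name": "search", "temperature": "0.7"}')
```

A payload missing a required field raises `ValidationError` at the boundary instead of surfacing as an `AttributeError` three layers deep, which is the entire argument for validating at the edge.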
Pattern matching, structural type generics, exception groups, TaskGroup, the f-string self-documenting `=`, and a couple of less-loved features. Code samples for each, with the use cases I reach for them in real work.
The mental model behind asyncio that finally clicked, the production patterns I reach for (TaskGroup, Semaphore, run_in_executor), and the four mistakes that bite every async codebase eventually.