2026-05-20

Sources

Engineering @ Scale — 2026-05-20#

Signal of the Day#

Netflix’s decision to decouple raw video ingestion from multimodal AI data fusion serves as a masterclass in pipeline architecture. By persisting raw model outputs into Cassandra first and relying on asynchronous “temporal bucketing” to align intersecting predictions offline, they prevent complex intersections from bottlenecking their real-time 216-million-frame ingest layer.

Simon Willison

Simon Willison — Week of 2026-05-16 to 2026-05-22#

Highlight of the Week#

The most impactful milestone this week is the official announcement of Datasette Agent, merging Simon’s three years of work on his LLM library directly into Datasette. This conversational AI interface allows users to naturally interrogate their databases, boasting an extensible plugin architecture for charts, image generation, and secure code execution.

Key Posts#

[The last six months in LLMs in five minutes] · Source Simon shared annotated slides from his PyCon US 2026 lightning talk capturing a major inflection point in AI developer tooling. He highlights how coding agents crossed the threshold to become reliable daily drivers, and points to the astonishing capabilities of massive local models running on consumer hardware like Mac Minis.