2026-05-19

Sources

AI Reddit — 2026-05-19#

The Buzz#

The defining event today is Andrej Karpathy joining Anthropic’s pre-training team to explicitly use Claude for recursive self-improvement,. The community is treating this as the “Ronaldo signing for Barca” moment for AI, further solidifying Anthropic’s status as the ultimate talent magnet. Meanwhile, Google unveiled Gemini 3.5 Flash and Gemini Omni, but excitement was quickly tempered by developers grumbling about steep 14x request multipliers and confusing benchmarks that make the new model more expensive to run in practice than Gemini 3.1 Pro,,.

2026-05-19

Simon Willison — 2026-05-19#

Highlight#

Simon’s annotated PyCon US 2026 lightning talk provides a sharp, insightful retrospective on the “November 2025 inflection point,” identifying exactly when coding agents became reliable daily drivers and laptop-grade local models started wildly overperforming. It is a quintessential Willison post that perfectly frames the recent tectonic shifts in AI developer tooling.

Posts#

[The last six months in LLMs in five minutes] · Source Simon shares his annotated slides from a PyCon US 2026 lightning talk summarizing the past six months of LLM developments. He zeroes in on two main themes: coding agents crossing the threshold from “often-work” to “mostly-work” driven by Reinforcement Learning from Verifiable Rewards, and the astonishing capability of local models like the 20.9GB Qwen3.6-35B-A3B and Gemma 4. The post also tracks the recent surge of “Claws” (personal AI assistants running locally on Mac Minis) and features his ongoing “pelican riding a bicycle” SVG visual benchmark to compare models.

2026-05-20

Sources

AI Reddit — 2026-05-20#

The Buzz#

The biggest shockwave today is a severe reality check on AI API and subscription pricing. GitHub Copilot’s new token-based billing has users staring at 10x cost increases, while Google’s new Gemini 3.5 Flash is inexplicably priced 14x higher than its predecessor, completely abandoning the “cheap and fast” ethos. As developers scramble to cancel bloated subscription stacks, the contrasting triumph of a user running DeepSeek-V4-Flash locally on a $2,500 rig of legacy RTX 2080 Tis perfectly captures the community’s sudden, aggressive pivot toward cost-control and hardware independence.

AI Reddit

Sources

AI Reddit — 2026-05-29#

The Buzz#

The most impactful shifts today are coming from practitioners tearing down default software wrappers to unlock massive performance gains in local inference and generation. In the local LLM space, Multi-Token Prediction (MTP) is delivering staggering 3.34x inference speedups on dense models like Gemma 4, proving that the decode phase is memory bandwidth bound rather than compute bound. Meanwhile, the Stable Diffusion community finally identified why Qwen Edit 2511 outputs have looked so blurry in ComfyUI: the default nodes were secretly relying on obsolete area downscaling and injecting bloated vision-language descriptions. By bypassing these defaults, users are finally achieving crisp, high-resolution prompt adherence.

AI Reddit

AI Reddit — Week of 2026-05-16 to 2026-05-22#

The Buzz#

The era of sloppy, unlimited “vibe coding” is officially dead, killed by GitHub Copilot’s sudden shift to strict usage-based billing that is driving projected monthly costs for power users from $39 up to a staggering $387, triggering a mass exodus to alternatives. Meanwhile, the talent war saw a massive “Ronaldo signing for Barca” moment as Andrej Karpathy joined Anthropic’s pre-training team to focus on recursive self-improvement using Claude, cementing their status as the ultimate talent magnet. In a ruthless counter-maneuver for market dominance, OpenAI offered $2M in API tokens via uncapped SAFEs to all 169 current Y Combinator startups, effectively trading compute for deep ecosystem lock-in and usage surveillance before founders even have a chance to evaluate open-source alternatives.

Simon Willison

Simon Willison — Week of 2026-05-16 to 2026-05-22#

Highlight of the Week#

The most impactful milestone this week is the official announcement of Datasette Agent, merging Simon’s three years of work on his LLM library directly into Datasette. This conversational AI interface allows users to naturally interrogate their databases, boasting an extensible plugin architecture for charts, image generation, and secure code execution.

Key Posts#

[The last six months in LLMs in five minutes] · Source Simon shared annotated slides from his PyCon US 2026 lightning talk capturing a major inflection point in AI developer tooling. He highlights how coding agents crossed the threshold to become reliable daily drivers, and points to the astonishing capabilities of massive local models running on consumer hardware like Mac Minis.