Tech Company Blogs

Engineering @ Scale — Week of 2026-05-16 to 2026-05-22#

Week in Review#

This week, engineering organizations aggressively shifted away from unconstrained, single-agent architectures toward highly deterministic, platform-governed execution loops. A clear consensus emerged that scaling AI requires decoupling stochastic reasoning engines from strict, sandboxed execution environments, while simultaneously optimizing the underlying “boring machinery” of data pipelines to feed these models without bottlenecking real-time inference.

Top Stories#

How Snapchat Serves a Billion Predictions Per Second · Snapchat Snapchat reduced its data plane costs by 10x and halved inference latency by transferring features as raw bytes and delaying deserialization until inside the inference engine. At the scale of a billion predictions per second, this proves that optimizing network transport and hardware-specific execution graphs (e.g., isolating dense matrix multiplications on GPUs while keeping embedding lookups on CPUs) is far more critical than tuning the ML model itself.

2026-05-26

Simon Willison — 2026-05-26#

Highlight#

Today’s updates emphasize the dual-edged sword of AI in security, contrasting how AI tools are overwhelming open-source maintainers with a flood of valid vulnerability reports while simultaneously introducing novel data exfiltration risks in enterprise agentic systems like Microsoft Copilot.

Posts#

The pressure · Source Daniel Stenberg highlights the unprecedented toll that high-quality, AI-assisted security reports are taking on the curl project’s team. The volume of credible vulnerabilities has surged to over one report per day—double the rate seen in 2025—leading to severe work-life balance issues for maintainers. Fortunately, because curl is well-architected, these AI-discovered flaws are almost exclusively categorized as LOW or MEDIUM severity, with no HIGH severity issues found since late 2023.

2026-04-03

Simon Willison — 2026-04-03#

Highlight#

The overarching theme today is the sudden, step-function improvement in AI-driven vulnerability research. Major open-source maintainers are simultaneously reporting that the era of “AI slop” security reports has ended, replaced by an overwhelming tsunami of highly accurate, AI-generated bug discoveries that are drastically changing the economics of exploit development.

Posts#

Vulnerability Research Is Cooked · Source Highlighting Thomas Ptacek’s commentary, Simon notes that frontier models are uniquely suited for exploit development due to their baked-in knowledge of bug classes, massive context of source code, and pattern-matching capabilities. Since LLMs never get bored constraint-solving for exploitability, agents simply pointing at source trees and searching for zero-days are set to drastically alter the security landscape. Simon is tracking this trend closely enough that he just created a dedicated ai-security-research tag to follow it.

2026-04-05

Simon Willison — 2026-04-05#

Highlight#

Simon highlights a deep-dive post by Lalit Maganti on the realities of “agentic engineering” when building a robust SQLite parser. The piece beautifully articulates a crucial lesson for our space: while AI is incredible at plowing through tedious low-level implementation details, it struggles significantly with high-level design and architectural decisions where there isn’t an objectively right answer.

Posts#

Eight years of wanting, three months of building with AI Simon shares a standout piece of long-form writing by Lalit Maganti on the process of building syntaqlite, a parser and formatter for SQLite. Claude Code was instrumental in overcoming the initial hurdle of implementing 400+ tedious grammar rules, allowing Lalit to rapidly vibe-code a working prototype. However, the post cautions that relying on AI for architectural design led to deferred decisions and a confusing codebase, ultimately requiring a complete rewrite with more human-in-the-loop decision making. The core takeaway is that while AI excels at tasks with objectively checkable answers, it remains weak at subjective design and system architecture.

2026-04-07

Simon Willison — 2026-04-07#

Highlight#

Anthropic’s decision to restrict access to their new Claude Mythos model underscores a massive, sudden shift in AI capabilities. It is a fascinating look at an industry-wide reckoning as open-source maintainers transition from dealing with “AI slop” to facing a tsunami of highly accurate, sophisticated vulnerability reports.

Posts#

[Anthropic’s Project Glasswing - restricting Claude Mythos to security researchers - sounds necessary to me] · Source Anthropic has delayed the general release of Claude Mythos, a general-purpose model similar to Claude Opus 4.6, opting instead to limit access to trusted partners under “Project Glasswing” so they can patch foundational internet systems. Simon digs into the context, tracking how credible security professionals are warning about the ability of frontier LLMs to chain multiple minor vulnerabilities into sophisticated exploits. He even uses git blame to independently verify a 27-year-old OpenBSD kernel bug discovered by the model. He concludes that delaying the release until new safeguards are built, while providing $100M in credits to defenders, is a highly reasonable trade-off.

2026-04-07

Sources

Engineering @ Scale — 2026-04-07#

Signal of the Day#

By implementing an LLM-based risk classifier as an executable guardrail, Vercel successfully automated 58% of monorepo pull request merges without increasing revert rates. This demonstrates that mature codebases often suffer from review capacity misallocation rather than a lack of verification capability, making automated risk routing a highly effective scaling lever.

2026-04-09

Sources

Apple Ecosystem Daily — 2026-04-09#

Highlights#

Today’s news cycle is characterized by critical software refinements and fascinating hardware modifications. Apple released bug-fixing updates across its operating systems and professional creative apps, while also expanding its Self Service Repair program to accommodate its newest hardware releases. Concurrently, the enthusiast community demonstrated the hardware hackability of the new MacBook Neo, and security researchers shed light on both Apple Intelligence vulnerabilities and law enforcement data extraction techniques.

2026-04-14

Sources

Engineering @ Scale — 2026-04-14#

Signal of the Day#

To prevent API endpoints from exhausting an LLM’s context window, Cloudflare introduced a “Code Mode” architectural pattern for Model Context Protocol (MCP) servers that collapses thousands of tools into just two: a search function and a sandboxed JavaScript execution function. This progressive tool disclosure approach reduced their internal token consumption by 94% and offers a highly scalable model for hooking enterprise APIs to autonomous agents.

2026-04-15

Sources

Apple Ecosystem Daily Digest: AI Push, Foldable iPhone Ultra, and OLED iPads — 2026-04-15#

Highlights#

Today’s news is heavily dominated by the rapid integration of AI across the Apple ecosystem, underscored by Google launching a native Gemini app for Mac and Apple sending its own Siri engineers to an AI coding bootcamp ahead of WWDC. On the hardware front, exciting new details have emerged regarding Apple’s first foldable device, the “iPhone Ultra,” alongside reports of OLED displays coming to the iPad Air and iPad mini.

2026-04-18

Sources

Apple Ecosystem Daily — 2026-04-18#

Highlights#

Today’s ecosystem news highlights major hardware transitions and significant software shifts, most notably the looming end of Intel Mac support with the upcoming macOS 27. We are also tracking surprising supply constraints for the wildly popular new MacBook Neo, a definitive conclusion to the Apple Watch import ban saga, and new zero-day workarounds from macOS malware authors.