Sources

Tech Videos — 2026-05-22#

Watch First#

The standout video today is Chip design from the bottom up – Reiner Pope from the Dwarkesh Patel channel. Reiner Pope (CEO of MatX) provides a phenomenal, zero-fluff explanation of how AI chips fundamentally work—starting from basic logic gates, detailing the specific math of multiplier-accumulators, and building all the way up to why systolic arrays efficiently balance compute versus communication in modern TPUs and GPUs.

Highlights by Theme#

Developer Tools & Platforms#

On Google Cloud Tech, “What’s new in Go” details the new Green Tea Garbage Collector in Go 1.26, a vector-accelerated overhaul that shifts memory operations to contiguous pages, cutting GC CPU overhead by up to 50% without requiring any code changes. Also on Google Cloud Tech, “The ABCs of agent building” formalizes integrations for agent-based development, moving beyond brittle API wrappers by utilizing the Model Context Protocol (MCP) for tool discovery, the Universal Commerce Protocol (UCP) for structured ordering, and the Agent Payments Protocol (AP2) to enforce hard spending limits. Finally, Fireship’s digest “Google’s AI endgame is here…” highlights “Antigravity”—an agent-focused VS Code fork that impressively live-demoed writing its own drivers to run the game Doom on a scratch-built operating system.

AI & Machine Learning#

For practical architecture, AWS Developers’ “Stop AI Hallucinations With These 5 Techniques” outlines code-level guardrails, such as using Graph-RAG over vector RAG for deterministic aggregation, and writing “neurosymbolic guardians” in Python to strictly block LLMs from bypassing constraints that they would otherwise ignore in prompts. On the open weights front, Google for Developers announced the Gemma 4 lineup in “What’s new in the Gemma open model family”, releasing a highly efficient 26B Mixture-of-Experts (MoE) model and a 31B dense model, both featuring a 256k context window and a permissive Apache 2.0 license. Addressing inference speed, “Fast Models Need Slow Developers” by AI Engineer argues that because Cerebras Codex Spark hits 1,200 tokens per second, expensive test suite validations and massive diff generations are practically free, requiring a fundamental shift in how developers loop with AI.

Hardware & Infrastructure#

In a massive infrastructure revelation discussed on the All-In Podcast’s “SpaceX’s $2T Case…”, Anthropic is reportedly paying SpaceX $1.25 billion per month for access to the Colossus orbital compute clusters. On the silicon side, Fireship notes that Google is officially splitting its custom tensor hardware into distinct jobs, creating the TPU-T optimized for training and the TPU-I optimized for inference. For edge compute workloads, Google for Developers’ “Bring the power of on-device AI to life…” showcases LiteRT-LM, an optimized runtime that uses device NPUs and memory caching to run Gemma 2B and 4B models locally with zero network latency.

Everything Else#

On the corporate governance front, Y Combinator’s interview with Eric Ries in “How The Best Companies Defend Against Mediocrity And Rot” strongly challenges the standard Delaware C-Corp structure, arguing that founders must leverage Public Benefit Corporations (PBCs) or trust structures to legally shield their technical mission from short-sighted activist investors. Providing a much-needed dose of skepticism, ThePrimeagenHighlights’ “Industry Wide AI Psychosis” amplifies Mitchell Hashimoto’s warning that heavily relying on AI agents for rapid feature shipping is masking deep architectural decay and creating a false sense of security within software engineering.


Categories: YouTube, Tech