Sources
- AI Engineer
- All-In Podcast
- Andrej Karpathy
- Anthropic
- Apple
- Apple Developer
- AWS Events
- ByteByteGo
- Computerphile
- Cursor
- Dwarkesh Patel
- EO
- Fireship
- GitHub
- Google Cloud Tech
- Google DeepMind
- Google for Developers
- Hung-yi Lee
- Lenny's Podcast
- Lex Clips
- Lex Fridman
- Life at Google
- Marques Brownlee
- Microsoft
- No Priors: AI, Machine Learning, Tech, & Startups
- Numberphile
- NVIDIA
- OpenAI
- Perplexity
- Quanta Magazine
- Slack
- The Pragmatic Engineer
- Visual Studio Code
Tech Videos — 2026-06-07#
Watch First#
Immediate Mode UI and Animations with Casey Muratori is an absolute must-watch for its deep architectural dive into why deferred, DOM-style layout engines inevitably turn into overly complex bottlenecks, and how imperative Immediate Mode UI radically simplifies state management.
Highlights by Theme#
Developer Tools & Platforms#
In From MCP to Scale: Pipelines That Build Themselves on the AI Engineer channel, Rafael Levi demos using the Model Context Protocol (MCP) to dynamically write and self-heal web scrapers for bot-heavy sites like Walmart. The demo highlights practical token-saving strategies by outputting scraped data as markdown rather than raw HTML, effectively reducing token costs by about 62%. For frontend and game developers, ThePrimeagenHighlights features a rigorous technical breakdown in Immediate Mode UI and Animations with Casey Muratori. Muratori pushes back against retaining complex node graphs for UI, contrasting Z-buffer fragment rendering with Painter’s algorithms and arguing that developers should execute bespoke layout flows imperatively rather than relying on bloated DOM layout interpreters. On a lighter, non-technical note, GitHub announced some new merchandise like matching koozies and hoodies in Check out the new GitHub shop spring collection.
AI & Machine Learning#
On the AI Engineer channel, Dat Ngo details the architecture of LLM evaluations in LLM Observability, Evaluation, Experimentation Platform. He emphasizes the use of OpenTelemetry (OTel) traces to audit agents across granular spans, multi-span trajectories, and full session scopes, noting that code alone can no longer effectively audit non-deterministic models. The talk also breaks down the nuances of using LLM-as-a-judge versus deterministic code evals to measure signal without relying entirely on human intervention.
Hardware & Infrastructure#
In Under 5 minutes to a deployed LLM endpoint, Audry Hsu demonstrates RunPod’s serverless cloud AI infrastructure. The live console demo shows how developers can sidestep the current GPU supply crunch by deploying open-source models via vLLM on autoscaling H100s and A100s, managing cold starts and setting up functional endpoints in mere minutes. Additionally, in Why Secondary Markets Are Eating the IPO, Gavin Baker briefly notes an impending networking super-cycle driven by the disaggregation of AI inference, highlighting companies like DriveNets and Arrcus that are building specialized silicon networking for data centers.
Everything Else#
The All-In Podcast panel unpacks a structural shift in late-stage private equity in Why Secondary Markets Are Eating the IPO. The panel details how SPVs and secondary platforms like Forge are replacing traditional IPOs to provide necessary employee liquidity for mega-unicorns like SpaceX and Anthropic, as founders increasingly choose to avoid public market scrutiny. Finally, Dwarkesh Patel shares a short, philosophical anecdote in The one sentence that opened up the universe for a stranger about a rancher whose mind was blown upon discovering that the stars in the night sky are simply distant suns.