Sources
- AI Engineer
- All-In Podcast
- Andrej Karpathy
- Anthropic
- Apple
- Apple Developer
- AWS Events
- ByteByteGo
- Computerphile
- Cursor
- Dwarkesh Patel
- EO
- Fireship
- GitHub
- Google Cloud Tech
- Google DeepMind
- Google for Developers
- Hung-yi Lee
- Lenny's Podcast
- Lex Clips
- Lex Fridman
- Life at Google
- Marques Brownlee
- Microsoft
- No Priors: AI, Machine Learning, Tech, & Startups
- Numberphile
- NVIDIA
- OpenAI
- Perplexity
- Quanta Magazine
- Slack
- The Pragmatic Engineer
- Visual Studio Code
Tech Videos — 2026-04-29#
Watch First#
The math behind how LLMs are trained and served – Reiner Pope MatX CEO Reiner Pope delivers an incredible blackboard breakdown of inference economics, showing exactly how memory bandwidth and KV cache capacity fundamentally dictate batch sizes and latency limits. If you want to cut through the marketing noise and understand why AI APIs cost what they do, or why context length scaling has hit a hard memory wall, this is the single best technical explanation available.