Sources
- AI Engineer
- All-In Podcast
- Andrej Karpathy
- Anthropic
- Apple
- Apple Developer
- AWS Events
- ByteByteGo
- Computerphile
- Cursor
- Dwarkesh Patel
- EO
- Fireship
- GitHub
- Google Cloud Tech
- Google DeepMind
- Google for Developers
- Hung-yi Lee
- Lenny's Podcast
- Lex Clips
- Lex Fridman
- Life at Google
- Marques Brownlee
- Microsoft
- No Priors: AI, Machine Learning, Tech, & Startups
- Numberphile
- NVIDIA
- OpenAI
- Perplexity
- Quanta Magazine
- Slack
- The Pragmatic Engineer
- Visual Studio Code
Tech Videos — 2026-04-06#
Watch First#
Insights from NVIDIA Research | NVIDIA GTC is the standout watch today, offering a dense, highly credible look into how GPU hardware architectures are physically evolving to support high-throughput LLM inference alongside novel reinforcement learning pre-training techniques.
Highlights by Theme#
Developer Tools & Platforms#
On the Fireship channel, Cursor ditches VS Code, but not everyone is happy… covers Cursor 3.0 abandoning its VS Code roots for a custom Rust and TypeScript rewrite optimized for managing multiple parallel agents, powered by a “Composer 2” model that was initially marketed as in-house but later revealed to be Moonshot’s Kimmy K2 wrapped in heavy reinforcement learning. Conversely, the Visual Studio Code channel released a 6-part series, including VS Code Learn | Reviewing and controlling agent changes and Steering & Queueing is Here!, showcasing their own agentic Copilot setup with transparent debug views, checkpoint restores, and live queueing of tasks to compete on granular agent control. Meanwhile, the Syntax podcast episode It’s Been A Hell Of Week dives into Claude Code’s CLI source map leaking on NPM to reveal regex-based prompt flaggers, alongside a breakdown of a severe remote access trojan (RAT) injected via a decoy Axios package release.
AI & Machine Learning#
The NVIDIA Developer channel video Insights from NVIDIA Research | NVIDIA GTC provides exceptional technical signal on combating the training data shortage by shifting RL earlier into the pre-training phase (“RLP”), forcing models to “think” for dense, verifier-free rewards to gain durable 4-20% accuracy bumps on models like Qwen and NeMoTron. It also details NVIDIA’s “Groot” foundation models for robotics, highlighting a “Dreams Dojo” system that synthesizes massive amounts of training data by running action models backwards from human video. Additionally, the OpenAI channel video LG Uplus Creates Next Gen AICC shows a practical production case study using the Realtime API for speech-to-speech customer service that entirely bypasses traditional transcription layers to natively preserve human intonation and handle low-latency interruptions.
Hardware & Infrastructure#
Insights from NVIDIA Research | NVIDIA GTC from NVIDIA Developer breaks down the physical limits of fast inference decode, proposing future architectures with DRAM stacked directly on top of the GPU die and SRAM co-located with arithmetic units to virtually eliminate the energy wasted on data movement. For cloud infrastructure security, the AWS Events channel offers a solid primer in The Circle is Closing - Enhancing Your AWS Data Perimeter, mapping out how to build defense-in-depth perimeters by layering Service Control Policies (SCPs) as primary boundary weapons, Resource Control Policies (RCPs) for granular asset protection, and VPC endpoint policies as network armor. On the defense hardware front, the All-In Podcast episode Anduril & Palantir: How Silicon Valley Is Rebuilding America’s Military discusses the vital shift from exquisite, high-cost legacy defense platforms to attritable mass systems, detailing Anduril’s “Arsenal 1” factory that is designed to operate like a highly flexible, software-defined contract manufacturer for military drones.
Everything Else#
If you need a lightweight distraction, the GitHub channel highlighted a neat open-source Next.js and Three.js project in Turn your GitHub profile into a 3D city that visualizes your commit history as a 3D pixel art city where building height correlates directly with contribution counts. Finally, the All-In Podcast clip Retardmaxxing: The Next Great Modern Philosophy? brings attention to a niche internet philosophy of the same name, which is essentially a crude rebranding of stoic pragmatism for modern men: stop overthinking, go to work, and just enjoy your life.