Sources
- Airbnb Engineering
- Amazon AWS AI Blog
- AWS Architecture Blog
- AWS Open Source Blog
- BrettTerpstra.com
- ByteByteGo
- CloudFlare
- Dropbox Tech Blog
- Facebook Code
- GitHub Engineering
- Google AI Blog
- Google DeepMind
- Google Open Source Blog
- HashiCorp Blog
- InfoQ
- Spotify Engineering
- Microsoft Research
- Mozilla Hacks
- Netflix Tech Blog
- NVIDIA Blog
- O'Reilly Radar
- OpenAI Blog
- SoundCloud Backstage Blog
- Stripe Blog
- The Batch | DeepLearning.AI | AI News & Insights
- The Dropbox Blog
- The GitHub Blog
- The Netflix Tech Blog
- The Official Microsoft Blog
- Vercel Blog
- Yelp Engineering and Product Blog
Engineering @ Scale — 2026-05-19#
Signal of the Day#
The most critical insight this period comes from Snapchat’s billion-prediction-per-second ML platform: at massive scale, the “boring machinery” of network transport and data serialization dominates inference costs more than the ML model itself. By refactoring their data plane to transfer features as raw bytes and delaying deserialization until inside the inference engine, they achieved a 2x reduction in latency and a 10x drop in data plane costs.