Sources
- Airbnb Engineering
- Amazon AWS AI Blog
- AWS Architecture Blog
- AWS Open Source Blog
- BrettTerpstra.com
- ByteByteGo
- CloudFlare
- Dropbox Tech Blog
- Facebook Code
- GitHub Engineering
- Google AI Blog
- Google DeepMind
- Google Open Source Blog
- HashiCorp Blog
- InfoQ
- Spotify Engineering
- Microsoft Research
- Mozilla Hacks
- Netflix Tech Blog
- NVIDIA Blog
- O'Reilly Radar
- OpenAI Blog
- SoundCloud Backstage Blog
- Stripe Blog
- The Batch | DeepLearning.AI | AI News & Insights
- The Dropbox Blog
- The GitHub Blog
- The Netflix Tech Blog
- The Official Microsoft Blog
- Vercel Blog
- Yelp Engineering and Product Blog
Engineering @ Scale — 2026-04-13#
Signal of the Day#
When using large language models for recommendation systems, passing raw numerical counts ruins the signal because the model processes digits as text tokens rather than magnitudes. By converting raw engagement counts into percentile buckets wrapped in special tokens (e.g., <view_percentile>71</view_percentile>), LinkedIn increased the correlation between popularity and embedding similarity 30x, offering a highly reusable pattern for safely encoding structured numerical data into transformer contexts.