Hacker News — 2026-05-27
Top Story
Matrix Multiplications on GPUs Run Faster When Given “Predictable” Data

A matrix multiplication is supposed to cost the same regardless of its inputs: it executes the same number of operations and memory accesses whatever the tensor’s contents. Yet initializing the matrices with zeros or ones yields measurably faster performance than filling them with normally distributed random data. The culprit is dynamic switching power. Predictable data causes fewer transistor state flips, so the GPU draws less power, and its Voltage Regulator Module is not pushed into aggressively throttling clock frequencies under heavy load.
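To make the "state flips" idea concrete, here is a minimal sketch in plain Python. It is not a GPU benchmark and not the method used in the linked story. It only counts how many bits change between consecutive float32 values in a stream, as a crude stand-in for the switching activity factor in the usual dynamic power relation P ≈ α·C·V²·f. The helper names (`float32_bits`, `mean_bit_flips`) and the sample size are assumptions made for illustration.

```python
import random
import struct
from typing import Sequence


def float32_bits(x: float) -> int:
    """Return the IEEE 754 single-precision bit pattern of x as an integer."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def mean_bit_flips(values: Sequence[float]) -> float:
    """Average number of bits that differ between consecutive float32 values.

    Illustrative proxy only: real switching activity depends on the
    hardware datapath, not just on the operand stream.
    """
    if len(values) < 2:
        return 0.0
    bits = [float32_bits(v) for v in values]
    flips = sum(bin(a ^ b).count("1") for a, b in zip(bits, bits[1:]))
    return flips / (len(bits) - 1)


rng = random.Random(0)  # fixed seed so the run is repeatable
n = 10_000              # hypothetical sample size

zeros = [0.0] * n
ones = [1.0] * n
normal = [rng.gauss(0.0, 1.0) for _ in range(n)]

flips_zeros = mean_bit_flips(zeros)
flips_ones = mean_bit_flips(ones)
flips_normal = mean_bit_flips(normal)

print(f"zeros:  {flips_zeros:.2f} bit flips per step")
print(f"ones:   {flips_ones:.2f} bit flips per step")
print(f"normal: {flips_normal:.2f} bit flips per step")
```

Constant inputs produce no bit changes at all between successive values, while normally distributed values flip a sizeable share of the 32 bits at every step. Under the assumption that operand toggling tracks power draw, that is the direction of the effect described above.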