Tensordyne is making AI compute more efficient using logarithms Tensordyne is developing AI chips that use a logarithmic number system to multiply numbers more efficiently than conventional floating-point math, aiming to improve compute efficiency for AI workloads. The company claims its approach reduces power consumption, but challenges remain in converting existing models and kernels to its proprietary format without accuracy loss. Tensordyne is making AI compute more efficient using logarithms. But is compute efficiency the thing that matters? Most AI chip companies are focusing on speeding up generative AI workloads by changing the memory hierarchy. Groq https://www.zach.be/p/why-is-openai-partnering-with-cerebras achieved ultra-low-latency at a high cost by using super-fast but expensive SRAM memory, rather than the slower but more cost-effective high-bandwidth memory HBM that Nvidia relies on. Cerebras https://www.zach.be/p/why-is-openai-partnering-with-cerebras is using wafer-scale processing to take that idea even further, with each wafer providing 44GB of SRAM. d-Matrix https://www.zach.be/p/most-ai-chips-arent-cost-effective is leveraging processing-in-memory to improve performance by reducing data movement to and from memory. Taalas https://www.zach.be/p/taalas-is-what-etched-should-have is using read-only memory, which is even faster and cheaper than SRAM, but can’t be changed -- so each chip can only run one model. Tensordyne https://www.tensordyne.ai/ is approaching AI chips differently. They’re using a different kind of number system based on logarithms that allows them to multiply numbers more efficiently than chips using conventional floating point math. But translating more efficient multipliers into better performance on real AI workloads is hard. Today on the blog, we’re looking at Tensordyne’s unique technology, how it positions their chips relative to others in the market, and trying to answer the question: do more efficient compute engines actually matter for AI chips? Logarithmic multiplication The core technical advantage of Tensordyne’s architecture comes from its logarithmic number format. Instead of storing a value A as just A , they actually store log 2 A . Then, if they want to multiply A by another value, B , they can use the logarithm product rule https://en.wikipedia.org/wiki/Logarithm to perform multiplication using addition: log 2 A x B = log2 A + log2 B This idea isn’t new, but nobody has successfully used it in a commercial AI accelerator before. Running large matrix multiplications requires a lot of multipliers, but it also requires a lot of adders. The log product rule makes multiplication easier, but it makes addition much harder. Tensordyne gets around this using an approximation https://www.eetimes.com/recogni-pivots-to-data-center-ai-inference-chips/ : log 2 1+x = x for 0