How AI Discovered a Faster Matrix Multiplication Algorithm

Quanta Magazine
22 May 2023 · 13:00

Summary

TLDR: The enigmatic operation of matrix multiplication, fundamental in fields from computer graphics to quantum physics, has been revolutionized by a breakthrough from Google's AI research lab, DeepMind. Matrix multiplication is traditionally a time-consuming process, with the standard algorithm requiring a cubic number of steps relative to the matrix size. DeepMind's AlphaTensor, an AI built on reinforcement learning, discovered a new algorithm that reduces the number of multiplication steps needed for 4x4 matrices with binary (modulo-2) elements. This achievement not only breaks a 50-year-old record but also exemplifies the potential for AI to augment human intelligence: mathematicians have already built upon AlphaTensor's findings to further optimize matrix multiplication methods.

Takeaways

  • 🧮 **Matrix Multiplication Importance**: Matrix multiplication is a fundamental operation in mathematics, crucial for fields like computer graphics, neural networks, and quantum physics.
  • 🚀 **Efficiency Challenge**: Finding more efficient matrix multiplication methods is a significant challenge; even small speedups bring larger problems within reach of reasonable computation time.
  • 📚 **Standard Algorithm**: The traditional method for multiplying matrices is straightforward but requires N-cubed steps for N-by-N matrices, which becomes impractical for large matrices.
  • 🔍 **Strassen's Algorithm**: Volker Strassen's 1969 algorithm reduces the number of multiplication steps from eight to seven for 2x2 matrices, yielding substantial computational savings for larger matrices.
  • 🏆 **Winograd's Proof**: Shmuel Winograd proved that no algorithm can multiply two 2x2 matrices using six or fewer multiplications, establishing Strassen's seven-step algorithm as optimal for that case.
  • 🤖 **DeepMind's Breakthrough**: Google's DeepMind AI lab discovered a new algorithm that surpasses Strassen's method for multiplying 4x4 matrices with binary (modulo-2) elements, setting a new record.
  • 🤹 **AlphaTensor**: DeepMind's AlphaTensor uses reinforcement learning, descended from the AlphaGo lineage, to find more efficient matrix multiplication algorithms by playing a 'game' of minimizing multiplication steps.
  • 🧠 **Reinforcement Learning**: AlphaTensor learns through strategic penalties and rewards, optimizing its approach toward the most efficient matrix multiplication algorithms.
  • 🧩 **Tensor Decomposition**: Breaking a 3D tensor down into rank-1 tensors represents the steps of a matrix multiplication algorithm, with fewer rank-1 tensors corresponding to fewer multiplication steps.
  • 🔬 **Pattern Discovery**: AlphaTensor's training led it to patterns for efficient tensor decomposition; it rediscovered Strassen's algorithm within minutes and then surpassed it.
  • 🤝 **Human-AI Collaboration**: Collaboration between AI programs like AlphaTensor and mathematicians can lead to new discoveries, with AI providing tools and insights that guide mathematicians' intuition.

Q & A

  • What is matrix multiplication and why is it significant?

    -Matrix multiplication is a fundamental operation in mathematics used in various fields such as computer graphics, neural networks, and quantum physics. It involves performing mathematical operations on a two-dimensional array of numbers. Its significance lies in its widespread application in engineering, physics, and computational processes, where efficiency in matrix multiplication can lead to solving larger and more complex problems in a reasonable time.

  • Why is finding more efficient ways to multiply matrices a challenge?

    -Finding more efficient matrix multiplication methods is challenging because as the size of the matrices increases, the number of operations required grows rapidly, leading to a significant increase in computation time. Traditional methods become unwieldy for large matrices, thus the need for algorithms that can reduce the number of steps required to multiply matrices together.

  • What is the standard method for multiplying two 2x2 matrices?

    -The standard method involves multiplying elements from the first row of matrix A with the first column of matrix B, then adding them to get the first element of matrix C. This process is repeated for each row and column, resulting in eight multiplication steps for two 2x2 matrices.
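
A minimal Python sketch of this schoolbook procedure (illustrative code, not from the video; the names are our own) makes the step count explicit: for N-by-N inputs the triple loop performs N³ scalar multiplications.

```python
def standard_matmul(A, B):
    """Multiply two N x N matrices with the schoolbook algorithm.

    Each entry C[i][k] is the dot product of row i of A with
    column k of B, so the total work is N^3 multiplications.
    """
    n = len(A)
    C = [[0] * n for _ in range(n)]
    for i in range(n):          # row of A
        for k in range(n):      # column of B
            for j in range(n):  # accumulate the dot product
                C[i][k] += A[i][j] * B[j][k]
    return C

# Two 2x2 matrices: 2^3 = 8 multiplications in total.
print(standard_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
# [[19, 22], [43, 50]]
```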

  • Who is Volker Strassen and what is his contribution to matrix multiplication?

    -Volker Strassen is a German mathematician known for his work in analyzing algorithms. In 1969, he discovered a new algorithm for multiplying two 2x2 matrices that requires only seven multiplication steps, which was a significant improvement over the standard eight-step method.

  • What is the significance of Strassen's algorithm for larger matrices?

    -Strassen's algorithm offers dramatic computational savings for larger matrices because it allows them to be broken down into smaller ones. This means that the savings in multiplication steps can propagate over and over as the matrices are nested, resulting in fewer overall multiplication steps compared to the standard algorithm.
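
For concreteness, here is a sketch of Strassen's seven products for the 2x2 case, following his published formulas (the variable names are our own). For larger matrices the same formulas are applied recursively, treating each symbol as a matrix block rather than a number.

```python
def strassen_2x2(A, B):
    """Multiply two 2x2 matrices with Strassen's seven products.

    The standard algorithm needs 8 multiplications; Strassen's
    trades one multiplication for extra additions/subtractions.
    """
    (a11, a12), (a21, a22) = A
    (b11, b12), (b21, b22) = B
    m1 = (a11 + a22) * (b11 + b22)
    m2 = (a21 + a22) * b11
    m3 = a11 * (b12 - b22)
    m4 = a22 * (b21 - b11)
    m5 = (a11 + a12) * b22
    m6 = (a21 - a11) * (b11 + b12)
    m7 = (a12 - a22) * (b21 + b22)
    return [[m1 + m4 - m5 + m7, m3 + m5],
            [m2 + m4, m1 - m2 + m3 + m6]]

print(strassen_2x2([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
# [[19, 22], [43, 50]]
```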

  • What is the significance of the new algorithm discovered by DeepMind for multiplying four by four matrices?

    -The new algorithm discovered by DeepMind is significant because it beats Strassen's for four by four matrices whose elements are only zero or one (modulo 2), allowing large matrices to be multiplied even faster by breaking them down into four by four blocks instead of two by two blocks. This breakthrough could lead to more efficient computations in various fields.

  • How does AlphaTensor, the AI developed by DeepMind, work?

    -AlphaTensor is built on a reinforcement learning algorithm called AlphaZero. It plays a 'game' where it is rewarded for using fewer unique rank-1 tensors to decompose a 3D tensor representing a matrix multiplication operation. This approach allows it to discover more efficient matrix multiplication algorithms.

  • What is the role of a tensor in the context of AlphaTensor?

    -A tensor is an array of numbers with any number of dimensions. In the context of AlphaTensor, the process of multiplying any two matrices of a given size can be described by a single unique 3D tensor. This tensor is used to represent and decompose the matrix multiplication operation, with each rank-1 tensor describing a multiplication step in the algorithm.
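
To make this concrete, here is a minimal numpy sketch (our own illustrative construction, not DeepMind's code) that builds the 4x4x4 tensor describing 2x2 matrix multiplication as a sum of eight rank-1 tensors, one per multiplication step of the standard algorithm; indices flatten each 2x2 matrix in row-major order.

```python
import numpy as np

def rank1(u, v, w):
    """Outer product of three vectors: a rank-1 3D tensor."""
    return np.einsum('i,j,k->ijk', u, v, w)

def e(i, n=4):
    """Standard basis vector over the flattened matrix entries
    (a1, a2, a3, a4) in row-major order."""
    vec = np.zeros(n, dtype=int)
    vec[i] = 1
    return vec

# The standard algorithm's 8 multiplications: step (i, j, k) means
# "multiply a-entry i by b-entry j and add the product to c-entry k".
steps = [(0, 0, 0), (1, 2, 0),   # c1 = a1*b1 + a2*b3
         (0, 1, 1), (1, 3, 1),   # c2 = a1*b2 + a2*b4
         (2, 0, 2), (3, 2, 2),   # c3 = a3*b1 + a4*b3
         (2, 1, 3), (3, 3, 3)]   # c4 = a3*b2 + a4*b4

T = sum(rank1(e(i), e(j), e(k)) for i, j, k in steps)
print(T.shape)  # (4, 4, 4) -- the 2x2 matrix multiplication tensor
```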

  • How does reinforcement learning play a role in AlphaTensor's discovery process?

    -Reinforcement learning is a technique that strategically penalizes and rewards an AI system as it experiments with different ways to achieve its given task. In AlphaTensor's case, it is rewarded for using fewer rank-1 tensors to decompose the 3D tensor, driving the program towards an optimal solution for matrix multiplication.
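
As a schematic illustration only (a hypothetical toy environment with invented names, far simpler than AlphaTensor's actual training setup), the game can be sketched like this: the state is the residual tensor, each move subtracts one rank-1 tensor at a cost of one, and the episode ends when the residual reaches zero.

```python
import numpy as np

class DecompositionGame:
    """Toy sketch of the tensor decomposition game (hypothetical,
    schematic; AlphaTensor's real training setup is far richer).

    State: the residual 3D tensor still left to explain.
    Move:  subtract the rank-1 tensor built from (u, v, w); cost 1.
    Goal:  reach the all-zero tensor in as few moves as possible.
    """

    def __init__(self, target):
        self.residual = target.copy()  # e.g. the 4x4x4 matmul tensor
        self.cost = 0

    def play(self, u, v, w):
        """Apply one move; return True when fully decomposed."""
        self.residual -= np.einsum('i,j,k->ijk', u, v, w)
        self.cost += 1                  # each move incurs a penalty
        return not self.residual.any()
```

Playing the standard algorithm's eight moves on the 2x2 multiplication tensor would finish at cost 8 and Strassen's seven moves at cost 7; a learned policy searches this space for even cheaper complete decompositions.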

  • What is the potential impact of AI systems like AlphaTensor on the field of mathematics?

    -AI systems like AlphaTensor have the potential to assist mathematicians in discovering new results and guiding their intuition. They can handle large, complex computations that would be impractical for humans to perform. However, they are not expected to replace mathematicians but rather to serve as tools that empower mathematicians to achieve more.

  • How did the mathematical community respond to the results published by AlphaTensor?

    -The mathematical community responded positively to the results published by AlphaTensor. For instance, two mathematicians in Austria, Manuel Kauers and Jakob Moosbauer, used AlphaTensor's algorithm as inspiration to further optimize the process, demonstrating a successful collaboration between AI technology and mathematicians.

  • What is the current understanding of the collaboration between AI and mathematicians?

    -The current understanding is that AI and mathematicians can collaborate effectively, with AI providing tools and insights that can help mathematicians find new results and solve complex problems. This collaboration is seen as a frontier that is only now being fully explored, with the potential to empower people to do more in the field of mathematics.

Outlines

00:00

🧮 Matrix Multiplication: An Enigma Solved by AI

Matrix multiplication is a fundamental operation in mathematics, crucial for fields like computer graphics, neural networks, and quantum physics. The operation itself is simple, but finding the most efficient way to perform it remains an open problem for mathematicians. Researchers have long sought faster methods, since even small speedups bring larger problems within reach. The traditional method's step count grows with the cube of the matrix size, which becomes impractical for large matrices. Volker Strassen's algorithm, discovered in 1969, reduced the number of multiplication steps for 2x2 matrices from eight to seven, offering significant computational savings for larger matrices. Then, in October 2022, Google's DeepMind presented an algorithm that surpasses Strassen's for multiplying 4x4 matrices with binary elements, demonstrating the potential of AI to advance mathematical computation.

05:02

🤖 AlphaTensor: AI's Role in Mathematical Discovery

AlphaTensor, an AI developed by DeepMind, utilized reinforcement learning to tackle the challenge of finding more efficient matrix multiplication algorithms. The process involved treating the task as a game, where the AI was rewarded for using fewer steps to decompose a 3D tensor representing a matrix multiplication. This approach led to the discovery of new algorithms, including one that improved upon Strassen's method for 4x4 matrices with modulo-2 elements. The use of AI in mathematical research is not new, but the success of AlphaTensor highlights the potential for collaboration between AI and mathematicians. The AI's discoveries have already inspired human mathematicians to further refine these algorithms, indicating a future where AI serves as a tool to augment human intellect rather than replace it.

10:03

🚀 The Future of AI and Mathematical Research

The exploration of tensor decomposition by AlphaTensor led to the discovery of new, faster algorithms for matrix multiplication, including a record-breaking method for 4x4 matrices with binary elements. This achievement not only demonstrated the power of AI in mathematical research but also its capacity to inspire and assist human mathematicians. The collaboration between AI and mathematicians, as seen with the work of Manuel Kauers and Jakob Moosbauer who built upon AlphaTensor's findings, signifies a promising frontier in scientific advancement. The integration of AI tools is expected to empower mathematicians, enhancing their ability to explore complex problems and discover new solutions, rather than making them obsolete.

Keywords

💡Matrix multiplication

Matrix multiplication is a fundamental mathematical operation that involves combining two matrices to produce a third matrix. It is a key concept in the video as it is the central operation that researchers are trying to optimize for efficiency. The video discusses how matrix multiplication is used in various fields such as computer graphics, neural networks, and quantum physics, and how improving its efficiency can solve larger problems more quickly.

💡Efficiency

Efficiency in the context of the video refers to the speed and computational resources required to perform matrix multiplication. The pursuit of more efficient matrix multiplication methods is a major theme, as it can expand the scope of solvable problems by making larger computations feasible within a reasonable time frame.

💡Volker Strassen

Volker Strassen is a German mathematician who is highlighted in the video for his discovery of a new algorithm for multiplying two by two matrices, which requires only seven multiplication steps instead of the traditional eight. His algorithm is significant because it offers computational savings for larger matrices, making it a cornerstone in the field for over 50 years.

💡Strassen's algorithm

Strassen's algorithm is a method for matrix multiplication that reduces the number of multiplications needed to perform the operation. The video emphasizes its importance in the history of matrix multiplication as it was the most efficient known method for a long time. It is also the benchmark that the new algorithm from DeepMind's AlphaTensor aims to surpass.

💡DeepMind

DeepMind is Google's artificial intelligence research lab that is featured in the video for discovering a new matrix multiplication algorithm that surpasses Strassen's algorithm for specific cases. DeepMind is known for training AI systems to master games and for its significant contributions to AI research, making it a key player in the advancements discussed.

💡AlphaTensor

AlphaTensor is an algorithm descended from AlphaGo, developed by DeepMind, that uses reinforcement learning to discover more efficient ways of performing matrix multiplication. It is the AI system that led to the breakthrough in matrix multiplication efficiency, particularly for matrices with elements of zero or one.

💡Reinforcement learning

Reinforcement learning is a type of machine learning where an agent learns to make decisions by performing actions in an environment to maximize a reward. In the video, AlphaTensor uses reinforcement learning to experiment with different ways to achieve efficient matrix multiplication, which is likened to playing a game with strategic penalties and rewards.

💡Tensor decomposition

Tensor decomposition is a mathematical technique used to break down a tensor into simpler building blocks, known as rank-1 tensors. In the context of the video, this technique is applied to represent and optimize the process of matrix multiplication, where each rank-1 tensor corresponds to a multiplication step in an algorithm.
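
As an illustration of that correspondence, the sketch below (our own encoding, using Strassen's published coefficients) checks that his seven rank-1 tensors add back up to exactly the same 4x4x4 tensor as the standard algorithm's eight.

```python
import numpy as np

def rank1(u, v, w):
    return np.einsum('i,j,k->ijk', np.array(u), np.array(v), np.array(w))

# Entries are flattened row-major: a = (a11, a12, a21, a22), same for b, c.
# In each triple, u and v say which combination of entries to multiply,
# and w says which entries of C receive the product (with sign).
strassen = [
    ([1, 0, 0, 1],  [1, 0, 0, 1],  [1, 0, 0, 1]),    # (a11+a22)(b11+b22)
    ([0, 0, 1, 1],  [1, 0, 0, 0],  [0, 0, 1, -1]),   # (a21+a22) b11
    ([1, 0, 0, 0],  [0, 1, 0, -1], [0, 1, 0, 1]),    # a11 (b12-b22)
    ([0, 0, 0, 1],  [-1, 0, 1, 0], [1, 0, 1, 0]),    # a22 (b21-b11)
    ([1, 1, 0, 0],  [0, 0, 0, 1],  [-1, 1, 0, 0]),   # (a11+a12) b22
    ([-1, 0, 1, 0], [1, 1, 0, 0],  [0, 0, 0, 1]),    # (a21-a11)(b11+b12)
    ([0, 1, 0, -1], [0, 0, 1, 1],  [1, 0, 0, 0]),    # (a12-a22)(b21+b22)
]
T_strassen = sum(rank1(u, v, w) for u, v, w in strassen)

# Rebuild the same tensor from the standard algorithm's 8 steps
# (c1 = a1*b1 + a2*b3, etc.) and confirm both decompositions agree.
def e(i):
    vec = np.zeros(4, dtype=int)
    vec[i] = 1
    return vec

steps = [(0, 0, 0), (1, 2, 0), (0, 1, 1), (1, 3, 1),
         (2, 0, 2), (3, 2, 2), (2, 1, 3), (3, 3, 3)]
T_standard = sum(rank1(e(i), e(j), e(k)) for i, j, k in steps)

print(np.array_equal(T_strassen, T_standard))  # True
```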

💡AlphaZero

AlphaZero is a reinforcement learning algorithm developed by DeepMind that is the basis for AlphaTensor. It is known for mastering games like chess and Go, and its underlying principles are used to tackle the complex problem of finding more efficient matrix multiplication algorithms.

💡Modulo-2

Modulo-2 arithmetic allows only the values zero and one, with addition wrapping around so that 1 + 1 = 0. In the video it describes the special case of matrix multiplication where the matrix elements are only zero or one. The new algorithm discovered by AlphaTensor is particularly effective in this setting, breaking the longstanding record for matrix multiplication efficiency.

💡Collaboration between AI and mathematicians

The video discusses the collaborative potential between artificial intelligence, represented by programs like AlphaTensor, and human mathematicians. It highlights how AI can assist in discovering new mathematical insights, which can then be further developed and understood by mathematicians, leading to a synergy that empowers both fields.

Highlights

Matrix multiplication is a fundamental operation in mathematics with applications in computer graphics, neural networks, and quantum physics.

Efficient matrix multiplication methods are sought after for solving larger problems more quickly.

Standard matrix multiplication algorithms take N-cubed steps, which becomes unwieldy for large matrices.

Volker Strassen's algorithm reduces the number of multiplication steps from eight to seven for 2x2 matrices.

Strassen's algorithm provides computational savings for larger matrices by breaking them down into smaller ones.

Shmuel Winograd proved that no algorithm can use six or fewer multiplications for 2x2 matrices, making Strassen's algorithm optimal.

DeepMind's AI lab discovered a new algorithm that beats Strassen's for multiplying 4x4 matrices with binary elements.

The new algorithm allows for even faster multiplication by breaking matrices into 4x4 blocks instead of 2x2.

AlphaTensor, an AI developed by DeepMind, uses reinforcement learning to find more efficient matrix multiplication algorithms.

Reinforcement learning involves penalizing and rewarding an AI as it experiments with different methods to achieve a task.

AlphaTensor uses tensor decomposition to simplify the process of finding efficient multiplication algorithms.

The AI program can represent the entire standard matrix multiplication algorithm through the decomposition of a 3D tensor.

DeepMind constructed a single-player game for AlphaTensor to learn efficient matrix multiplication methods.

AlphaTensor's algorithm for 4x4 matrices with modulo-2 elements uses only 47 multiplications, breaking a 50-year record.

The discovery of new algorithms by AlphaTensor has inspired mathematicians to further refine and optimize these methods.

Manuel Kauers and Jakob Moosbauer used AlphaTensor's algorithm as a starting point to reduce the steps for 5x5 matrix multiplication.

The collaboration between AI technology and mathematicians is seen as a promising frontier for future advancements.

AI tools like AlphaTensor are not expected to replace mathematicians but rather to empower them to achieve more.

Transcripts

00:00
There is an enigmatic and powerful mathematical operation at work inside everything from computer graphics and neural networks to quantum physics. It's simple enough for high school students to grasp, yet so complex that even seasoned mathematicians haven't mastered it. This operation is called matrix multiplication.

00:20
Matrix multiplication is a very fundamental operation in mathematics that appears in many computations in engineering and in physics. A matrix is a two-dimensional array of numbers on which you can perform operations like addition and multiplication.

00:35
Researchers have long sought more efficient ways to multiply matrices together. So if you even just make that a little bit faster, larger problems come into reach, where for now we would say that's too big to be computable in reasonable time.

00:50
However, actually finding faster matrix multiplication methods is a huge challenge. But thanks to a new tool, researchers have finally broken a long-standing matrix multiplication record, one that's more than 50 years old. What's their secret weapon?

01:07
Students of linear algebra are taught a method for multiplying matrices based on a centuries-old algorithm. It goes like this. Multiply elements from the first row of matrix A and the first column of matrix B and add them to get the first element of matrix C. Then repeat for the first row of matrix A and the second column of matrix B, and add them for the second element in matrix C. And so on.

01:32
Multiplying two two-by-two matrices this way takes eight multiplication steps. Multiplying any two N-by-N matrices with the standard algorithm takes N-cubed steps, which is why applying this method to pairs of larger matrices quickly becomes unwieldy.

01:49
If you take a matrix that is twice as big, then you have to have a computation time that is eight times more. So you can imagine, if you doubled it a couple of times, then you take eight times more a couple of times, and you will very soon reach the limits of what a computer can do.

02:06
Enter Volker Strassen, a German mathematician known for his work analyzing algorithms. In 1969, he discovered a new algorithm to multiply two-by-two matrices that requires only seven multiplication steps.

02:19
Going from eight down to seven multiplications may seem trivial, and the new addition steps look more complicated. But Strassen's algorithm offers dramatic computational savings for larger matrices. That's because when multiplying large matrices, they can be broken down into smaller ones. For example, an eight-by-eight matrix can reduce to a series of nested two-by-two matrices. So Strassen's savings, applied to these smaller matrix multiplications, propagate over and over. Applying Strassen's to an eight-by-eight matrix results in a third fewer multiplication steps compared to the standard algorithm. For very large matrices, these savings vastly outweigh the computation costs of the extra additions.

03:04
A year after Strassen invented his algorithm, IBM researcher Shmuel Winograd proved it was impossible to use six or fewer multiplications to multiply two-by-two matrices, thus also proving that Strassen's, with its seven multiplications, is the best solution.

03:21
For half a century the most efficient method known for multiplying two matrices of any reasonable size was to break them down and apply Strassen's algorithm. That was until October 2022, when a new algorithm was revealed that beat Strassen's, specifically for multiplying two four-by-four matrices where the elements are only zero or one. This new algorithm made it possible to multiply large matrices even faster by breaking them into four-by-four matrices instead of two-by-twos.

03:51
So who, or what, was behind this breakthrough? This new algorithm was discovered by Google's artificial intelligence research lab, DeepMind. For more than a decade, DeepMind has garnered attention for training AI systems to master a host of games, everything from Atari Pong to chess. Then, in 2016, DeepMind's AlphaGo achieved what was considered impossible at the time: it defeated the top-ranked human Go player, Lee Sedol, in a best-of-five match. This victory shattered the limited notion of what's possible for computers to achieve.

04:28
DeepMind then set its sights on a problem even more challenging than Go. I was very surprised that even for very small cases, we don't even know what's the optimal way of doing matrix multiplication. And at some point, we realized that this is actually a very good fit for machine learning techniques.

04:48
To tackle matrix multiplication, DeepMind started with an algorithm descended from AlphaGo called AlphaTensor. AlphaTensor is built on a reinforcement learning algorithm called AlphaZero. So what one needs to do is to go really beyond the AlphaZero reinforcement learning algorithm, to tackle this huge search space and to develop techniques to find these needles in a very, very large haystack.

05:18
AlphaTensor isn't the first computer program to assist with mathematical research. In 1976, two mathematicians proved what's called the Four Color Theorem using a computer. The theorem states you only need four colors to fill in any map so no neighboring regions match. The pair verified their proof by processing all 1,936 required cases, requiring more than 1,000 hours of computing time. Back then the larger mathematical community was not prepared to cede logical reasoning to a machine. However, the field has since come a long way.

05:55
AlphaTensor was trained with a technique called reinforcement learning, which is kind of like playing a game. Reinforcement learning strategically penalizes and rewards an AI system as it experiments with different ways to achieve its given task, driving the program towards an optimal solution. But what kind of game should AlphaTensor play in search of more efficient matrix multiplication algorithms?

06:18
This is where the term tensor in AlphaTensor comes into play. A tensor is just an array of numbers with any number of dimensions. Vectors are 1D tensors, and matrices are 2D tensors. The process of multiplying any two matrices of a given size can be described by a single unique 3D tensor. For example, when multiplying any two two-by-two matrices, we can build the corresponding 3D tensor. Each dimension of this cube represents one of the matrices; each element in the cube can be one, zero, or negative one. The matrix product C is created by combining elements from matrices A and B, like this. And so on, until you have the full matrix multiplication tensor.

07:15
Now, you can use a process called tensor decomposition to break down this 3D tensor into building blocks, similar to taking apart a cube puzzle. One natural way to break tensors down is into what are called rank-1 tensors, which are just products of vectors. The trick is each rank-1 tensor here describes a multiplication step in a matrix multiplication algorithm. For example, this rank-1 tensor represents the first multiplication step in the standard algorithm: A1 times B1. The next rank-1 tensor represents A2 times B3. Adding these two rank-1 tensors yields the first element in the product, C1. Here are the next two rank-1 tensors, representing the multiplications A1 times B2 and A2 times B4, which form C2. Eventually, the entire standard algorithm with its eight multiplication steps is represented by decomposing the 3D tensor into eight rank-1 tensors. These all add back up into the original 3D tensor.

08:19
But it's possible to decompose a 3D tensor in different ways. Strassen's seven multiplication steps for the same two-by-two matrix multiplication look like this. These rank-1 tensors are more complex, and this is still a full decomposition, but in fewer steps, which add back up to the original tensor. So the fewer rank-1 tensors you use to fully decompose a 3D tensor, the fewer multiplication steps used in the tensor's corresponding matrix multiplication.

08:48
DeepMind's construction of a single-player game for AlphaTensor to play, and learn from, was key. "Find an algorithm for this matrix multiplication that requires the fewest multiplication steps possible" is a vague request. But it becomes a clearly defined computer task once it's formulated as: decompose this 3D tensor using as few unique rank-1 tensors as possible.

09:11
It's really hard to describe what the search space looks like. In the particular case of matrix multiplication, it's quite convenient to formulate it in that space, because then we can deploy our search techniques and our machine learning techniques in order to search in that very, very large, but formalizable, search space.

09:31
AlphaTensor's play was simple. It was programmed to guess rank-1 tensors to subtract from the original 3D tensor, to decompose it down to zero. The fewer rank-1 tensors it used, the more reward it got.

09:46
Each rank-1 tensor that you remove from the 3D tensor has a cost, let's say a cost of one. And so we want to figure out what's the way of achieving the goal with the fewest penalties. And so that's what the system is trying to learn how to do. It's learning to estimate: when I'm in this kind of configuration, roughly how many penalties do I think I'm going to incur before I get to the goal?

10:11
But tensor decomposition is not an easy game to master. For even a three-by-three matrix multiplication with only elements zero or one, the number of possible tensor decompositions exceeds the number of atoms in the universe.

10:27
Still, over the course of its training, AlphaTensor started to home in on patterns to decompose the tensor efficiently. Within minutes it rediscovered Strassen's algorithm. Then the program went even further. It beat Strassen's algorithm for multiplying two four-by-four matrices in modulo-2, where the elements are only zero or one, breaking the 50-year record. Instead of the standard algorithm's 64 multiplication steps or Strassen's 49, AlphaTensor's algorithm used only 47 multiplications. AlphaTensor also discovered thousands of other new fast algorithms, including ones for five-by-five matrices in modulo-2.

11:08
So, will programs like AlphaTensor, churning away in server rooms, pulling new mathematical discoveries from lines of code, make mathematicians obsolete?

11:19
Ultimately, I think this will not replace the mathematicians or anything like that. This, I think, provides a good tool that can help mathematicians find new results and guide their intuition.

11:36
That's exactly what happened just days after the AlphaTensor results were first published in the journal Nature. Two mathematicians in Austria, Manuel Kauers and Jakob Moosbauer, used AlphaTensor's 96-step five-by-five matrix multiplication algorithm as inspiration to push even further.

11:55
And then Jakob suggested we just take the algorithm that the AlphaTensor people found as a starting point and see whether there's something in the neighborhood of this, and we could have an additional drop. And indeed, that was the case. So it was a very short computation for our new mechanism that was able to reduce the 96 to 95. And this we then published in the arXiv paper.

12:23
The right way to look at this is as a fantastic example of a collaboration between a particular kind of technology and mathematicians. The true potential for human and artificial intelligence collaboration is a frontier that is only now being fully explored. I don't think people can be made irrelevant by any of this kind of work. I think it empowers people to do more.

