Magma: A Foundation Model for Multimodal AI Agents

A State-Of-The-Art must-read paper -- "Put the sausage to hot dog"*

Apr 05, 2025

∙ Paid

Most foundation models today still live in a digital world of words.

They’ve become fluent in describing things like objects, actions, and outcomes in a beautiful cascade of tokens. But we’re still struggling to build systems that can properly act within 2D and, more importantly, 3D worlds. Here, the gap between perception and action remains wide. That …

Continue reading this post for free, courtesy of Jan Daniel Semrau (MFin, CAIO).

Or purchase a paid subscription.

Encyclopedia Autonomica

Magma: A Foundation Model for Multimodal AI Agents

A State-Of-The-Art must-read paper -- "Put the sausage to hot dog"*

Continue reading this post for free, courtesy of Jan Daniel Semrau (MFin, CAIO).