Building sophisticated robotic projects at home is becoming a little easier.
Earlier this week, AI Dev Platform Hugging Face released an open AI model of robotics called Smolvla. Trained with a “compatible license” community sharing dataset, Smolvla outperforms much larger models of robotics in both virtual and real-world environments, embracing facial claims.
“Smolvla aims to democratize access to vision language action (VLA) models and accelerate research into generalist robotic agents,” wrote a hugging face in a blog post. “Smolvla is not only a lightweight yet capable model, but also a way to train and evaluate Generalist Robotics (Technologies).”
Smolvla embraces the rapidly expanding efforts of the face to establish an ecosystem of low-cost robotics hardware and software. Last year, the company launched Lerobot, a collection of robot-centric models, datasets and tools. Recently, he embraced Face and acquired Pollen Robot, a French-based robotics startup, and announced several inexpensive robotics systems, including humanoids for purchase.
With a size of 450 million, Smolvla has been trained on data in the Lerobot Community Dataset. This is a specially marked robotics dataset shared by hugging Face’s AI development platform. A parameter, sometimes called a “weight,” is an internal component of the model that derives its behavior.
Hugging Face claims that Smolvla is small enough to run on a single consumer GPU, or a MacBook, and can be tested and deployed on “affordable” hardware, including the company’s robotics system.
With an interesting twist, Smolvla also supports “asynchronous inference stacks.” FaceingFace says that models can separate the robot’s actions from the processing of what they see. As the company explains in a blog post, “(b) Because of this separation, robots can respond more quickly in rapidly changing environments.”
Smolvla can be downloaded from the hugging face. Already, users of X claim they used the model to control the third-party robot arm.
It is worth noting that the embracing face is far from the only player in the early open robot races.
Nvidia has a collection of tools for open robotics, and the startup K-scale lab builds components for what is called “open source humanoids.” Other scary companies in this segment include Dyna Robotics, Jeff Bezos-backed physical intelligence, and RLWRLD.