Tesla’s Optimus humanoid robot gets improvements

September 26, 2023

Tesla has announced major improvements to its humanoid robot, Optimus, which can now pick up and sort objects, hold yoga poses, and navigate its surroundings.

One of the key reasons for Optimus’s impressive capabilities is its use of neural networks. A neural network is a type of machine learning model that lets a robot learn behaviour from data and adapt to its environment. This is in contrast to rule-based systems, which can only handle the situations their hand-written rules anticipate.

Optimus’s motion is facilitated by a sophisticated neural architecture that is trained in an end-to-end manner. This means that the robot takes in videos as input and produces actions as output.

To understand its surroundings, Optimus analyses images using an efficient Vision Transformer (ViT) or a more conventional backbone model such as ResNet or EfficientNet. Video can be processed in two ways: frame by frame, treating each frame as an individual image, or as a whole clip. Architectures such as SlowFast or RubiksNet are designed to handle video data efficiently.

While it’s not entirely clear whether Optimus responds to language prompts, if it does, there’s a mechanism for integrating language with visual perception. Techniques like Feature-wise Linear Modulation (FiLM) may be employed for this purpose, allowing language embeddings to influence the image processing pathway.
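For readers unfamiliar with FiLM, the idea is that a language embedding predicts one scale and one shift value per visual feature channel. The sketch below is a minimal pure-Python illustration of that idea, not Tesla's implementation: the function names, the toy weights, and the tiny two-channel feature map are all assumptions made up for the example.

```python
def linear(x, weights, bias):
    """Dense layer: one output value per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def film(feature_maps, lang_embedding, w_gamma, b_gamma, w_beta, b_beta):
    """Apply FiLM: scale and shift each visual channel using language.

    feature_maps is a list of channels, each a flat list of activations.
    """
    gamma = linear(lang_embedding, w_gamma, b_gamma)  # one scale per channel
    beta = linear(lang_embedding, w_beta, b_beta)     # one shift per channel
    return [[g * a + b for a in channel]
            for channel, g, b in zip(feature_maps, gamma, beta)]


# Toy inputs (hypothetical): 2 visual channels, a 3-dimensional embedding.
features = [[1.0, 2.0], [3.0, 4.0]]
embedding = [1.0, 0.0, -1.0]
w_gamma = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
b_gamma = [0.0, 0.0]
w_beta = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
b_beta = [0.5, -0.5]

modulated = film(features, embedding, w_gamma, b_gamma, w_beta, b_beta)
```

In a real system the scale and shift weights would be learned jointly with the vision backbone, so that different prompts emphasise or suppress different visual features.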

To translate continuous motion signals into discrete action tokens that the controller can work with, Optimus might use one of several methods, such as binning movements into categories or compressing them with a VQ-VAE.
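The simpler of the two options, sorting movements into categories, can be illustrated with uniform binning. This is a sketch under stated assumptions: the value range of -1 to 1, the 256 bins, and the function names are hypothetical, and nothing here is known to match what Optimus actually does.

```python
def encode_action(value, low=-1.0, high=1.0, n_bins=256):
    """Map a continuous command to a discrete token in [0, n_bins - 1]."""
    clipped = min(max(value, low), high)        # keep the value in range
    fraction = (clipped - low) / (high - low)   # rescale to 0.0 .. 1.0
    return min(int(fraction * n_bins), n_bins - 1)


def decode_action(token, low=-1.0, high=1.0, n_bins=256):
    """Map a token back to the centre of its bin."""
    width = (high - low) / n_bins
    return low + (token + 0.5) * width


# Hypothetical normalised joint commands for one time step.
joint_commands = [0.3, -0.72, 0.0]
tokens = [encode_action(c) for c in joint_commands]
recovered = [decode_action(t) for t in tokens]
```

Decoding returns the centre of each bin, so the round trip loses at most half a bin width per value. More bins mean finer control at the cost of a larger action vocabulary.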

All these components work together within a Transformer-based controller. This controller takes in video tokens (possibly modulated by language) and produces action tokens step by step. The robot continually refines its actions by observing the consequences of its previous moves, which accounts for the self-corrective behaviour seen in the demos.
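The step-by-step loop described above can be sketched with a toy example. Everything here is an assumption for illustration: a hand-written `toy_policy` stands in for the trained Transformer, a single number stands in for the video tokens, and the five-entry `ACTIONS` vocabulary is invented. The point is only the loop structure, in which each new action is chosen after observing the result of the previous one.

```python
# Hypothetical action vocabulary: each token is a step size along one axis.
ACTIONS = [-0.2, -0.1, 0.0, 0.1, 0.2]


def toy_policy(context, target):
    """Stand-in for a trained controller: read the latest observation
    and pick the action token whose step best closes the gap."""
    observations = [value for kind, value in context if kind == "observation"]
    error = target - observations[-1]
    return min(range(len(ACTIONS)), key=lambda i: abs(ACTIONS[i] - error))


def run_episode(start, target, steps=10):
    """Closed loop: act, observe the consequence, append both to the context."""
    position = start
    context = [("observation", position)]
    for _ in range(steps):
        token = toy_policy(context, target)
        position += ACTIONS[token]                 # the world responds
        context.append(("action", token))
        context.append(("observation", position))  # feedback for the next step
    return position, context


final_position, context = run_episode(start=0.0, target=0.7)
```

Because the controller looks at the latest observation before every action, an overshoot or disturbance shows up in the next observation and is corrected on the following step.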

The sources for this piece include an article in AnalyticsIndiaMag.


Jim Love

Jim is an author and podcast host with over 40 years in technology.
