Google's AI bots learn by watching movies, just like the rest of us

Google DeepMind’s robotics team is teaching robots to learn the way a human intern would: by watching a video. The team has published a new paper demonstrating how Google’s RT-2 robots, equipped with the Gemini 1.5 Pro generative AI model, can absorb information from videos to learn how to navigate a space and even carry out requests once they reach their destination.

Thanks to Gemini 1.5 Pro’s long context window, which lets the model process large amounts of information at once, a robot can be trained much like a new intern. Researchers film a video tour of a designated area, such as a home or office. The robot then watches the video and learns about the environment.


