{"id":1116030,"date":"2023-07-02T13:41:00","date_gmt":"2023-07-02T17:41:00","guid":{"rendered":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/uncategorized\/robocat-google-deepminds-innovative-leap-into-ai-powered-the-jerusalem-post\/"},"modified":"2023-07-02T13:41:00","modified_gmt":"2023-07-02T17:41:00","slug":"robocat-google-deepminds-innovative-leap-into-ai-powered-the-jerusalem-post","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/robotics\/robocat-google-deepminds-innovative-leap-into-ai-powered-the-jerusalem-post\/","title":{"rendered":"RoboCat: Google DeepMind&#8217;s innovative leap into AI-powered &#8230; &#8211; The Jerusalem Post"},"content":{"rendered":"<p><p>    In a significant advancement in robotics, Google DeepMind has    introduced a new AI agent named RoboCat. This agent    is designed to learn a variety of tasks across different    robotic arms, showcasing the ability to self-generate new    training data to improve its techniques, marking a crucial step    towards the creation of general-purpose robots.  <\/p>\n<p>    RoboCat, a Transformer model with a    VQ-GAN encoder, was released in June 2023. It is primarily    intended for research into learning to accomplish a wide    variety of tasks from expert demonstrations or multiple real    robot embodiments for manipulation.  <\/p>\n<p>    The primary intended users are Google DeepMind researchers, and    it's not intended for commercial or production use.  <\/p>\n<p>    RoboCat's standout feature is its learning speed. It can master    a new task with as few as 100 demonstrations, leveraging a    large and diverse dataset. This capability reduces the need for    human-supervised training, potentially accelerating the pace of    robotics research.  <\/p>\n<p>    RoboCat's training involves a comprehensive five-step    self-improvement process. It starts with collecting 100-1000    demonstrations of a new task or robot, using a robotic arm controlled by a human. This new task or    arm data is used to fine-tune RoboCat, creating a specialized    spin-off agent. This agent practices the new task or arm an    average of 10,000 times, generating more training data.  <\/p>\n<p>    The demonstration data and self-generated data are then    incorporated into RoboCats existing training dataset, and a    new version of RoboCat is trained on the updated dataset.  <\/p>\n<p>    This process enables RoboCat to learn from a wide range of    tasks and diverse training data types. Having been trained on    millions of trajectories from both real and simulated robotic    arms, RoboCat handles a variety of tasks involving different    objects and variations, sourced from Reinforcement Learning    (RL), Teleoperation (Teleop), and RoboCat itself.  <\/p>\n<p>    These tasks include stacking RGB objects, tower and pyramid    building with RGB objects, and lifting NIST-i gears, among    others. The training involved four different types of robots    and many robotic arms to collect vision-based data representing    the tasks RoboCat would be trained to perform.  <\/p>\n<p>    RoboCat demonstrates impressive adaptability by quickly    learning to operate different robotic arms. For example, after    observing 1000 demonstrations controlled by humans, RoboCat    could successfully direct a new arm with a three-fingered    gripper and twice as many controllable inputs, achieving an 86%    success rate in picking up gears.  <\/p>\n<p>    Moreover, the more new tasks RoboCat learns, the better it gets    at learning additional new tasks. The initial version of    RoboCat achieved a 36% success rate on previously unseen tasks    after learning from 500 demonstrations per task. However, the    latest version, trained on a more diverse set of tasks, more    than doubled this success rate on the same tasks.  <\/p>\n<p>    RoboCat's performance was evaluated through various tasks, such    as inserting and removing objects from a bowl and lifting large    gears. These evaluations were conducted in both simulated and    real-world environments and compared to the performance of    human teleoperators.  <\/p>\n<p>    During the training process, RoboCat uses different    observations to understand the robot's position and grip. These    observations include joint angles, TCP position, gripper joint    angle, and gripper grasp status. The specific observations    depend on the robot and objects being used.  <\/p>\n<p>    In the development of RoboCat, an interesting comparison was    made between the VQ-GAN tokenizer and the patch ResNet used in    Gato. The patch ResNet tokenizer performed better during    training tasks but performed worse on tasks that were not    included during training.  <\/p>\n<p>    It's important to note that RoboCat is currently an early    research model and has not been evaluated for deployment and    safety outside of research environments. As RoboCat's    capabilities expand, potential ethical and safety risks need to    be carefully addressed. Therefore, caution should be exercised    when considering the use of RoboCat outside of research    settings. Nonetheless, the development of RoboCat represents a    significant milestone in the field of robotics and AI, bringing    us closer to a future where robots are an integral part of our    everyday lives.  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>Read more: <\/p>\n<p><a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/www.jpost.com\/business-and-innovation\/article-747981\" title=\"RoboCat: Google DeepMind's innovative leap into AI-powered ... - The Jerusalem Post\">RoboCat: Google DeepMind's innovative leap into AI-powered ... - The Jerusalem Post<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> In a significant advancement in robotics, Google DeepMind has introduced a new AI agent named RoboCat. This agent is designed to learn a variety of tasks across different robotic arms, showcasing the ability to self-generate new training data to improve its techniques, marking a crucial step towards the creation of general-purpose robots. RoboCat, a Transformer model with a VQ-GAN encoder, was released in June 2023.  <a href=\"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/robotics\/robocat-google-deepminds-innovative-leap-into-ai-powered-the-jerusalem-post\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[187746],"tags":[],"class_list":["post-1116030","post","type-post","status-publish","format-standard","hentry","category-robotics"],"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/1116030"}],"collection":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/comments?post=1116030"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/1116030\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/media?parent=1116030"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/categories?post=1116030"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/tags?post=1116030"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}