Alexander Herzog
Publications - 4
Citations - 478
Alexander Herzog is an academic researcher who has contributed to research in computer science. The author has an h-index of 2 and has co-authored 4 publications receiving 478 citations.
Papers
Proceedings Article
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, K. Gopalakrishnan, Karol Hausman, Alexander Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, N. J. Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, D. Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan
TL;DR: It is shown how low-level skills can be combined with large language models so that the language model provides high-level knowledge about the procedures for performing complex and temporally extended instructions, while value functions associated with these skills provide the grounding necessary to connect this knowledge to a particular physical environment.
Journal Article
RT-1: Robotics Transformer for Real-World Control at Scale
Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, K. Gopalakrishnan, Karol Hausman, Alexander Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Kuang-Huei Lee, Sergey Levine, Yao Lu, Utsav Malla, D. Manjunath, Igor Mordatch, Ofir Nachum, Carolina Parada, Jodilyn Peralta, Emily Perez, Karl Pertsch, Jornell Quiambao, Kanishka Rao, Michael S. Ryoo, Grecia Salazar, Pannag Raghunath Sanketi, Kevin Sayed, Jaspiar Singh, Sumedh Anand Sontakke, Austin Stone, Clayton Tan, Huong Tran, Vincent Vanhoucke, Steve Vega, Quan Vuong, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich
TL;DR: The authors present a model class, dubbed Robotics Transformer, that exhibits promising scalable model properties, and verify their conclusions in a study of how different model classes generalize as a function of data size, model size, and data diversity, based on large-scale data collection on real robots performing real-world tasks.
Journal Article
Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montse Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, K. Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Fu, Bo Wei, Sangeetha Sukumari Ramesh, K.D. Holden, K. Kleiven, David A. Rendleman, Sean Kirmani, Jeffrey G. Bingham, Jonathan Weisz, Ying Xu, Wenlong Lu, Matthew Bennice, Jessica Lam, Yunfei Bai, Benjie Holson, Michael Quinlan, Noah Brown, Mrinal Kalakrishnan, Julian Ibarz, Peter Pastor, Sergey Levine
TL;DR: The authors describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings.
Journal Article
Language-Conditioned Reinforcement Learning to Solve Misunderstandings with Action Corrections
Michael Ahn, Anthony Brohan, Yevgen Chebotar, K. Gopalakrishnan, Karol Hausman, Alexander Herzog, Daniel Ho, Jasmine Hsu, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee, Linda Luu, Jornell Quiambao, Kanishka Rao, Pierre Sermanet, Nicolas Sievers, Alexander Toshev, Mengyuan Yan, Olivier Sigaud
TL;DR: A first formalization and experimental validation of incremental action-repair for robotic instruction-following based on reinforcement learning is presented, and it is shown that a reinforcement learning agent can successfully learn to understand incremental corrections of misunderstood instructions.