Manipulate-Anything: Automating Real-World Robots using Vision-Language Models

Published in Conference on Robot Learning (CoRL), 2024

Recommended citation: Duan, J.*, Yuan, W.*, Pumacay, W., Wang, Y. R., Ehsani, K., Fox, D., & Krishna, R. (2024). Manipulate-Anything: Automating Real-World Robots using Vision-Language Models. Conference on Robot Learning (CoRL). https://arxiv.org/abs/2406.18915

This paper presents Manipulate-Anything, a method that uses vision-language models to automate real-world robot manipulation tasks.

Authors: Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna

Download paper here

Project Website


