Manipulate-Anything: Automating Real-World Robots using Vision-Language Models

Published in Conference on Robot Learning (CoRL), 2024

Recommended citation: Duan, J.*, Yuan, W.*, Pumacay, W., Wang, Y. R., Ehsani, K., Fox, D., & Krishna, R. (2024). Manipulate-Anything: Automating Real-World Robots using Vision-Language Models. Conference on Robot Learning (CoRL). https://arxiv.org/abs/2406.18915

Manipulate-Anything uses vision-language models to automate real-world robots on manipulation tasks.

Authors: Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna

Download paper here

Project Website