Manipulate-Anything: Automating Real-World Robots using Vision-Language Models
Published in Conference on Robot Learning (CoRL), 2024
Recommended citation: Duan, J.*, Yuan, W.*, Pumacay, W., Wang, Y. R., Ehsani, K., Fox, D., & Krishna, R. (2024). Manipulate-Anything: Automating Real-World Robots using Vision-Language Models. Conference on Robot Learning (CoRL). https://arxiv.org/abs/2406.18915
Automation of real-world robots using vision-language models for manipulation tasks.
Authors: Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna