SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation
Published in International Conference on Machine Learning (ICML), 2025
Recommended citation: Fang, H., Grotz, M., Pumacay, W., Wang, Y. R., Fox, D., Krishna, R., & Duan, J. (2025). SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation. International Conference on Machine Learning (ICML). https://arxiv.org/abs/2501.18564
Integration of visual foundation models with memory architecture for enhanced robotic manipulation capabilities.
Authors: Haoquan Fang, Markus Grotz, Wilbert Pumacay, Yi Ru Wang, Dieter Fox, Ranjay Krishna, Jiafei Duan