PointArena: Probing Multimodal Grounding Through Language-Guided Pointing
Published in In Submission, 2025
Recommended citation: Cheng, L., Duan, J., Wang, Y. R., Fang, H., Li, B., Huang, Y., Wang, E., Eftekhar, A., Lee, J., Yuan, W., Hendrix, R., Smith, N. A., Xia, F., Fox, D., & Krishna, R. (2025). PointArena: Probing Multimodal Grounding Through Language-Guided Pointing. In Submission. https://arxiv.org/abs/2505.09990
A framework for probing multimodal grounding through language-guided pointing interactions.
Authors: Long Cheng, Jiafei Duan, Yi Ru Wang, Haoquan Fang, Boyang Li, Yushan Huang, Elvis Wang, Ainaz Eftekhar, Jason Lee, Wentao Yuan, Rose Hendrix, Noah A. Smith, Fei Xia, Dieter Fox, Ranjay Krishna