Publications
2026
- Preprint
OA-WAM: Object-Addressable World Action Model for Robust Robot ManipulationarXiv preprint arXiv:2605.06481, 2026 - Preprint
Thinking in Text and Images: Interleaved Vision–Language Reasoning Traces for Long-Horizon Robot ManipulationarXiv preprint arXiv:2605.00438, 2026 - ICRA
2025
- Under Review
- 🏆Best Paper Award
RoboAfford++: A Generative AI-Enhanced Dataset for Multimodal Affordance Learning in Robotic Manipulation and NavigationarXiv preprint arXiv:2511.12436, 2025 - 🏆Best Student Paper Award
Exploring typographic visual prompts injection threats in cross-modality generation modelsarXiv preprint arXiv:2503.11519, 2025 - ACL
- ACL