Publications

2026

  1. ICRA
    toponav.png
    Toponav: Topological graphs as a key enabler for advanced object navigation
    Peiran Liu*, Qiang Zhang*, Daojie Peng*, Lingfeng Zhang*, Yihao Qin, Hang Zhou, Jun Ma, Renjing Xu, and Yiding Ji
    arXiv preprint arXiv:2509.01364, 2026
  2. RAL
    stairway.png
    Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration
    Zeying Gong, Rong Li, Tianshuai Hu, Ronghe Qiu, Lingdong Kong, Lingfeng Zhang, Yiyi Ding, Leying Zhang, and Junwei Liang
    2026
  3. AAAI
    spatialnav.png
    What You See is What You Reach: Towards Spatial Navigation with High-Level Human Instructions
    Lingfeng Zhang*, Haoxiang Fu*, Xiaoshuai Hao, Shuyi Zhang, Qiang Zhang, Rui Liu, Long Chen, and Wenbo Ding
    2026

2025

  1. Technical Report
    mimo.png
    MiMo-Embodied: X-Embodied Foundation Model Technical Report
    Xiaoshuai Hao*, Lei Zhou*, Zhijian Huang*, Zhiwen Hou*, Yingbo Tang*, Lingfeng Zhang*, Guang Li*, Zheng Lu*, Shuhuai Ren, Xianhui Meng, and others
    arXiv preprint arXiv:2511.16518, 2025
  2. CVPR
    sky.png
    Is your VLM Sky-Ready? A Comprehensive Spatial Intelligence Benchmark for UAV Navigation
    Lingfeng Zhang*, Yuchen Zhang*, Hongsheng Li, Haoxiang Fu, Yingbo Tang, Hangjun Ye, Long Chen, Xiaojun Liang, Xiaoshuai Hao, and Wenbo Ding
    arXiv preprint arXiv:2511.13269, 2025
  3. Under Review
    socialnav.png
    SocialNav-Map: Dynamic Mapping with Human Trajectory Prediction for Zero-Shot Social Navigation
    Lingfeng Zhang, Erjia Xiao, Xiaoshuai Hao, Haoxiang Fu, Zeying Gong, Long Chen, Xiaojun Liang, Renjing Xu, Hangjun Ye, and Wenbo Ding
    arXiv preprint arXiv:2511.12232, 2025
  4. 🏆Best Paper Award
    roboafford++.png
    RoboAfford++: A Generative AI-Enhanced Dataset for Multimodal Affordance Learning in Robotic Manipulation and Navigation
    Xiaoshuai Hao, Yingbo Tang, Lingfeng Zhang, Yanbiao Ma, Yunfeng Diao, Ziyu Jia, Wenbo Ding, Hangjun Ye, and Long Chen
    arXiv preprint arXiv:2511.12436, 2025
  5. 🏆Best Student Paper Award
    ijcai.png
    Exploring typographic visual prompts injection threats in cross-modality generation models
    Hao Cheng, Erjia Xiao, Yichi Wang, Lingfeng Zhang, Qiang Zhang, Jiahang Cao, Kaidi Xu, Mengshu Sun, Xiaoshuai Hao, Jindong Gu, and others
    arXiv preprint arXiv:2503.11519, 2025
  6. ACM MM
    roboafford.png
    Roboafford: A dataset and benchmark for enhancing object and spatial affordance learning in robot manipulation
    Yingbo Tang*, Lingfeng Zhang*, Shuyi Zhang, Yinuo Zhao, and Xiaoshuai Hao
    In Proceedings of the 33rd ACM International Conference on Multimedia, 2025
  7. ACM MM
    videocot.png
    Video-cot: A comprehensive dataset for spatiotemporal understanding of videos based on chain-of-thought
    Shuyi Zhang, Xiaoshuai Hao, Yingbo Tang, Lingfeng Zhang, Pengwei Wang, Zhongyuan Wang, Hongxuan Ma, and Shanghang Zhang
    In Proceedings of the 33rd ACM International Conference on Multimedia, 2025
  8. Under Review
    nava.png
    NavA^3: Understanding Any Instruction, Navigating Anywhere, Finding Anything
    Lingfeng Zhang*, Xiaoshuai Hao*, Yingbo Tang, Haoxiang Fu, Xinyu Zheng, Pengwei Wang, Zhongyuan Wang, Wenbo Ding, and Shanghang Zhang
    arXiv preprint arXiv:2508.04598, 2025
  9. Technical Report
    robobrain.png
    Robobrain 2.0 technical report
    BAAI RoboBrain Team, Mingyu Cao, Huajie Tan, Yuheng Ji, Xiansheng Chen, Minglan Lin, Zhiyu Li, Zhou Cao, Pengwei Wang, Enshen Zhou, and others
    arXiv preprint arXiv:2507.02029, 2025
  10. ACL
    mapnav.png
    Mapnav: A novel memory representation via annotated semantic maps for vlm-based vision-and-language navigation
    Lingfeng Zhang*, Xiaoshuai Hao*, Qinwen Xu, Qiang Zhang, Xinyao Zhang, Pengwei Wang, Jing Zhang, Zhongyuan Wang, Shanghang Zhang, and Renjing Xu
    In The 63rd Annual Meeting of the Association for Computational Linguistics, 2025
  11. ICRA
    mfnp.png
    Multi-floor zero-shot object navigation policy
    Lingfeng Zhang*, Hao Wang*, Erjia Xiao*, Xinyao Zhang, Qiang Zhang, Zixuan Jiang, and Renjing Xu
    In 2025 IEEE International Conference on Robotics and Automation (ICRA), 2025

2024

  1. IROS
    trihelper.png
    Trihelper: Zero-shot object navigation with dynamic assistance
    Lingfeng Zhang, Qiang Zhang, Hao Wang, Erjia Xiao, Zixuan Jiang, Honglei Chen, and Renjing Xu
    In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
  2. ECCV
    swt.png
    Spiking wavelet transformer
    Yuetong Fang, Ziqing Wang, Lingfeng Zhang, Jiahang Cao, Honglei Chen, and Renjing Xu
    In European conference on computer vision, 2024