Skip to content

Latest commit

 

History

History
20 lines (7 loc) · 1009 Bytes

Large_Navigation_Model.md

File metadata and controls

20 lines (7 loc) · 1009 Bytes

Large Navigation Model

Vision-Language-Action

Video Generation

  • NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation, RSS 2024. [Paper] [Website] [Video] [Code]
  • Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks, arXiv 2024.12. [Paper] [Website] [He Wang, PKU]