Demo video: https://github.com/user-attachments/assets/95c2114e-e33a-46ed-933a-3a2ec153f6ca This is the official code for MatSpray, a framework for fusing 2D material ...
Previous research has investigated the application of Multimodal Large Language Models (MLLMs) in understanding 3D scenes by interpreting them as videos. These approaches generally depend on ...
Abstract: This paper proposes 3DGeoDet, a novel geometry-aware 3D object detection approach that effectively handles single- and multi-view RGB images in indoor and outdoor environments, showcasing ...
Abstract: Giving machines the ability to infer the complete 3D geometry and semantics of complex scenes is crucial for many downstream tasks, such as decision-making and planning. Vision-centric ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results