Abstract: Open-vocabulary object detection (OVD) models are considered to be Large Multi-modal Models (LMM), due to their extensive training data and a large number of parameters. Mainstream OVD ...
Abstract: How to ensure robust and accurate 3D object detection under various environment is essential for autonomous driving (AD) environment perception. While, until now, most of the existing 3D ...