Method Detail: Mono3DVLT-MT

Benchmark: VLSOT
Short name: Mono3DVLT-MT
Long name: Mono3DVLT-MT
Description:
@inproceedings{wei2025mono3dvlt,
  title={Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking},
  author={Wei, Hongkai and Yang, Yang and Sun, Shijie and Feng, Mingtao and Song, Xiangyu and Lei, Qi and Hu, Hongli and Wang, Rong and Song, Huansheng and Akhtar, Naveed and others},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={13886--13896},
  year={2025}
}
Reference: Hongkai Wei, Yang Yang, Shijie Sun, Mingtao Feng, Xiangyu Song, Qi Lei, Hongli Hu, Rong Wang, Huansheng Song, Naveed Akhtar, et al. Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking. In Proceedings of the Computer Vision and Pattern Recognition Conference, pp. 13886-13896, 2025.
Last submitted: August 07, 2025
Published: August 07, 2025 at 07:50:06
Submissions: 1
Project page / code: N/A
Open source: No

Benchmark performance

Submission Date    SR@0.5 (↑)  SR@0.7 (↑)  AOR (↑)  PR@1.0 (↑)  ACE (↓)  PR@0.5 (↑)
2025-08-07 07:50   81.6300     68.9400     85.1200  81.5600     0.5210   62.3600