Mono3DVLT-MT - Detail - Visual Language Perception

Method Detail: Mono3DVLT-MT

Benchmark:	VLSOT
Short name:	Mono3DVLT-MT
Long name:	Mono3DVLT-MT
Description:	@inproceedings{wei2025mono3dvlt, title={Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking}, author={Wei, Hongkai and Yang, Yang and Sun, Shijie and Feng, Mingtao and Song, Xiangyu and Lei, Qi and Hu, Hongli and Wang, Rong and Song, Huansheng and Akhtar, Naveed and others}, booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference}, pages={13886--13896}, year={2025} }
Reference:	Wei, Hongkai, Yang, Yang, Sun, Shijie, Feng, Mingtao, Song, Xiangyu, Lei, Qi, Hu, Hongli, Wang, Rong, Song, Huansheng, Akhtar, Naveed, others, Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking. In Proceedings of the Computer Vision and Pattern Recognition Conference, 2025.
Last submitted:	August 07, 2025
Published:	August 07, 2025 at 07:50:06
Submissions:	1
Project page / code:	N/A
Open source:	No

Submission Date	SR@0.5 (↑)	SR@0.7 (↑)	AOR (↑)	PR@1.0 (↑)	ACE (↓)	PR@0.5 (↑)
2025-08-07 07:50	81.6300	68.9400	85.1200	81.5600	0.5210	62.3600