Wasim, S. T., Naseer, M., Khan, S., Yang, M., & Khan, F. S. (2024, June 16). VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 18909-18918. https://doi.org/10.1109/CVPR52733.2024.01789
Chicago Style (17th ed.) CitationWasim, Syed Talal, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, and Fahad Shahbaz Khan. "VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding." 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) 16 Jun. 2024: 18909-18918. https://doi.org/10.1109/CVPR52733.2024.01789.
MLA (9th ed.) CitationWasim, Syed Talal, et al. "VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding." 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 16 Jun. 2024, pp. 18909-18918, https://doi.org/10.1109/CVPR52733.2024.01789.