Sahin, U., Li, H., Khan, Q., Cremers, D., & Tresp, V. (2024, January 3). Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining. Proceedings / IEEE Workshop on Applications of Computer Vision, 5551-5561. https://doi.org/10.1109/WACV57701.2024.00547
Chicago Style (17th ed.) CitationSahin, Ugur, Hang Li, Qadeer Khan, Daniel Cremers, and Volker Tresp. "Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining." Proceedings / IEEE Workshop on Applications of Computer Vision 3 Jan. 2024: 5551-5561. https://doi.org/10.1109/WACV57701.2024.00547.
MLA (9th ed.) CitationSahin, Ugur, et al. "Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining." Proceedings / IEEE Workshop on Applications of Computer Vision, 3 Jan. 2024, pp. 5551-5561, https://doi.org/10.1109/WACV57701.2024.00547.