One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework 30 September 2025
|
published at Computer Vision and Pattern Recognition 2026
|
L. Bianchi, G. Pacini et al.
Zero-shot captioners are recently proposed models that utilize common-space vision-language representations to caption images without...
read more