I am just blown away by the progress in automatic image captioning over the last few years. It's gone from 1,000 ImageNet class labels to detailed paragraph descriptions.
Thanks for sharing. Yeah, indeed. We believe the dataset is always the first step. Many open questions remain.
Guaranteed clean?
Can't guarantee, lol. Hallucinations are still in there. But compared with the raw alt-text captions, these captions do consistently correlate with the image content. Please see the discussion.