A pipeline for generating detailed image captions by integrating YOLOv8 for object detection and FLAN-T5 for contextual enhancement.
Updated Dec 18, 2024 - Jupyter Notebook
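As a rough illustration of how detector output might feed a language model in a pipeline like this, the sketch below turns a list of detections into a text prompt. It is not the repository's code: the `Detection` class, `summarize_detections`, `build_caption_prompt`, and the 0.5 confidence threshold are all hypothetical, and both the detector (for example YOLOv8) and the text model (for example FLAN-T5) are left out, so only the glue step is shown.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Detection:
    """Hypothetical container for one detector result (label plus score)."""
    label: str
    confidence: float


def summarize_detections(detections, min_confidence=0.5):
    """Count labels at or above the threshold and join them into a phrase."""
    counts = Counter(
        d.label for d in detections if d.confidence >= min_confidence
    )
    parts = []
    for label, n in sorted(counts.items()):
        # Naive pluralization (appends "s"); real labels may need more care.
        noun = label if n == 1 else label + "s"
        parts.append(f"{n} {noun}")
    return ", ".join(parts)


def build_caption_prompt(detections, min_confidence=0.5):
    """Build an instruction prompt that a text model could expand into a caption."""
    summary = summarize_detections(detections, min_confidence)
    if not summary:
        # Nothing confident enough was detected; fall back to a generic prompt.
        return "Describe the image in one detailed sentence."
    return (
        "Write one detailed sentence describing an image that contains: "
        f"{summary}."
    )


# Made-up detections standing in for real detector output.
example = [
    Detection("dog", 0.9),
    Detection("dog", 0.8),
    Detection("frisbee", 0.7),
    Detection("cat", 0.2),  # below the threshold, so it is ignored
]
prompt = build_caption_prompt(example)
```

In a full pipeline the resulting `prompt` string would be passed to the text model, whose output becomes the enriched caption.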
An image captioning model built on a DETR-inspired architecture.