Violet: A Vision-Language model for generating Arabic image captions using a Gemini Decoder and pretrained transformer.
-
Updated
Jan 5, 2025 - Jupyter Notebook
Violet: A Vision-Language model for generating Arabic image captions using a Gemini Decoder and pretrained transformer.
Add a description, image, and links to the gemini-decoder topic page so that developers can more easily learn about it.
To associate your repository with the gemini-decoder topic, visit your repo's landing page and select "manage topics."