Captionify is a web application that generates descriptive captions for images using an encoder-decoder architecture. It pairs a pre-trained Transformer-based vision model (ViT) as the encoder with a pre-trained language model (GPT-2) as the decoder to caption uploaded images or images fetched from a URL.
To use Captionify, upload an image or enter an image URL in the web interface. The pre-trained models then generate a caption describing the contents of the image.
To install Captionify, first clone this repository:

```bash
git clone https://github.com/<username>/<repository>.git
cd <repository>
```
Then install the required dependencies:

```bash
pip install -r requirements.txt
```
Finally, run app.py:

```bash
streamlit run app.py
```
This will launch the application on your local machine. You can then upload an image or enter an image URL to generate a descriptive caption.
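The sketch below shows one way the upload/URL input handling can be wired up in Streamlit. The widget labels and control flow are illustrative assumptions, not a copy of app.py:

```python
import io

import requests
import streamlit as st
from PIL import Image

st.title("Captionify")

# accept either a local file upload or an image URL
uploaded_file = st.file_uploader("Upload an image", type=["png", "jpg", "jpeg"])
url = st.text_input("...or enter an image URL")

image = None
if uploaded_file is not None:
    image = Image.open(uploaded_file)
elif url:
    # fetch the image bytes and hand them to Pillow
    response = requests.get(url, timeout=10)
    image = Image.open(io.BytesIO(response.content))

if image is not None:
    st.image(image, caption="Input image")
    # the image would then be passed to the captioning model
    # (see the pipeline sketch in the next section)
```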
Captionify uses an encoder-decoder architecture to generate captions for images. The encoder is a pre-trained Transformer-based vision model (ViT) that encodes the input image into a sequence of feature vectors. The decoder is a pre-trained language model (GPT2) that generates a descriptive caption for the image based on the encoded features.
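The snippet below sketches this pipeline with the Hugging Face Transformers API. The checkpoint name `nlpconnect/vit-gpt2-image-captioning` is an assumption (a commonly used public ViT+GPT-2 captioning checkpoint), and `example.jpg` is a placeholder; the repository may load a different model:

```python
import torch
from PIL import Image
from transformers import AutoTokenizer, ViTImageProcessor, VisionEncoderDecoderModel

# assumed checkpoint: a public ViT encoder + GPT-2 decoder captioning model
checkpoint = "nlpconnect/vit-gpt2-image-captioning"
model = VisionEncoderDecoderModel.from_pretrained(checkpoint)
processor = ViTImageProcessor.from_pretrained(checkpoint)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)

image = Image.open("example.jpg").convert("RGB")

# encoder: ViT turns the image into a sequence of feature vectors
pixel_values = processor(images=image, return_tensors="pt").pixel_values

# decoder: GPT-2 generates caption tokens conditioned on those features
with torch.no_grad():
    output_ids = model.generate(pixel_values, max_length=32, num_beams=4)

caption = tokenizer.batch_decode(output_ids, skip_special_tokens=True)[0]
print(caption)
```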
Captionify depends on the following Python packages:

- streamlit
- requests
- Pillow
- transformers
- torch
- This project is based on an encoder-decoder architecture and uses pre-trained models from the Hugging Face Transformers library.
- The application was developed using Streamlit, an open-source app framework for Machine Learning and Data Science projects.