Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Image Caption Generator - Flickr Dataset Added in Generative Models #360

Merged
merged 1 commit into from
Oct 15, 2024

Conversation

deepanshubaghel
Copy link
Contributor

@deepanshubaghel deepanshubaghel commented Oct 15, 2024

Fixes: #299

Project Information

The objective of the project is to predict the captions for the input image. The dataset consists of 8k images and 5 captions for each image. The features are extracted from both the image and the text captions for input. The features will be concatenated to predict the next word of the caption. CNN is used for image and LSTM is used for text. BLEU Score is used as a metric to evaluate the performance of the trained model.

Dataset link: https://www.kaggle.com/adityajn105/flickr8k

Environment: Kaggle

Libraries

  • numpy
  • matplotlib
  • keras
  • tensorflow
  • nltk

Neural Network

  • VGG16 Network
  • CNN-LSTM Network

Results

{6B12E549-4942-4826-A8AF-1162D3A9155B}

{3CB3A15C-8133-4FB0-90C1-E97619A22F70}

Copy link

👋 Thank you for opening this pull request! We appreciate your contribution to improving this project. Your PR is under review, and we'll get back to you shortly.
Don't forget to mention the issue you solved!.

To help move the process along, please tag @UppuluriKalyani, @Neilblaze, and @SaiNivedh26 for a faster review!

@UppuluriKalyani UppuluriKalyani merged commit d4b06f7 into UppuluriKalyani:main Oct 15, 2024
3 checks passed
Copy link

🎉🎉 Thank you for your contribution! Your PR #360 has been merged! 🎉🎉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Image Caption Generator - Flickr Dataset - CNN-LSTM
2 participants