About

Given a multi-modal embedder network that creates a joint representation between images and language attributes, allowing word2vec kind of math for product discovery; the aim of the project is to make the attribute addition and subtraction of the query words easier and more user friendly through auto-suggestion tags.

Current work

Understood the embedder model and several other losses for multi modal search
Collected the iMaterialist Data from kaggle with 228 distinct labels (2000 training images and 500 validation images)
Prepared the data to feed into the multi label classifier
Built a baseline multi label classifier and recorded performance.
Inference for retriving images

Next Steps

Make improvements to the baseline model through changing sigmoid thresholds, varying learning rate, experimenting with different pretrained models.
Read literature for other techniques for efficient multi label prediction

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
Dataset		Dataset
Model		Model
inference		inference
workspace		workspace
README.md		README.md
deleteme.html		deleteme.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Current work

Next Steps

About

Releases

Packages

Contributors 4

Languages

fellowship/auto_tag_embedder

Folders and files

Latest commit

History

Repository files navigation

About

Current work

Next Steps

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages