Image BART (Quick prototyping)

Fine-tuned BART on inputs formed by concatenating Faster R-CNN object features with the textual tokens. Tested on the e-SNLI-VE dataset.
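
The concatenation step can be sketched roughly as below. This is only an illustration, not the code from this repository: the toy sizes, the linear projection of region features to the embedding width, the choice to place image regions before the text, and the names `project` and `concat_image_text` are all assumptions. It uses plain Python lists so it runs without dependencies; a real setup would use tensors and pass the fused sequence to the encoder.

```python
import random


def project(features, weight):
    """Linearly map each region feature vector to the text embedding width."""
    return [
        [sum(f * w for f, w in zip(feat, row)) for row in weight]
        for feat in features
    ]


def concat_image_text(region_feats, token_embeds, weight):
    """Place projected region features in front of the token embeddings.

    Returns the fused sequence and an attention mask covering both parts.
    The image-first ordering is an assumption made for this sketch.
    """
    visual = project(region_feats, weight)
    fused = visual + token_embeds   # concatenate along the sequence axis
    mask = [1] * len(fused)         # no padding in this toy example
    return fused, mask


random.seed(0)
# Toy sizes (assumed); real region features and embeddings are much wider.
NUM_REGIONS, REGION_DIM, D_MODEL, NUM_TOKENS = 3, 8, 4, 5
region_feats = [[random.random() for _ in range(REGION_DIM)] for _ in range(NUM_REGIONS)]
token_embeds = [[random.random() for _ in range(D_MODEL)] for _ in range(NUM_TOKENS)]
# One row per output dimension: shape (D_MODEL, REGION_DIM).
weight = [[random.gauss(0, 0.02) for _ in range(REGION_DIM)] for _ in range(D_MODEL)]

fused, mask = concat_image_text(region_feats, token_embeds, weight)
print(len(fused), len(fused[0]), sum(mask))  # sequence length, width, attended positions
```

The fused sequence has one position per image region plus one per text token, all at the same width, which is what lets a text-only encoder consume it without architectural changes.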

Result: Adding image features to the textual tokens did not improve generation compared with using the textual tokens alone. Because there is no image-based pretraining objective, adding image input to the model doesn't make a difference.