This repository contains the code for my thesis project on location-guided recurrent attention models (LG-DRAM).
In this project I developed a training method for stochastic recurrent attention models [1][2] that enhances both recognition performance and learning speed.
Similar to how humans direct their gaze, stochastic recurrent attention models process images as a sequence of glimpses, focusing on the parts of the input most relevant to the task at hand, e.g. object detection. Trained with reinforcement learning, they learn where to look in order to maximise their performance in a goal-driven way. However, parameter optimisation can be slow, especially for large, cluttered images.
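A minimal sketch of one glimpse step, assuming a PyTorch implementation; the module, its layers, and the patch-extraction logic are illustrative only and do not mirror the thesis code's actual API:

```python
# Illustrative sketch of a single glimpse step in a recurrent attention model.
import torch
import torch.nn as nn


class GlimpseStep(nn.Module):
    def __init__(self, glimpse_size=8, hidden_size=256, num_classes=10):
        super().__init__()
        self.glimpse_size = glimpse_size
        # Encode the glimpse patch together with its location into a feature vector.
        self.glimpse_net = nn.Linear(glimpse_size * glimpse_size + 2, hidden_size)
        self.rnn = nn.GRUCell(hidden_size, hidden_size)
        # The location network parameterises a stochastic policy over where to look next.
        self.loc_net = nn.Linear(hidden_size, 2)
        self.classifier = nn.Linear(hidden_size, num_classes)

    def extract_glimpse(self, images, loc):
        # Crop a square patch centred at `loc` (given in [-1, 1] image coordinates).
        n, _, h, w = images.shape
        patches = []
        for i in range(n):
            cy = int((loc[i, 0].item() + 1) / 2 * (h - self.glimpse_size))
            cx = int((loc[i, 1].item() + 1) / 2 * (w - self.glimpse_size))
            patches.append(images[i, 0, cy:cy + self.glimpse_size, cx:cx + self.glimpse_size])
        return torch.stack(patches).flatten(1)

    def forward(self, images, loc, hidden):
        g = torch.relu(self.glimpse_net(torch.cat([self.extract_glimpse(images, loc), loc], dim=1)))
        hidden = self.rnn(g, hidden)
        # Sample the next location from a Gaussian policy (trained with REINFORCE).
        loc_mean = torch.tanh(self.loc_net(hidden))
        next_loc = torch.clamp(loc_mean + 0.1 * torch.randn_like(loc_mean), -1, 1)
        return hidden, next_loc, loc_mean, self.classifier(hidden)
```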
We therefore propose to enrich the training procedure with an auxiliary supervised learning task, namely object localisation. Like a teacher occasionally pointing a student towards the relevant regions in an image, this additional task strengthens the reward signal and biases glimpses towards the target objects. Crucially, the method requires only very few location annotations and is therefore useful in practice for making attention models more data efficient.
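The sketch below illustrates the general idea of such a hybrid objective: a REINFORCE term for the location policy plus an auxiliary supervised localisation term applied only to the few annotated samples. All tensor names, the reward definition, and the `aux_weight` parameter are assumptions for illustration, not the exact formulation used in the thesis:

```python
import torch
import torch.nn.functional as F


def hybrid_loss(logits, labels, log_pi, baseline, loc_means, loc_targets,
                has_annotation, aux_weight=0.5):
    # Standard classification loss on the final prediction.
    ce = F.cross_entropy(logits, labels)

    # REINFORCE: reward = 1 for a correct classification; a learned baseline reduces
    # variance, and log_pi is the summed log-probability of the sampled glimpse locations.
    reward = (logits.argmax(dim=1) == labels).float()
    advantage = reward - baseline.detach()
    reinforce = -(log_pi * advantage).mean()
    baseline_loss = F.mse_loss(baseline, reward)

    # Auxiliary localisation: pull predicted glimpse centres towards annotated object
    # locations, but only on the subset of samples that carry a location annotation.
    if has_annotation.any():
        loc_loss = F.mse_loss(loc_means[has_annotation], loc_targets[has_annotation])
    else:
        loc_loss = torch.zeros((), device=logits.device)

    return ce + reinforce + baseline_loss + aux_weight * loc_loss
```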
Samples of the model evaluated on several digit classification tasks (boxes indicate the glimpses; green: correct classification, red: incorrect).
[1] Mnih V., Heess N., Graves A., Kavukcuoglu K. 2014. Recurrent Models of Visual Attention. https://arxiv.org/abs/1406.6247
[2] Ba J., Mnih V., Kavukcuoglu K. 2014. Multiple Object Recognition with Visual Attention. https://arxiv.org/abs/1412.7755