Semantic segmentation of images with Fully Convolutional Networks

Semantic segmentation is the pixel-level classification of an image: every pixel is assigned a class, and all pixels belonging to objects of the same class receive the same label. This pixel-level precision is useful for tasks such as human action recognition. Here, a Fully Convolutional Network (FCN) is used to obtain segmentation masks for humans performing different actions. An FCN can take input of any size because the network consists only of convolutional layers and has no fully connected layer. The input images are obtained by running object detection over the scene images to detect humans. The images are categorized into five classes: Kick, Punch, Push, Handshake and Victim. The data comes from the UT-Interaction dataset, available at the link below.

http://cvrc.ece.utexas.edu/SDHA2010/Human_Interaction.html
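To make "pixel-level classification" concrete, here is a minimal sketch in plain Python of how per-pixel class scores could be turned into a segmentation mask. The five class names come from this README; the extra background label at index 0, the helper name `scores_to_mask`, and the toy scores are assumptions made only for illustration and are not taken from the repository's code.

```python
# Class names are from the README; treating index 0 as background is an assumption.
CLASSES = ["Background", "Kick", "Punch", "Push", "Handshake", "Victim"]


def scores_to_mask(scores):
    """Turn per-pixel class scores into a mask of class indices.

    scores[c][y][x] is the score of class c at pixel (y, x).
    Each pixel gets the index of its highest-scoring class.
    """
    num_classes = len(scores)
    height, width = len(scores[0]), len(scores[0][0])
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            best = max(range(num_classes), key=lambda c: scores[c][y][x])
            row.append(best)
        mask.append(row)
    return mask


# Toy 2x2 "image" with made-up scores: all zeros except one winning class per pixel.
toy_scores = [[[0.0, 0.0], [0.0, 0.0]] for _ in CLASSES]
toy_scores[0][0][0] = 1.0  # top-left pixel: Background
toy_scores[1][0][1] = 1.0  # top-right pixel: Kick
toy_scores[5][1][0] = 1.0  # bottom-left pixel: Victim
toy_scores[5][1][1] = 1.0  # bottom-right pixel: Victim

toy_mask = scores_to_mask(toy_scores)
for mask_row in toy_mask:
    print([CLASSES[i] for i in mask_row])
```

In a real network the scores would be the output of the final layer, with one channel per class, and the same argmax would be applied over the channel axis.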

The FCN-8 architecture uses the standard VGG-16 network for downsampling and performs upsampling through transposed convolutions. To aid reconstruction, skip connections from Layer 3 and Layer 4 are added to the upsampling layers.
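The following sketch traces how spatial sizes could flow through such a network. It reads "Layer 3" and "Layer 4" as the outputs of the third and fourth pooling stages of VGG-16, which is an assumption about the wording above. The transposed-convolution settings (kernel 4, stride 2, padding 1 for 2x upsampling; kernel 16, stride 8, padding 4 for 8x upsampling) are a common choice rather than values confirmed by this repository, and the helper names are hypothetical. The sketch also assumes a square input whose side is divisible by 32.

```python
def pooled(size, times):
    """Side length after `times` 2x2, stride-2 poolings (each halves the side)."""
    for _ in range(times):
        size //= 2
    return size


def transposed_conv_size(size, kernel, stride, pad):
    """Output side length of a transposed convolution without output padding."""
    return (size - 1) * stride - 2 * pad + kernel


def fcn8_sizes(side):
    """Trace side lengths through an FCN-8 style encoder and decoder."""
    pool3 = pooled(side, 3)  # 1/8 of the input resolution
    pool4 = pooled(side, 4)  # 1/16
    pool5 = pooled(side, 5)  # 1/32

    # 2x upsampling of the deepest features, then fusion with pool4.
    up_to_pool4 = transposed_conv_size(pool5, kernel=4, stride=2, pad=1)
    if up_to_pool4 != pool4:
        raise ValueError("cannot add pool4 skip: sizes differ")

    # Another 2x upsampling, then fusion with pool3.
    up_to_pool3 = transposed_conv_size(up_to_pool4, kernel=4, stride=2, pad=1)
    if up_to_pool3 != pool3:
        raise ValueError("cannot add pool3 skip: sizes differ")

    # Final 8x upsampling back to the input resolution.
    output = transposed_conv_size(up_to_pool3, kernel=16, stride=8, pad=4)
    return {"pool3": pool3, "pool4": pool4, "pool5": pool5, "output": output}


# The same layers handle different input sizes, since nothing is fully connected.
sizes_224 = fcn8_sizes(224)
sizes_320 = fcn8_sizes(320)
print(sizes_224)
print(sizes_320)
```

The skip connections can only be added element-wise when the upsampled map and the pooled map have the same size, which is why the sketch checks the sizes before each fusion. Inputs whose sides are not divisible by 32 would need padding or cropping, which is not shown here.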

The FCN-8 model is implemented with GluonCV and can be run with:

$ python gluon_fcn.py

saransh-mehta/semantic_segmentation_FCN
