saransh-mehta/semantic_segmentation_FCN

Semantic Segmentation of Images with Fully Convolutional Networks

Semantic segmentation is the pixel-level classification of an image: every pixel is assigned to an object class, and all pixels belonging to the same class receive the same label. This pixel-level precision is useful for tasks such as human action recognition. Here, a Fully Convolutional Network (FCN) is used to obtain segmentation masks for humans performing different actions. An FCN can take input of any size because the network consists only of convolutional layers, with no fully connected layer. The input images are obtained by first running object detection on full-scene images to detect humans. The images are categorized into five classes: Kick, Punch, Push, Handshake and Victim. The data comes from the UT-Interaction dataset, available at the link below.

http://cvrc.ece.utexas.edu/SDHA2010/Human_Interaction.html
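
To make "pixel-level classification" concrete, here is a minimal sketch of how per-class score maps could be turned into a segmentation mask by taking the highest-scoring class at each pixel. The function name `scores_to_mask`, the class index order, and the absence of a separate background class are assumptions for illustration; they are not taken from this repository's code.

```python
# Class names from the README; the index order is an assumption.
CLASSES = ["Kick", "Punch", "Push", "Handshake", "Victim"]


def scores_to_mask(scores):
    """Convert score maps shaped [C][H][W] into a label mask shaped [H][W].

    Each output pixel holds the index of the class with the highest score.
    Nothing here depends on a fixed H or W, which mirrors why a network
    built only from convolutional layers can accept inputs of any size.
    """
    num_classes = len(scores)
    height = len(scores[0])
    width = len(scores[0][0])
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            # Per-pixel argmax over the class axis (ties go to the lowest index).
            best = max(range(num_classes), key=lambda c: scores[c][y][x])
            row.append(best)
        mask.append(row)
    return mask


if __name__ == "__main__":
    # Toy 2x3 "image": all scores zero except two hand-picked pixels.
    toy = [[[0.0] * 3 for _ in range(2)] for _ in range(len(CLASSES))]
    toy[1][0][0] = 1.0  # pixel (0, 0) favours "Punch"
    toy[4][1][2] = 2.0  # pixel (1, 2) favours "Victim"
    for mask_row in scores_to_mask(toy):
        print([CLASSES[i] for i in mask_row])
```

In a real pipeline the score maps would come from the network's final layer, and the argmax would normally be done with an array library rather than nested loops.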

The FCN-8 architecture uses the standard VGG-16 network for downsampling and performs upsampling through transposed convolution. To aid the reconstruction, skip connections from Layer 3 and Layer 4 are added to the upsampling layers.
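
The downsample/upsample path can be illustrated with simple shape arithmetic. The sketch below assumes the commonly described FCN-8 layout, in which VGG-16's pooling stages 3, 4 and 5 produce feature maps at 1/8, 1/16 and 1/32 of the input resolution, the coarsest map is upsampled by 2 and fused with stage 4, upsampled by 2 again and fused with stage 3, and finally upsampled by 8. Whether the script in this repository follows exactly these factors is an assumption, and the function name `fcn8_shapes` is hypothetical.

```python
def fcn8_shapes(height, width):
    """Trace spatial sizes through an assumed FCN-8 layout.

    For simplicity this sketch requires sizes divisible by 32; real
    implementations typically pad or crop to handle other sizes.
    """
    if height % 32 or width % 32:
        raise ValueError("this sketch assumes sizes divisible by 32")

    def halve(size, times):
        # Each 2x2 pooling stage halves both spatial dimensions.
        h, w = size
        for _ in range(times):
            h, w = h // 2, w // 2
        return (h, w)

    def upsample(size, factor):
        # A transposed convolution with stride `factor` scales the size up.
        return (size[0] * factor, size[1] * factor)

    pool3 = halve((height, width), 3)  # 1/8 resolution
    pool4 = halve(pool3, 1)            # 1/16 resolution
    pool5 = halve(pool4, 1)            # 1/32 resolution

    fuse4 = upsample(pool5, 2)         # must match pool4 for the skip addition
    assert fuse4 == pool4
    fuse3 = upsample(fuse4, 2)         # must match pool3 for the skip addition
    assert fuse3 == pool3
    output = upsample(fuse3, 8)        # back to the input resolution

    return {
        "pool3": pool3,
        "pool4": pool4,
        "pool5": pool5,
        "fuse_pool4": fuse4,
        "fuse_pool3": fuse3,
        "output": output,
    }


if __name__ == "__main__":
    for name, size in fcn8_shapes(224, 224).items():
        print(name, size)
```

The two internal assertions capture the point of the skip connections: the upsampled map and the earlier feature map must share a spatial size before they can be added element-wise.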

[Figure: fcn-8]

GluonCV is used to implement the FCN-8 model. To run it:

$ python gluoncv_fcn.py
