Ultra-light Vehicle Detection using Tiny-Mobilenet + SSD (real time on CPU)

Model

The person detection model based on SSD architecture
There are 3 blocks: Backbone, Extralayers, Detection head

Backbone

The extractor is used: tiny Mobilenet-V2 + RFB(https://arxiv.org/abs/1711.07767) component
The tiny Mobilenet-V2 is customed Mobilenet-V2 (vertical, horizontal).

The backbone can be customed at: ./module/rfb_tiny_mobilenet_v2.py

Extra layers

Multi-scale feature maps for detection
The SSD architecture uses multiple layers (multi-scale feature maps) to detect objects independently. As CNN reduces the spatial dimension gradually, the resolution of the feature maps also decrease. SSD uses lower resolution layers to detect larger scale objects and vice versa. For example, the 4× 4 feature maps are used for larger scale object.
Because the model vehicle detection is used for surveillance camera (detect small, medium objects) so that the small feature maps do not contains lots of information. Thus, this model is eliminated 2 last feature maps
To the person dection use feature maps: 40-30, 20-15, 10-8, 5-4 (width-height). The resolution of feature map depending on the the input of network.
Note that: The high resolution of input network is not synonymous with the good results

The Extra Layers can be customed at: ./model/rfb_tiny_mb_ssd.py and ./module/ssd.py

Detection head

Regression head (location) and classification header (classification)
The most impotant factor in this component is anchor boxes (whith 3 parametert can obtimize scale, ratio, number of anchor per gid cell )
The anchor boxes are defined in [] based on the COCO, VOC dataset with many objects as well as ratio, size...
In [], the author proposed formula to generate anchors boxes. However, this formula is designed for many object catagory , size... For instance, the proposed scale factor in [] are [0.2, 0.9] which can be suitable for COCO dataset. However, this factor does not give good results... As mentioned above, the objects in surveillance applivation distribute small, medium and rarely large. Therefor the scale should have distribution at smaller than 0.5
Ratio: the ration between height/width of objects. statistics to get this factor. Code is available here

Anchor boxes: ./model/config/rfb_tiny_mb_ssd_config.py and ./model/rfb_tiny_mb_ssd.py

Loss

Localization oss: Use smooth_l1_loss
Classification loss: Use focal loss instead of CE loss to to address the issue of the class imbalance problem (person/background)

Dataset

There are 2 main dataset: COCO and Detrac

Requirements

anaconda
Python-3.6
Pytorch-1.2
Torchvision-0.4
opencv-python-
pandas

Training

Optimizer: SGD, with weight decay: 5e-4, batch size: 32, Number of echop:
Training with batch size: 32
Data augmentation:

python train.py, type_network rfb_tiny_mb2_ssd, setting base_channel = 64 with ver-1 or base_channel = 32 with ver2

Model parameter and results

Input: (320, 240)
Feature maps: 40-30, 20-15, 10-8, 5-4
Step (shrinkage): 8-8, 16-16, 32-30, 64-60
Scale (box size): (10, 16, 24), (32, 48), (64, 96), (128, 192, 256)
Ratios: 2.21, 2.47, 2.73
base_channel: Ver1(rfb_tiny_mb2_ssd_c64): 32, Ver2(rfb_tiny_mb2_ssd_c32): 64

Pytorch model

Total params: 4,332,748
Trainable params: 4,332,748
Non-trainable params: 0
Total FLOPs: 1,426,865,400
Total Madds: 2,827,840,000
Input size (MB): 0.22
Forward/backward pass size (MB): 141.75
Params size (MB): 4.13
Estimated Total Size (MB): 146.10
FLOPs size (GB): 1.43
Madds size (GB): 2.83

Model runtime :

0.032s (Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz) - 30 FPS.

Model result

Tested on 10000 random DETRAC test set images

STT	Model	Confidence	DT_Box	AP(car)	MR(car)	AP(bus)	MR(bus)	AP(van)	MR(van)	mAP	mFPPI	Processing time
1	tiny_Mb2-SSD-Lite	0.5	73304	62.92%	36.38%	77.64%	21.46%	55.78%	42.61%	67.14%

0.05s

Run inference :

Run test/detect_imgs.py. Change the corresponding argument . If ModuleNotFoundError: No module named 'model', change the path to sys.path.append() to current TinyMBSSD_Vehicle/ folder.

Train

Run train/main.py. Change the train and validate dataset paths in train/train.py. If the number of datasets changes, you have to change in the dataloader

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
checkpoint		checkpoint
datasets		datasets
evaluate		evaluate
model		model
module		module
pre-train		pre-train
test		test
test_data		test_data
train		train
utils		utils
README.md		README.md
__init__.py		__init__.py
dongkinhnghiathuc.jpg		dongkinhnghiathuc.jpg
img.txt		img.txt
img00100.jpg		img00100.jpg
img00301.jpg		img00301.jpg
img00450.jpg		img00450.jpg
json.txt		json.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ultra-light Vehicle Detection using Tiny-Mobilenet + SSD (real time on CPU)

Model

Backbone

Extra layers

Detection head

Loss

Dataset

Requirements

Training

Model parameter and results

Pytorch model

Model runtime :

Model result

Run inference :

Train

About

Releases

Packages

Languages

GivralNguyen/Ultra-light-Vehicle-Detection-using-Tiny-Mobilenet-SSD-real-time-on-CPU

Folders and files

Latest commit

History

Repository files navigation

Ultra-light Vehicle Detection using Tiny-Mobilenet + SSD (real time on CPU)

Model

Backbone

Extra layers

Detection head

Loss

Dataset

Requirements

Training

Model parameter and results

Pytorch model

Model runtime :

Model result

Run inference :

Train

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages