EndoSLAM Dataset and An Unsupervised Monocular Visual Odometry and Depth Estimation Approach for Endoscopic Videos: Endo-SfMLearner

EndoSLAM Dataset Overview

We introduce an endoscopic SLAM dataset which consists of both ex-vivo and synthetically generated data. The ex-vivo part of the dataset includes standard as well as capsule endoscopy recordings. The dataset is divided into 35 sub-datasets. Specifically, 18, 5 and 12 sub-datasets exist for colon, small intestine and stomach respectively.

To the best of the authors' knowledge, this is the first published dataset for capsule endoscopy SLAM tasks that provides timed 6-DoF pose data and high-precision 3D map ground truth.
Two different capsule endoscopes and conventional endoscope cameras with high and low resolution were used to provide variety in camera specifications and lighting conditions. Further distinguishing features of the dataset are images of the same organs captured by different cameras at various resolutions, and depth data for each of those organs. We also provide images and pose values for two types of wireless endoscopes, which differ from each other in aspects such as camera resolution, frame rate, and diagnostic results for detecting the Z-line, duodenal papillae and bleeding.
Some of the sub-datasets include the same trajectories in two versions, e.g. with and without polyps, so that the effect of polyps as distinguishable features in the organ environment can also be analysed.

The dataset is publicly available in DropBox.

1. Dataset Shooting

The experimental procedure for the ex-vivo part of the dataset is demonstrated on YouTube. For information about the generation of the synthetic data, please visit the Virtual Capsule Endoscopy repository.

2. Collection of frames taken on endoscope trajectories

Sample recorded frames are illustrated as follows:

The ex-vivo and synthetic parts of the dataset consist of a total of 42,700 and 21,887 frames, respectively. The specifications of the dataset parts recorded with each camera are as follows:

Camera	Number of frames	FPS	Resolution (pixels)
HighCam	21,428	20	1280 x 720
LowCam	17,978	20	640 x 480
Pillcam	239	4 - 35	256 x 256
MiroCam	3,055	3	320 x 320
UnityCam	21,887	30	320 x 320
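
The totals quoted above can be cross-checked against the per-camera rows. The following is a minimal sketch that copies the frame counts from the table; the dictionary layout and variable names are illustrative only, and treating UnityCam as the sole synthetic source is an assumption based on the table.

```python
# Frame counts copied from the table above.
frames = {
    "HighCam": 21428,
    "LowCam": 17978,
    "Pillcam": 239,
    "MiroCam": 3055,
    "UnityCam": 21887,
}

# Assumption: UnityCam is the only synthetic source; all other rows are ex-vivo.
synthetic_cameras = {"UnityCam"}

ex_vivo_total = sum(n for cam, n in frames.items() if cam not in synthetic_cameras)
synthetic_total = sum(n for cam, n in frames.items() if cam in synthetic_cameras)

print("ex-vivo frames:", ex_vivo_total)
print("synthetic frames:", synthetic_total)
```

The four ex-vivo rows sum to the 42,700 frames stated in the text, and the UnityCam row accounts for the 21,887 synthetic frames.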

3. Dataset Organization

Endo-SfMLearner Overview

We introduce the Endo-SfMLearner framework, a self-supervised, spatial attention-based method for monocular depth and pose estimation.

Our main contributions are as follows:

A brightness-aware photometric loss, which makes the predicted depth consistent under varying illumination conditions.
A spatial attention-based pose network, which is optimized for capsule endoscopy images.
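
To illustrate why a brightness-aware photometric loss helps, the sketch below compares a plain L1 photometric error with one computed after a global brightness alignment. This is not the Endo-SfMLearner implementation: the function names `affine_brightness_align` and `photometric_l1` are hypothetical, and the alignment model (matching the mean and standard deviation of the two images) is an assumption chosen for simplicity.

```python
import numpy as np


def affine_brightness_align(source, target, eps=1e-6):
    """Rescale `source` so its mean and standard deviation match `target`."""
    scale = target.std() / (source.std() + eps)
    return (source - source.mean()) * scale + target.mean()


def photometric_l1(source, target, brightness_aware=True):
    """Mean absolute intensity difference, optionally after brightness alignment."""
    if brightness_aware:
        source = affine_brightness_align(source, target)
    return float(np.mean(np.abs(source - target)))


rng = np.random.default_rng(0)
target = rng.random((32, 32))

# Same scene under different illumination, modelled as a global gain and offset.
warped = 0.6 * target + 0.2

plain = photometric_l1(warped, target, brightness_aware=False)
aware = photometric_l1(warped, target, brightness_aware=True)
print("plain L1:", plain)
print("brightness-aware L1:", aware)
```

In this toy case the two images show identical structure, so the plain loss penalises only the illumination change, while the aligned loss is close to zero. A loss that ignores such changes avoids pushing the depth network to explain lighting differences as geometry.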

Getting Started

1. Installation

Clone Endo-SfMLearner Repository

cd ~
git clone https://github.com/CapsuleEndoscope/EndoSLAM
cd EndoSLAM

Prerequisites

To use Endo-SfMLearner, install the required Python packages:

pip3 install -r requirements.txt

2. Pretrained Models

Pretrained models (Endo-SfMLearner) can be downloaded here!

3. Use-Cases of Endo-SfMLearner with EndoSLAM Dataset

3.1 Depth Estimation

Unity	Endo-SfMLearner	Endo-SfM w/o brightness	Endo-SfM w/o attention	SC-SfMLearner	Monodepth2	Monodepth2 pretrained	SfMLearner	SfMLearner pretrained
RMSE (mean, stdev)	0.2966, 0.0622	0.3288, 0.0608	0.3273, 0.1086	0.3692, 0.0779	0.3322, 0.0815	0.4531, 0.1011	0.3888, 0.0711	0.4911, 0.0831
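
The table reports RMSE as a mean and standard deviation. A minimal sketch of how such a pair could be computed is given below, assuming the statistics are taken over per-frame RMSE values between predicted and ground-truth depth maps. The helper names `depth_rmse` and `rmse_mean_std` are hypothetical, and any scale alignment or masking used in the original evaluation is not reproduced here.

```python
import numpy as np


def depth_rmse(pred, gt):
    """Root mean square error between one predicted and one ground-truth depth map."""
    return float(np.sqrt(np.mean((pred - gt) ** 2)))


def rmse_mean_std(preds, gts):
    """Mean and standard deviation of the per-frame RMSE over a sequence."""
    errors = np.array([depth_rmse(p, g) for p, g in zip(preds, gts)])
    return float(errors.mean()), float(errors.std())


# Toy data: three 4x4 frames with constant depth errors of 0.1, 0.2 and 0.3.
gts = [np.zeros((4, 4)) for _ in range(3)]
preds = [np.full((4, 4), 0.1), np.full((4, 4), 0.2), np.full((4, 4), 0.3)]

mean_rmse, std_rmse = rmse_mean_std(preds, gts)
print(mean_rmse, std_rmse)
```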

3.2 Pose Estimation

The pose trajectories of Endo-SfMLearner and state-of-the-art methods on ex-vivo small intestine recordings obtained from the low-resolution and high-resolution endoscope cameras and the MiroCam capsule camera are as follows:

The pose trajectory on synthetic stomach data acquired in the Unity environment is as follows:
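
A trajectory like those described above is typically obtained by chaining frame-to-frame pose estimates. The sketch below shows one way to do this, assuming each relative pose is a 4x4 homogeneous transform that maps the current camera frame into the previous one. The function name `accumulate_trajectory` and this pose convention are assumptions, not taken from the repository.

```python
import numpy as np


def accumulate_trajectory(relative_poses):
    """Chain 4x4 frame-to-frame transforms into camera positions in the first frame."""
    pose = np.eye(4)
    positions = [pose[:3, 3].copy()]
    for rel in relative_poses:
        # Right-multiplication applies the motion in the current camera frame.
        pose = pose @ rel
        positions.append(pose[:3, 3].copy())
    return np.array(positions)


# Toy motion: three steps of one unit along the camera z-axis, without rotation.
step = np.eye(4)
step[2, 3] = 1.0
traj = accumulate_trajectory([step, step, step])
print(traj)
```

Because monocular methods recover translation only up to scale, a predicted trajectory usually has to be scaled and aligned to the ground truth before the two are compared.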

3.3 3D Map Reconstruction

In this work, we propose and evaluate a hybrid 3D reconstruction technique. To demonstrate the effectiveness of Endo-SfMLearner, we compare reconstructions obtained with Endo-SfMLearner, SC-SfMLearner and a shape-from-shading method.

Root mean square error (RMSE) was used as the evaluation metric.

Algorithm	RMSE [cm]
Endo-SfMLearner	0.51
SC-SfMLearner	0.86
Shape from Shading	0.65
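
As a rough illustration of an RMSE between a reconstruction and a ground-truth 3D map, the sketch below computes the root mean square of nearest-neighbour distances between two point clouds. It assumes the clouds are already aligned and expressed in the same units. The function name `cloud_rmse` is hypothetical, and the alignment step and exact error definition used for the table above are not reproduced.

```python
import numpy as np


def cloud_rmse(reconstructed, ground_truth):
    """RMSE of nearest-neighbour distances from reconstructed points to ground truth."""
    # Pairwise distances, shape (n_reconstructed, n_ground_truth).
    # Brute force is fine for small clouds; large ones need a spatial index.
    diffs = reconstructed[:, None, :] - ground_truth[None, :, :]
    dists = np.linalg.norm(diffs, axis=2)
    nearest = dists.min(axis=1)
    return float(np.sqrt(np.mean(nearest ** 2)))


# Toy data: a flat unit square, reconstructed with a constant 0.1 offset in z.
gt = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 0.0]])
rec = gt + np.array([0.0, 0.0, 0.1])

rmse = cloud_rmse(rec, gt)
print(rmse)
```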

Reference

If you find our work useful in your research, or if you use parts of this code, please consider citing our paper:

@article{ozyoruk2020quantitative,
  title={Quantitative Evaluation of Endoscopic SLAM Methods: EndoSLAM Dataset},
  author={[dataset] Ozyoruk, Kutsev Bengisu and Gokceler, Guliz Irem and Incetan, Kagan and Coskun, Gulfize and Almalioglu, Yasin and Mahmood, Faisal and Durr, Nicholas J and Curto, Eva and Perdigoto, Luis and Oliveira, Marina and others},
  journal={arXiv preprint arXiv:2006.16670},
  year={2020}
}
