Using Data Science Preprocessing Techniques for SCIN Dataset to Enhanced Secretary Bird Optimization Algorithm for Image Segmentation Research Paper.🛠️

The data science techniques utilized in this repository aim to enhance Optimization Algorithm for image segmentation. This work is a collaborative effort with a dedicated team, contributing to the advancement of research in this area.

To access Enhanced Secretary Bird Optimization Algorithm for Image Segmentation Paper click HERE

The SCIN (Skin Condition Image Network) open access dataset aims to supplement publicly available dermatology datasets from health system sources with representative images from internet users. To this end, the SCIN dataset was collected from Google Search users in the United States through a voluntary, consented image donation application. The SCIN dataset is intended for health education and research, and to increase the diversity of dermatology images available for public use.

The SCIN dataset contains 5,000+ volunteer contributions (10,000+ images) of common dermatology conditions. Contributions include Images, self-reported demographic, history, and symptom information, and self-reported Fitzpatrick skin type (sFST). In addition, dermatologist labels of the skin condition and estimated Fitzpatrick skin type (eFST) and layperson estimated Monk Skin tone (eMST) labels are provided for each contribution.

The data is stored in the dx-scin-public-data bucket on Google Cloud Storage. Check out the load_SCIN.ipynb notebook for a quick review of how to access the dataset and the (Dataset Description) for an overview of its schema.

Note: This dataset contains images of medical conditions, some of which may be sensitive and/or graphic in nature.

Data Science Life Cycle:

Defining SCIN Data Set
Defines Global Parameters for GCP
Dataset Schema
Intialize Google Cloud Storage client to Load CSV label files
Display the random images
Identify Invalid Images
Reverse Engineering
Drop One Hot Encoded Columns
Feature Engineering
Impute Missing Values & Fix Unbalanced Data
Dropping Unneeded Columns
Feature Engineering Part 2
Rename the Columns and its Values
Build Neural Network Model
Data Visualization\

Known issues:

There are 15 images that are duplicates (and appear 42 times total) in the data. Because this data was used for the paper, it's been included in the release.
There are 48 cases where the case is marked as gradable but no skin condition label is present. This happens for cases where they were marked as ungradable due to multiple conditions present.
Issue #1: 1 image file is missing

To access the Documentation that contains a step by step guide click HERE

To access the Data Before Preprocessing click HERE

To access the Data After Preprocessing click HERE

To access the 15 Valid Records click HERE

There is Two Notebooks I Used:

First Notebook is for data preproccessing and handeling the Structured Data, to acces it click HERE
Second Notebook is for image preprocessing and validate records, to access it click HERE

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Data Preprocessing Documentation.docx		Data Preprocessing Documentation.docx
Dataset Description		Dataset Description
Dermatology_Paper.pdf		Dermatology_Paper.pdf
Flow Chart Diagram.pdf		Flow Chart Diagram.pdf
Flow Chart.png		Flow Chart.png
Original_dataset.csv		Original_dataset.csv
Preproccessed_dataset.csv		Preproccessed_dataset.csv
README.md		README.md
SCIN.csv		SCIN.csv
Val_Samples.csv		Val_Samples.csv
load_SCIN.ipynb		load_SCIN.ipynb
preprocessing.ipynb		preprocessing.ipynb
scin_notebook.ipynb		scin_notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using Data Science Preprocessing Techniques for SCIN Dataset to Enhanced Secretary Bird Optimization Algorithm for Image Segmentation Research Paper.🛠️

The data science techniques utilized in this repository aim to enhance Optimization Algorithm for image segmentation. This work is a collaborative effort with a dedicated team, contributing to the advancement of research in this area.

To access Enhanced Secretary Bird Optimization Algorithm for Image Segmentation Paper click HERE

Data Science Life Cycle:

Known issues:

To access the Documentation that contains a step by step guide click HERE

To access the Data Before Preprocessing click HERE

To access the Data After Preprocessing click HERE

To access the 15 Valid Records click HERE

There is Two Notebooks I Used:

About

Releases

Packages

Languages

sahermuhamed1/Data-Science-Techniques-for-Enhanced-Optimization-Algorithm

Folders and files

Latest commit

History

Repository files navigation

Using Data Science Preprocessing Techniques for SCIN Dataset to Enhanced Secretary Bird Optimization Algorithm for Image Segmentation Research Paper.🛠️

The data science techniques utilized in this repository aim to enhance Optimization Algorithm for image segmentation. This work is a collaborative effort with a dedicated team, contributing to the advancement of research in this area.

To access Enhanced Secretary Bird Optimization Algorithm for Image Segmentation Paper click HERE

Data Science Life Cycle:

Known issues:

To access the Documentation that contains a step by step guide click HERE

To access the Data Before Preprocessing click HERE

To access the Data After Preprocessing click HERE

To access the 15 Valid Records click HERE

There is Two Notebooks I Used:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages