A Novel Adversarial Approach for EEG Dataset Refinement: Enhancing Generalization through Proximity-to-Boundary Scoring

This repository is the official implementation of the Proximity-to-Boundary Score (PBS), written in PyTorch Lightning style:

TBA

Overall Framework

[Overall framework figure]

Abstract

Because deep learning performs remarkably well at pattern recognition in complex data, it is used to interpret user intentions from electroencephalography (EEG) signals. However, deep learning models trained on EEG datasets have low generalization ability owing to the numerous noisy samples those datasets contain. Therefore, pioneering research has focused on distinguishing and eliminating noisy samples from datasets. One intuitive solution exploits a property of noisy samples during the training phase: after model training, noisy samples lie near the decision boundary, so they can be detected using a gradient-based adversarial attack. However, this intuitive solution has limited usability because it requires additional hyperparameter optimization, resulting in a trade-off between accuracy and efficiency. In this paper, we propose a novel training framework that enhances the generalization ability of the model by reducing the influence of noisy samples during training, without additional hyperparameter optimization. We designed the proximity-to-boundary score (PBS) to continuously measure how close data lie to the decision boundary. The proposed framework improved the generalization ability of the model across two motor imagery datasets and one sleep stage dataset. We qualitatively confirmed that data with low PBS are indeed noisy samples and degrade model training. Hence, we demonstrated that the proposed framework accurately and efficiently mitigates the influence of noisy samples, enhancing the model's generalization capabilities.

Algorithm of the proposed framework

[Algorithm figure]
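The exact PBS definition follows the paper (reference TBA). As a rough, purely illustrative sketch of the idea described in the abstract — and not the repository's implementation — proximity to the boundary can be probed with a gradient-based adversarial attack: samples whose predictions flip after only a few small gradient steps lie close to the decision boundary. All names and the scoring rule below are hypothetical.

```python
import torch
import torch.nn.functional as F

def proximity_to_boundary(model, x, y, step_size=1e-3, max_steps=100):
    """Hypothetical sketch (batch size 1): count gradient-ascent steps on the
    loss until the predicted label flips. Fewer steps => closer to the
    decision boundary. Returns a score in (0, 1]; a low score suggests a
    likely noisy sample."""
    model.eval()
    x_adv = x.clone().detach().requires_grad_(True)
    for step in range(1, max_steps + 1):
        logits = model(x_adv)
        if logits.argmax(dim=1).item() != y.item():
            return step / max_steps  # flipped early: near the boundary
        loss = F.cross_entropy(logits, y)
        model.zero_grad()
        loss.backward()
        with torch.no_grad():
            # FGSM-like signed-gradient step away from the current label
            x_adv = x_adv + step_size * x_adv.grad.sign()
        x_adv.requires_grad_(True)
    return 1.0  # never flipped within the budget: far from the boundary
```

Under this sketch, a sample sitting almost on the boundary flips within a couple of steps and receives a near-zero score, while a confidently classified sample keeps its label for the whole budget and receives 1.0.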

1. Installation

1.1 Clone this repository

```shell
$ git clone https://github.com/comojin1994/proximity-to-boundary-score.git
```

1.2 Environment setup

Create the Docker container and the `databases` and `logs` directories with the following script:

```shell
$ cd docker
$ make start.train
$ docker exec -it torch-train bash
$ cd proximity-to-boundary-score
```

1.3 Preparing data

After downloading the BCI Competition IV 2a & 2b and Sleep-EDF datasets, revise the data directories in the `datasets/setups/{dataset}.py` files:

```python
# datasets/setups/{dataset}.py
# We provide the official code using BCI Competition IV 2a.
BASE_PATH = {Dataset directory}
SAVE_PATH = {Revised dataset directory}
```

```shell
$ make setup
```

2. Quantitative Analysis

2.1 Comparison of performances in motor imagery classification datasets

| Method | BCIC2a EEGNet | BCIC2a DeepConvNet | BCIC2a ShallowConvNet | BCIC2b EEGNet | BCIC2b DeepConvNet | BCIC2b ShallowConvNet |
|---|---|---|---|---|---|---|
| Baseline | 61.51 $\pm$ 0.96 | 56.97 $\pm$ 0.48 | 59.55 $\pm$ 0.75 | 77.24 $\pm$ 0.36 | 76.44 $\pm$ 0.22 | 76.77 $\pm$ 0.26 |
| Random Dropout | 61.79 $\pm$ 0.34 | 56.74 $\pm$ 0.38 | 59.26 $\pm$ 0.54 | 77.05 $\pm$ 0.26 | 76.42 $\pm$ 0.16 | 76.45 $\pm$ 0.20 |
| MC Dropout | 62.16 $\pm$ 0.56 | 62.24 $\pm$ 0.34 | 63.44 $\pm$ 0.37 | 77.27 $\pm$ 0.20 | 79.55 $\pm$ 0.07 | 78.58 $\pm$ 0.27 |
| Influence Score | 62.48 $\pm$ 1.18 | 61.64 $\pm$ 1.01 | 64.09 $\pm$ 0.67 | 78.66 $\pm$ 0.17 | 80.34 $\pm$ 0.03 | 80.20 $\pm$ 0.19 |
| Forgetting Score | 61.85 $\pm$ 0.78 | 59.62 $\pm$ 0.01 | 60.09 $\pm$ 0.55 | 77.89 $\pm$ 0.25 | 80.28 $\pm$ 0.01 | 77.40 $\pm$ 0.29 |
| DRAA ($\alpha = \text{1e-3}$) | 63.54 $\pm$ 0.95 | 56.71 $\pm$ 0.42 | 60.90 $\pm$ 0.71 | 78.27 $\pm$ 0.35 | 79.45 $\pm$ 1.06 | 79.25 $\pm$ 0.18 |
| DRAA ($\alpha = \text{1e-5}$) | 63.91 $\pm$ 0.35 | 62.16 $\pm$ 0.54 | 62.70 $\pm$ 0.40 | 78.35 $\pm$ 0.14 | 79.80 $\pm$ 0.22 | 80.03 $\pm$ 0.21 |
| DRAA ($\alpha = \text{1e-7}$) | 64.68 $\pm$ 0.43 | 62.59 $\pm$ 0.14 | 63.35 $\pm$ 1.16 | 78.62 $\pm$ 0.33 | 80.27 $\pm$ 0.18 | 80.98 $\pm$ 0.29 |
| Proposed | 64.90 $\pm$ 0.46 | 62.67 $\pm$ 0.47 | 64.31 $\pm$ 1.00 | 78.67 $\pm$ 0.23 | 80.31 $\pm$ 0.14 | 80.41 $\pm$ 0.16 |

2.2 Comparison of performances in sleep stage classification datasets

2.3 Comparison of efficiency

| Method | Complexity | $\tau$ opt. | GPU Min. |
|---|---|---|---|
| Influence Score | $\mathcal{O}(ME+Mp^2+p^3+M^2p^2)$ | | 116.63 |
| MC Dropout | $\mathcal{O}(ME+MS_\text{MC}f)$ | | 0.42 |
| Forgetting Score | $\mathcal{O}(ME)$ | | - |
| DRAA ($\alpha = \text{1e-3}$) | $\mathcal{O}(ME+M\bar{S}_\text{1e-3}f)$ | | 0.05 |
| DRAA ($\alpha = \text{1e-5}$) | $\mathcal{O}(ME+M\bar{S}_\text{1e-5}f)$ | | 6.2 |
| DRAA ($\alpha = \text{1e-7}$) | $\mathcal{O}(ME+M\bar{S}_\text{1e-7}f)$ | | 839.15 |
| Proposed | $\mathcal{O}(ME+MSf)$ | ✗ | 1.48 |

3. Qualitative Analysis

3.1 Signal visualization

3.2 Feature visualization

4. User Manual

STAGE 1: Training model $f$ using $X_{train}$

```yaml
# config.yaml
mode: "all"
litmodel: "base"
...
score: null
...
weighted_loss: False
```

```shell
$ make train
```

STAGE 2: Evaluate the proximity of data to the boundary

Run the `evaluation.ipynb` Jupyter notebook and revise the options in the config files:

```python
# Block [2] in evaluation.ipynb
args = get_configs()
args = init_configs(args)
init_settings(args)

args.WEIGHT_PATH = "{checkpoint path}"
```
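STAGE 2 leaves each training sample with a raw boundary score. As a hedged illustration only (this is not the repository's code; `normalize_scores` is a hypothetical helper), such raw scores could be min-max normalized into $[0, 1]$ so they can act directly as per-sample loss weights in STAGE 3:

```python
import torch

def normalize_scores(raw):
    """Hypothetical min-max normalization of raw boundary scores to [0, 1].
    A small epsilon guards against division by zero when all scores match."""
    lo, hi = raw.min(), raw.max()
    return (raw - lo) / (hi - lo + 1e-12)
```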

STAGE 3: Get $\hat{\theta}$ utilizing PBS-guided soft rejection

```yaml
# config.yaml
mode: "cls"
litmodel: "weighted"
...
score: "pbs"
...
weighted_loss: True
```

```shell
$ make train
```
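STAGE 3 re-trains the model with a PBS-weighted loss ("soft rejection"): low-PBS samples are down-weighted rather than removed. As an illustrative sketch only — the official weighted litmodel lives in this repository, and `pbs_weighted_loss` below is a hypothetical stand-in — a per-sample weighted cross-entropy might look like:

```python
import torch
import torch.nn.functional as F

def pbs_weighted_loss(logits, targets, pbs):
    """Hypothetical sketch: scale each sample's cross-entropy by its PBS,
    so samples near the boundary (low PBS, likely noisy) contribute less."""
    per_sample = F.cross_entropy(logits, targets, reduction="none")
    return (pbs * per_sample).sum() / pbs.sum()
```

With all weights equal to one, this reduces to the ordinary mean cross-entropy; with a weight of zero, a sample is effectively rejected from the gradient.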