EM Algorithm for Gaussian Mixture Model (GMM)

This project implements the Expectation-Maximization (EM) algorithm for Gaussian Mixture Model (GMM) from scratch in Python. The aim is to provide a deeper understanding of the EM algorithm and its application in clustering via GMM. Additionally, the performance of the custom implementation is compared against the inbuilt functions of the scikit-learn library.

Introduction

The Gaussian Mixture Model (GMM) is a probabilistic model that represents the presence of subpopulations within an overall population. The EM algorithm is an iterative method for finding maximum likelihood estimates of parameters in statistical models, where the model depends on unobserved latent variables. In this project, we delve into implementing the EM algorithm specifically for GMM from scratch in Python.

Usage

To use this implementation, simply clone the repository:

git clone https://github.com/abdulsamad183/EM_for_GMM_from_scratch.git

Dependencies

Python 3.x
NumPy
scikit-learn

Dataset

The Iris dataset is used for training the GMM and comparing performance. It is a classic dataset widely used for classification and clustering tasks. The dataset consists of 150 samples with 4 features each, representing 3 classes of iris plants.

Custom Implementation

The custom implementation of the EM algorithm for GMM is located in the .ipynb file. It includes functions for initializing parameters, calculating the expectation step, the maximization step, and iterating until convergence.

Performance Comparison

The performance of the custom implementation is compared with the inbuilt functions provided by the scikit-learn library. The comparison includes metrics such as convergence rate, and clustering accuracy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

EM Algorithm for Gaussian Mixture Model (GMM)

Table of Contents

Introduction

Usage

Dependencies

Dataset

Custom Implementation

Performance Comparison

Files

README.md

Latest commit

History

README.md

File metadata and controls

EM Algorithm for Gaussian Mixture Model (GMM)

Table of Contents

Introduction

Usage

Dependencies

Dataset

Custom Implementation

Performance Comparison