Skip to content

ThankiJay/Exasens

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exacens Dataset Analysis

Project Overview

This repository contains the code and documentation for the EAS 509 Statistical Data Mining II Project 1, conducted by Jay Yogesh Thanki, and Rohan Ishwarlal Patel. The project involves a comprehensive analysis of the Exacens dataset using data science techniques, including data cleaning, exploratory data analysis (EDA), and modeling.

Team Members

Project Structure

The repository is organized as follows:

  1. Code:

    • data_cleaning.rmd: Jupyter Notebook containing the code for data cleaning and preprocessing.
    • eda.rmd: Jupyter Notebook for exploratory data analysis (EDA) on the Exacens dataset.
    • modeling.rmd: Jupyter Notebook with code for data modeling using various machine learning algorithms.
  2. Data:

    • exacens_dataset.csv: The Exacens dataset used for analysis.
  3. Images:

    • Contains images generated during data visualization and analysis.
  4. README.md:

    • This file providing an overview of the project, team members, and project structure.

Abstract

This project presents a thorough analysis of the Exacens dataset, including data cleaning, exploratory data analysis, and data modeling. The team applied various data science techniques to ensure the reliability of the analysis and gain valuable insights from the dataset. The modeling phase involved the experimentation with different machine learning models, with a detailed evaluation of their performance. The selected best-performing model is justified based on its alignment with the dataset characteristics.

How to Use

  1. Clone the repository to your local machine.
  2. Open the Jupyter Notebooks (data_cleaning.ipynb, eda.ipynb, modeling.ipynb) in a Jupyter Notebook environment.
  3. Execute the cells in each notebook sequentially to replicate the analysis.

Feel free to explore the code, data, and findings presented in the notebooks.

Conclusion

The project contributes to a deeper understanding of the Exacens dataset, showcasing the potential of data science in extracting actionable insights from complex data. The selected model and findings can serve as a reference for future studies in similar domains.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published