Automated EDA Web App

Overview

This project is a web application built with Streamlit that automates Exploratory Data Analysis (EDA) for CSV files. It provides an interactive interface for users to upload datasets and visualize various statistics, correlations, and distributions. The app features:

Dataset Overview: Displays the dataset, its shape, and basic statistics.
Correlation Chart: Visualizes correlations between continuous features.
Missing Values Distribution: Shows the distribution of missing values in the dataset.
Individual Column Stats: Analyzes and visualizes statistics for both continuous and categorical features.
Feature Relationships: Explores relationships between features using scatter plots.

Features

Interactive Data Exploration: View data, statistics, and visualizations interactively.
Charts and Graphs: Correlation heatmaps, missing values bar charts, histograms, and bar charts for categorical features.
Customizable Views: Select features to analyze and visualize different aspects of the data.

Installation

To run this application, ensure you have Python installed, and then install the required libraries listed in the requirements.txt file:

pip install -r requirements.txt

Usage

Clone the repository:

git clone https://github.com/your-username/your-repo-name.git

Run the app:

streamlit run app.py

Upload dataset. If uploading dataset results in axios error, try running the app using:
```
streamlit run app.py --server.enableXsrfProtection false
```
Explore the Application:

Dataset Overview Tab: View the dataset, basic statistics, correlation chart, and missing values distribution.
Individual Column Stats Tab: Analyze and visualize statistics for selected continuous or categorical features.
Feature Relationships Tab: Examine relationships between features with scatter plots, including color encoding for categorical features.

Sample Dataset

You can use the Titanic Dataset from Kaggle as a sample dataset to explore the app's features.

Dependencies

Streamlit Pandas Numpy Bokeh==2.4.3 Matplotlib Missingno

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
App.py		App.py
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automated EDA Web App

Overview

Features

Installation

Usage

Sample Dataset

Dependencies

About

Releases

Packages

Languages

Kiahmin/Automate-EDA

Folders and files

Latest commit

History

Repository files navigation

Automated EDA Web App

Overview

Features

Installation

Usage

Sample Dataset

Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages