Student Performance Prediction Using Machine Learning

This repository presents a machine learning project that analyzes student performance data and predicts grades based on average marks. The project is designed to introduce students to essential data science practices, including data preprocessing, visualization, feature engineering, and model training with Naive Bayes and Decision Tree classifiers.

Project Overview

The objective of this project is to:

Explore the Student Performance dataset by visualizing score distributions and examining factors such as gender and parental education.
Perform feature engineering by calculating average marks and assigning grades based on scores.
Train and evaluate Naive Bayes and Decision Tree models to classify students into grade categories.
Save the trained Decision Tree model to a joblib file so it can be reused and deployed without retraining.
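
The feature engineering step above can be sketched as follows. This is a minimal illustration, not the notebook's actual code: the column names in SCORE_COLUMNS and the grade cut-offs in assign_grade are assumptions and should be adjusted to match the dataset and the grading scheme actually used.

```python
import pandas as pd

# Assumed column names; adjust to match the headers in the actual CSV.
SCORE_COLUMNS = ["math score", "reading score", "writing score"]


def assign_grade(average: float) -> str:
    """Map an average mark to a letter grade (illustrative cut-offs only)."""
    if average >= 90:
        return "A"
    if average >= 80:
        return "B"
    if average >= 70:
        return "C"
    if average >= 60:
        return "D"
    if average >= 50:
        return "E"
    return "F"


def add_average_and_grade(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with 'average' and 'grade' columns added."""
    out = df.copy()
    out["average"] = out[SCORE_COLUMNS].mean(axis=1)
    out["grade"] = out["average"].apply(assign_grade)
    return out


# Three made-up students, used only to show the shape of the result.
sample = pd.DataFrame({
    "math score": [95, 72, 40],
    "reading score": [90, 78, 45],
    "writing score": [91, 75, 50],
})
result = add_average_and_grade(sample)
print(result[["average", "grade"]])
```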

Key Features

Data Visualization: Includes histograms and bar charts to visualize scores and understand relationships.
Feature Engineering: Adds new columns for average marks and grades to enhance predictive modeling.
Machine Learning Models: Uses Naive Bayes and Decision Tree classifiers for grade prediction.
Model Persistence: Saves the Decision Tree model for later use with joblib.
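
As a rough sketch of the model training feature, the snippet below fits both classifiers on synthetic data and compares their test accuracy. The random scores, the grade cut-offs, the 0 = "A" to 5 = "F" encoding, and the choice of GaussianNB as the Naive Bayes variant are all assumptions made for illustration; the notebook may differ.

```python
import numpy as np
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the real dataset: 500 students, three subject scores.
rng = np.random.default_rng(0)
scores = rng.integers(0, 101, size=(500, 3))
average = scores.mean(axis=1)

# Assumed encoding: 0 = "A" (average >= 90) down to 5 = "F" (average < 50).
cutoffs = [50, 60, 70, 80, 90]
grade_code = 5 - np.digitize(average, cutoffs)

# The single input feature is the average mark.
X = average.reshape(-1, 1)
X_train, X_test, y_train, y_test = train_test_split(
    X, grade_code, test_size=0.25, random_state=0
)

nb_model = GaussianNB().fit(X_train, y_train)
tree_model = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

nb_acc = accuracy_score(y_test, nb_model.predict(X_test))
tree_acc = accuracy_score(y_test, tree_model.predict(X_test))
print(f"Naive Bayes accuracy:   {nb_acc:.3f}")
print(f"Decision Tree accuracy: {tree_acc:.3f}")
```

Because the grade here is a step function of the average mark, a decision tree can recover the cut-offs almost exactly, which is one plausible reason to persist the Decision Tree rather than the Naive Bayes model.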

Dataset

The dataset contains student scores in three subjects (Math, Reading, Writing), along with demographic information such as gender and parental education, which makes it suitable for educational analysis and modeling.

Notable Observations

Average marks and numeric grade values are inversely correlated. This is an artifact of the grade encoding rather than a property of the data: higher grades (e.g., "A") are mapped to lower numbers and lower grades (e.g., "F") to higher numbers, so the numeric grade value falls as the average mark rises.
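
A small example makes the sign of this correlation concrete. The marks below are made up, and the encoding (0 for "A" up to 5 for "F") is an assumed version of the mapping described above.

```python
import numpy as np

# Assumed encoding: best grade gets the smallest number.
GRADE_TO_CODE = {"A": 0, "B": 1, "C": 2, "D": 3, "E": 4, "F": 5}

# Made-up average marks with the letter grade each would receive.
averages = np.array([93.0, 88.0, 81.0, 74.0, 66.0, 58.0, 47.0, 35.0])
letters = ["A", "B", "B", "C", "D", "E", "F", "F"]
codes = np.array([GRADE_TO_CODE[g] for g in letters])

# Pearson correlation between average mark and numeric grade code.
r = np.corrcoef(averages, codes)[0, 1]
print(f"correlation = {r:.3f}")  # negative: codes rise as marks fall
```

Reversing the mapping (for example, 5 for "A" down to 0 for "F") would flip the sign without changing the strength of the relationship.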

Requirements

Python 3.x
Required Libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, Joblib

Usage

After training, call the predict function with new input scores to get a predicted grade, or load the saved Decision Tree model with joblib for deployment.
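
The snippet below sketches what loading and prediction might look like. It trains a small stand-in tree and writes it to a temporary file so the example runs on its own; in practice you would point joblib.load at the model file shipped with the project. The predict_grade helper, the single average-mark feature, and the grade encoding are hypothetical and must match whatever the saved model was actually trained on.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Assumed encoding: code 0 = "A" ... code 5 = "F".
GRADE_LETTERS = ["A", "B", "C", "D", "E", "F"]

# Stand-in model trained on every integer average from 0 to 100,
# using illustrative cut-offs (not necessarily the project's).
train_avg = np.arange(0, 101, dtype=float).reshape(-1, 1)
train_codes = 5 - np.digitize(train_avg.ravel(), [50, 60, 70, 80, 90])
model = DecisionTreeClassifier(random_state=0).fit(train_avg, train_codes)

# Persist to a temporary path; replace with the real model path in practice.
model_path = os.path.join(tempfile.mkdtemp(), "model.joblib")
joblib.dump(model, model_path)


def predict_grade(math: float, reading: float, writing: float,
                  path: str = model_path) -> str:
    """Load the saved model and return a letter grade for three scores."""
    loaded = joblib.load(path)
    average = (math + reading + writing) / 3
    code = int(loaded.predict(np.array([[average]]))[0])
    return GRADE_LETTERS[code]


print(predict_grade(95, 90, 91))
print(predict_grade(40, 45, 50))
```

Only load joblib files from sources you trust, since joblib uses pickle under the hood and loading a file can execute arbitrary code.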

Project Structure

data/: Contains the dataset.
notebooks/: Jupyter notebooks with code and explanations.
models/: Contains the saved Decision Tree model in a joblib file.
README.md: Project overview and instructions.

Contributing

Contributions are welcome! Feel free to open issues or submit pull requests to improve this project.
