Automated Essay Grading

f70e03e · Jul 11, 2021

Goal

The goal of this project is to build a prediction model that scores student-written essays.

DATASET

The dataset used in this project comes from Kaggle: https://www.kaggle.com/c/asap-aes/overview. A copy is also available in the Dataset folder.

LIBRARIES NEEDED

Pandas
NumPy
scikit-learn
NLTK
Keras

WHAT I HAVE DONE

Import all the required libraries (see requirements.txt).
Upload the dataset and the Jupyter Notebook file.
Load the train and test data from the dataset.
Apply K-fold cross-validation and extract sentences from the essays in the training data.
Apply a Word2Vec model to convert the sentences into training and test word vectors.
Apply different models for prediction and calculate their evaluation metrics, mainly the Cohen's kappa score.
Prediction models used:
- LSTM
- Linear Regression Model
- Gradient Boosting Regressor
- Logistic Regression
- XGBoost Classifier
- Decision Tree Classifier
- Random Forest Classifier
- KNN
- Support Vector Regression
Choose the best model, i.e. the one with the highest kappa score, for the final score prediction on the test data.
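
The preprocessing steps above can be sketched in plain Python. This is only an illustration under stated assumptions, not the notebook's actual code: the regex sentence splitter stands in for an NLTK tokenizer, `toy_embeddings` is a hypothetical lookup table standing in for a trained Word2Vec model, and averaging the word vectors of an essay into one feature vector is an assumed (though common) way to turn sentences into model input. The helper names `essay_to_sentences`, `essay_to_vector` and `k_fold_indices` are made up for this sketch.

```python
import re


def essay_to_sentences(essay):
    """Split an essay into sentences, each a list of lowercase words.

    A regex stand-in for a proper sentence tokenizer such as NLTK's.
    """
    raw_sentences = re.split(r"(?<=[.!?])\s+", essay.strip())
    sentences = []
    for raw in raw_sentences:
        words = re.findall(r"[a-z']+", raw.lower())
        if words:
            sentences.append(words)
    return sentences


def essay_to_vector(sentences, embeddings, dim):
    """Average the vectors of all known words in the essay."""
    total = [0.0] * dim
    count = 0
    for words in sentences:
        for word in words:
            vec = embeddings.get(word)
            if vec is None:  # skip out-of-vocabulary words
                continue
            total = [t + v for t, v in zip(total, vec)]
            count += 1
    if count == 0:
        return total  # all zeros when no word is known
    return [t / count for t in total]


def k_fold_indices(n_samples, n_splits):
    """Yield (train_idx, test_idx) pairs for contiguous K-fold splits."""
    fold_sizes = [n_samples // n_splits] * n_splits
    for i in range(n_samples % n_splits):
        fold_sizes[i] += 1  # spread the remainder over the first folds
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


# Toy 2-dimensional embeddings (hypothetical values, not Word2Vec output).
toy_embeddings = {
    "good": [1.0, 0.0],
    "essay": [0.0, 1.0],
    "clear": [1.0, 1.0],
}

sentences = essay_to_sentences("A good essay. It is clear!")
essay_vector = essay_to_vector(sentences, toy_embeddings, dim=2)
folds = list(k_fold_indices(n_samples=5, n_splits=2))
```

In a real run the embedding table would come from a Word2Vec model trained on the training-fold sentences only, so that the held-out fold does not leak into the word vectors.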

Model Comparison

We applied nine machine learning algorithms, and each of them ran successfully. The models are compared by their Cohen's kappa scores, which are listed below.

| Name of the Model | Cohen's Kappa Score |
| --- | --- |
| LSTM | 0.87 |
| Linear Regression Model | 0.86 |
| Gradient Boosting Regressor | 0.95 |
| Logistic Regression | 0.46 |
| XGBoost Classifier | 0.89 |
| Decision Tree Classifier | 0.85 |
| Random Forest Classifier | 0.94 |
| KNN | 0.92 |
| Support Vector Regression | 0.60 |
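
For readers unfamiliar with the metric, here is a minimal pure-Python sketch of Cohen's kappa. It is not the code used to produce the scores above; `cohen_kappa` is a hypothetical helper written for illustration. The README does not say whether a weighted variant was used, so the sketch covers both the unweighted form and the quadratic-weighted form, which is a common choice for ordinal labels such as essay grades. Both follow kappa = 1 - (weighted observed disagreement) / (weighted disagreement expected by chance).

```python
def cohen_kappa(rater_a, rater_b, weights=None):
    """Cohen's kappa between two equal-length lists of integer labels.

    weights=None gives the unweighted kappa; weights="quadratic"
    penalises each disagreement by the squared label distance.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("inputs must be non-empty and of equal length")
    labels = sorted(set(rater_a) | set(rater_b))
    index = {label: i for i, label in enumerate(labels)}
    k = len(labels)
    n = len(rater_a)

    # Confusion matrix: rows follow rater_a, columns follow rater_b.
    observed = [[0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1

    row_totals = [sum(row) for row in observed]
    col_totals = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    disagreement_observed = 0.0
    disagreement_expected = 0.0
    for i in range(k):
        for j in range(k):
            if weights == "quadratic":
                w = (labels[i] - labels[j]) ** 2
            else:
                w = 0 if i == j else 1
            disagreement_observed += w * observed[i][j]
            # Chance-level count for cell (i, j) from the marginals.
            disagreement_expected += w * row_totals[i] * col_totals[j] / n
    if disagreement_expected == 0:
        return 1.0  # only one label in use, so the raters always agree
    return 1.0 - disagreement_observed / disagreement_expected


# Made-up scores for six essays (illustrative data only).
human_scores = [1, 2, 3, 3, 2, 1]
model_scores = [1, 2, 3, 2, 2, 1]

kappa_plain = cohen_kappa(human_scores, model_scores)
kappa_quadratic = cohen_kappa(human_scores, model_scores, weights="quadratic")
```

On this toy data the unweighted kappa is 0.75 and the quadratic-weighted kappa is 6/7, because the single disagreement is only one grade apart. In practice scikit-learn's `cohen_kappa_score` provides the same metric.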

Conclusion

Comparing the scores of all the algorithms, the Gradient Boosting regressor gives the highest Cohen's kappa score (0.95) and is therefore expected to give the best score predictions for our essay data.
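
The selection step amounts to taking the maximum over the reported scores. A minimal sketch using the values from the comparison above (the dictionary name `kappa_scores` is an illustrative choice, not taken from the notebook):

```python
# Kappa scores as reported in the model comparison.
kappa_scores = {
    "LSTM": 0.87,
    "Linear Regression Model": 0.86,
    "Gradient Boosting Regressor": 0.95,
    "Logistic Regression": 0.46,
    "XGBoost Classifier": 0.89,
    "Decision Tree Classifier": 0.85,
    "Random Forest Classifier": 0.94,
    "KNN": 0.92,
    "Support Vector Regression": 0.60,
}

# Pick the model name whose score is largest.
best_model = max(kappa_scores, key=kappa_scores.get)
best_score = kappa_scores[best_model]
```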

Code Contributed by Akash Jain (@Akash20x)
