Predicting whether adult income exceeds $50k from census data using Spark and Python

George-Nyamao/Salary-prediction-pyspark

Salary-prediction-pyspark

Objective

The objective is to predict whether the income of adults in the census data exceeds $50k, based on various features collected.

Some of the features are:

 Age
 Work class
 Education
 Marital status
 Occupation
 Relationship in family
 Race
 Gender
 Capital gain
 Capital loss
 Hours per week
 Native country

We first build a model with Python, pandas, and scikit-learn, whose results are then compared with those of a model built with Spark and Python.
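As a rough illustration of the first step, the sketch below shows what a pandas and scikit-learn baseline might look like. It is not the repository's actual code: the column names, the label column `over_50k`, the helper `build_baseline`, and the choice of logistic regression are all assumptions made for the example, and the toy rows are made up purely to show the call pattern.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical column names; the repository's dataset may name them differently.
NUMERIC = ["age", "capital_gain", "capital_loss", "hours_per_week"]
CATEGORICAL = ["workclass", "education", "marital_status", "occupation",
               "relationship", "race", "gender", "native_country"]


def build_baseline() -> Pipeline:
    """Scale numeric columns, one-hot encode categorical ones, then classify."""
    preprocess = ColumnTransformer([
        ("num", StandardScaler(), NUMERIC),
        # handle_unknown="ignore" avoids errors on categories unseen during fit
        ("cat", OneHotEncoder(handle_unknown="ignore"), CATEGORICAL),
    ])
    # Logistic regression is an assumed model choice, not taken from the README.
    return Pipeline([
        ("preprocess", preprocess),
        ("clf", LogisticRegression(max_iter=1000)),
    ])


# Made-up rows, not real census records; over_50k is 1 when income exceeds $50k.
toy = pd.DataFrame({
    "age": [25, 47, 38, 52, 29, 61],
    "capital_gain": [0, 5000, 0, 15000, 0, 7000],
    "capital_loss": [0, 0, 200, 0, 100, 0],
    "hours_per_week": [40, 50, 35, 60, 40, 45],
    "workclass": ["Private", "Self-emp", "Private", "Private", "Gov", "Self-emp"],
    "education": ["HS-grad", "Masters", "HS-grad", "Doctorate", "Bachelors", "Masters"],
    "marital_status": ["Single", "Married", "Single", "Married", "Single", "Married"],
    "occupation": ["Sales", "Exec", "Clerical", "Exec", "Tech", "Exec"],
    "relationship": ["Own-child", "Husband", "Unmarried", "Wife", "Own-child", "Husband"],
    "race": ["A", "B", "A", "B", "A", "B"],
    "gender": ["Male", "Male", "Female", "Female", "Male", "Male"],
    "native_country": ["X", "X", "Y", "X", "Y", "X"],
    "over_50k": [0, 1, 0, 1, 0, 1],
})

features = toy[NUMERIC + CATEGORICAL]
model = build_baseline()
model.fit(features, toy["over_50k"])
preds = model.predict(features)  # array of 0/1 predictions, one per row
```

On the Spark side, a comparable pipeline could presumably be assembled from `pyspark.ml` stages such as `StringIndexer`, `OneHotEncoder`, `VectorAssembler`, and `LogisticRegression`, so that both models are scored on the same held-out split with the same metric.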
