The objective is to predict whether the income of adults in the census data will exceeds based on various features collected.
Some of the features are:
Age
Work class
Education
Marital-status
Occupation
Relationship in family
Race
Gender
Capital gain
Capital loss
Hours per week
Native country
We first build a model with Python, Pandas, and Scikit-learn libraries, who's result will be compared with the model built with Spark and Python.