Skip to content

jammy-bot/home-price-predictors

 
 

Repository files navigation

Module 2 Final Project

Images of Seattle

Introduction

This project focuses on building a multiple linear regression model capable of predicting home prices.

Objectives

Happy Clap GIF By The Boss Baby

Through this project, I explore the following questions and why:

  • What is the mean price for homes in King County?
  • What is the mean price within 20 miles of the city center?
  • What is the mean number of bedrooms for home sales?
  • Which influences price more: the number of bedrooms or the number of square feet?
  • Do renovations make a significant difference in such a competitive market?

The Dataset

correlations heatmap

King County housing data was provided for this project as a single, comma - separated flat file. The file includes housing records for greater than 20,000 properties. Also provided was the following column names and data description for the data set:

  • id - unique identified for a house
  • date - house was sold
  • price - is prediction target
  • bedrooms - Number of Bedrooms/House
  • bathrooms - Number of bathrooms/bedrooms
  • sqft_living - square footage of the home
Click to view more!
  • sqft_lot - square footage of the lot
  • floors - floors (levels) in house
  • waterfront - House which has a view to a waterfront
  • view - Has been viewed
  • condition - How good the condition is ( Overall )
  • grade - overall grade given to the housing unit, based on King County grading system
  • sqft_above - square footage of house apart from basement
  • sqft_basement - square footage of the basement
  • yr_built - Built Year
  • yr_renovated - Year when house was renovated
  • zipcode - zip
  • lat - Latitude coordinate
  • long - Longitude coordinate
  • sqft_living15 - The square footage of interior housing living space for the nearest 15 neighbors
  • sqft_lot15 - The square footage of the land lots of the nearest 15 neighbors

Featured Notebooks/Analysis

Visualizations & EDA

    King County prices overview

  • visualizations, with corresponding interpretations are included within the relevant notebook(s).

Non-Technical Presentation

Technologies

framework: jupyter notebook

languages: python

libraries: pandas, numpy, scipy, statsmodels, sci-kit learn, pickle

plot libraries: seaborn, matplotlib

About

Multiple linear regression in Python, for home price prediction

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.9%
  • HTML 0.1%