Skip to content
/ ADM_HW4 Public

Repository for the 4th homework of the course ADM @ Sapienza University of Rome

Notifications You must be signed in to change notification settings

Wuj94/ADM_HW4

Repository files navigation

ADM_HW4

Repository for the 4th homework of the course ADM @ Sapienza University of Rome from group group #23 composed by Francisca Alliende, Giuseppe Calabrese and Francesco Russo

Incoming, a summary of the files of this repository. To access to a document just press the link in the name of the corresponding file.

Jupiter Notebook, with the code and coments of the entire homework

First Part: Does basic house information reflect house's description?

Modules:

Databases:

  • datasetindex.csv: database with all the announcements after the scrapping process.
  • datasetIndex_preprocessed.csv: database that contains the data from "datasetindex.csv", prepocessed.
  • datastIndex_infmatrix.csv: database with the informatrion matrix. Input for clusterization.
  • datastIndex_tfidf.csv: database, with the description matrix. Input for clusterization. Unfortunately not available due to its weight.

Second Part: Find the duplicates!

All the code and comments of this parts, they are contained in the main file Homework_4

About

Repository for the 4th homework of the course ADM @ Sapienza University of Rome

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •