Identifying Suspicious URLs: An Application of Large-Scale Online Learning (Python Reproducibility Experiment)
A reproduction of the experiments in https://cseweb.ucsd.edu//~jtma/papers/url-icml2009.pdf using current Python machine-learning libraries.
For more details, read the accompanying research paper in the project files (Seminararbeit-Identifying-Suspicious-URLs.pdf).
Download the provided data in Matlab format from http://www.sysnet.ucsd.edu/projects/url/#code and install the necessary dependencies.
Set the path to the downloaded url.mat file in the top cell of every notebook.
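
As a rough illustration of what the top cell of a notebook might contain, the sketch below sets the path and fails early if the file is missing. The names URL_MAT_PATH, resolve_data_path and list_mat_variables, as well as the data/url.mat location, are assumptions made for this example and are not taken from the notebooks. The optional second helper assumes SciPy is among the installed dependencies and uses scipy.io.whosmat to inspect the file without loading it.

```python
from pathlib import Path

# Hypothetical location; replace with wherever url.mat was saved.
URL_MAT_PATH = "data/url.mat"


def resolve_data_path(path):
    """Return an absolute Path to the data file, or raise if it is missing."""
    resolved = Path(path).expanduser().resolve()
    if not resolved.is_file():
        raise FileNotFoundError(f"url.mat not found at {resolved}")
    return resolved


def list_mat_variables(path):
    """List (name, shape, dtype) for each variable stored in the .mat file."""
    # Imported lazily so the path check above works without SciPy installed.
    from scipy.io import whosmat

    return whosmat(str(resolve_data_path(path)))
```

Calling resolve_data_path(URL_MAT_PATH) at the start of a notebook surfaces a wrong path immediately instead of partway through an experiment.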