DIVAN: Deep-Interest Virality-Aware Network to Exploit Temporal Dynamics in News Recommendation

This is the official repository for the paper "DIVAN: Deep-Interest Virality-Aware Network to Exploit Temporal Dynamics in News Recommendation", published at the ACM RecSys Challenge 2024 (RecSys Challenge ’24).

DIVAN (Deep-Interest Virality-Aware Network), our solution for the RecSys 2024 Challenge, combines a Deep Interest Network (DIN) for personalized user interest representation with a Virality-Aware Click Predictor that utilizes temporal features to estimate click probability based on news popularity. A user-specific weight balances the influence of DIN and virality-based predictions, enhancing personalization and accuracy. Experiments on the Ekstra Bladet dataset from the Challenge demonstrate how promising DIVAN is in accuracy and beyond-accuracy performance.

Find more details in the paper: https://dl.acm.org/doi/10.1145/3687151.3687153.

This repository is built on top of FuxiCTR, a configurable, tunable, and reproducible library for CTR prediction.

Paper reference FuxiCTR:

Jieming Zhu, Jinyang Liu, Shuai Yang, Qi Zhang, Xiuqiang He. Open Benchmarking for Click-Through Rate Prediction. The 30th ACM International Conference on Information and Knowledge Management (CIKM), 2021.

If you use any part of this code, please cite the work:

@inproceedings{10.1145/3687151.3687153,
author = {Ferrara, Antonio and Valentini, Marco and Masciullo, Paolo and De Candia, Antonio and Abbattista, Davide and Fusco, Riccardo and Pomo, Claudio and Anelli, Vito Walter and Biancofiore, Giovanni Maria and Boratto, Ludovico and Narducci, Fedelucio},
title = {DIVAN: Deep-Interest Virality-Aware Network to Exploit Temporal Dynamics in News Recommendation},
year = {2024},
isbn = {9798400711275},
publisher = {Association for Computing Machinery},
url = {https://doi.org/10.1145/3687151.3687153},
doi = {10.1145/3687151.3687153},
booktitle = {Proceedings of the Recommender Systems Challenge 2024},
pages = {12–16},
series = {RecSysChallenge '24}
}

Setup virtual environment

If you want to use venv

Please set up the environment as follows (we used python 3.9 and python 3.10).

python3 -m venv recsys_din
source recsys_din/bin/activate
python -m pip install --upgrade pip
pip install --no-cache-dir -r requirements.txt

If you want to use Docker

make sure you have started the docker engine

Build the container:

   docker build -t recsyschallenge2024_din .

Run the container

   docker run -d -it --name recsyschallenge_container  recsyschallenge2024_din /bin/bash

Access the terminal of the container

docker exec -it recsyschallenge_container /bin/sh

Data Preparation and Model Training

Download and preprare data

python prepare_data_v1.py --size large --test --embedding_size 64 --neg_sampling

Train the model on train and validation sets:

python run_param_tuner.py --config config/DIVAN_ebnerd_large_x1_tuner_config_01.yaml --gpu 0

Make predictions on the test set:

Get the experiment_id from running logs or the result csv file DIVAN_ebnerd_large_x1_tuner_config_01.csv, and then you can run prediction on the test.
```
python submit.py --config config/DIVAN_ebnerd_large_x1_tuner_config_01 --expid DIVAN_ebnerd_large_x1_001_1860e41e --gpu 0
```

Data preparation and prediction with PopularRanker and ViralRanker

Download and preprare data

python prepare_data_pop_and_vir_scores.py --size large --test

Test the model on the validation set:
```
python run_[popular|virality]_expid.py
```
Make predictions on the test set:
```
 python submit_[popular|viral].py
```

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
config		config
data		data
src		src
utils		utils
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
create_dataset_with_virality.py		create_dataset_with_virality.py
entrypoint.sh		entrypoint.sh
evaluate_DIVAN.py		evaluate_DIVAN.py
evaluate_model.py		evaluate_model.py
fuxictr_version.py		fuxictr_version.py
prepare_data_pop_and_vir_scores.py		prepare_data_pop_and_vir_scores.py
prepare_data_pop_predictor.py		prepare_data_pop_predictor.py
prepare_data_v1.py		prepare_data_v1.py
prepare_data_vnew.py		prepare_data_vnew.py
preprocess_dataset_split.py		preprocess_dataset_split.py
requirements.txt		requirements.txt
run_expid.py		run_expid.py
run_expid_v2.py		run_expid_v2.py
run_param_tuner.py		run_param_tuner.py
run_popular_expid.py		run_popular_expid.py
run_virality_expid.py		run_virality_expid.py
split_dataset_in_chunk.py		split_dataset_in_chunk.py
submit.py		submit.py
submit_DIVAN.py		submit_DIVAN.py
submit_popular.py		submit_popular.py
submit_viral.py		submit_viral.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DIVAN: Deep-Interest Virality-Aware Network to Exploit Temporal Dynamics in News Recommendation

Setup virtual environment

If you want to use venv

If you want to use Docker

Data Preparation and Model Training

Data preparation and prediction with PopularRanker and ViralRanker

About

Releases

Packages

Contributors 5

Languages

License

sisinflab/DIVAN

Folders and files

Latest commit

History

Repository files navigation

DIVAN: Deep-Interest Virality-Aware Network to Exploit Temporal Dynamics in News Recommendation

Setup virtual environment

If you want to use venv

If you want to use Docker

Data Preparation and Model Training

Data preparation and prediction with PopularRanker and ViralRanker

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages