Instruction:
- Unzip dataset.tar.bz2
- The folder contains multiple numpy or pickle files
- The file names show the dataset used with methods described in the paper.
- For ResIP-M1 and ResIP-M2 the train and test data contains input, single output (ResIP-M1), and output sequence (ResIP-M2) in tuple
- For ResIP-M3, the file names contains "ResIP-M3".
- The output is not scaled (use log2 transformation)
- The input represents sequence of embedding vectors for the researchers in the family graph/tree
- The output values represent family size during prediction period.
- For detail check the ResIP paper: https://doi.org/10.1016/j.joi.2023.101392