Source code for Make it Easy: An Effective End-to-End Entity Alignment Framework. SIGIR 2021.
To run our code, first install required packages. Then run preprocess
pip install -r requirements.txt
sh preprocess.sh
Run on all dataset with default settings
First get NEAP results.
sh neap.sh
Then get SRS results.
python main.py --pair all
The SRS process need the result of NEAP. To get NEAP results on a specific dataset(e.g. en_fr)
python neap.py --pair en_fr
For fasttext, please download aligned word vectors wiki.{lang}.align.vec
and place them
into aligned_vectors/
folder.
mkdir aligned_vectors
cd aligned_vectors
wget https://dl.fbaipublicfiles.com/fasttext/vectors-aligned/wiki.en.align.vec
wget https://dl.fbaipublicfiles.com/fasttext/vectors-aligned/wiki.fr.align.vec
wget https://dl.fbaipublicfiles.com/fasttext/vectors-aligned/wiki.de.align.vec
After acquiring similarity matrices from NEAP, run main.py to refine.
python main.py --pair en_fr
Change arguments for different settings. To get help on arugments, run
python main.py --help
The refinement process is based on the code of MRAEA, RREA, GCN-Align. In our experiment, training is done on CPU.
We use the code of MRAEA, RREA, GCN-Align, DGMC, AttrGNN, OpenEA, EAKit, SimAlign.
DBP15k dataset is from GMNN and AttrGNN.
SRPRS dataset is from RSN.