- (Windows-only) Visual C++ Build Tools: Scroll down and select the "Build Tools for Visual Studio 2019" download.
- Download Anaconda3, selecting the Python 3.x version 64-bit installer, and install it.
- (Windows) During installation use the default settings, and ensure the boxes that add `conda` to your PATH are checked.
- Download the YAML (`openrefine.yml`) file from this repo or clone it. `cd` to where you downloaded the YAML file.
- Create the `conda` environment from the YAML file:
$ conda env create -f openrefine.yml
- Activate the environment:
$ source activate refine3
- If your shell complains (and/or you are using Anaconda Prompt), instead drop the `source` part:
$ activate refine3
Depending on your shell, it should say `(refine3)` in your command prompt.
Make sure you have activated the environment as described above (you must see `refine3` in your prompt).
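If the prompt does not make it obvious which environment is active, a small Python check can confirm it. The sketch below relies on the `CONDA_DEFAULT_ENV` environment variable, which conda sets on activation. The function name `active_conda_env` is made up for this example, and `refine3` is the environment name used in the steps above.

```python
import os

EXPECTED_ENV = "refine3"  # environment name used in the steps above


def active_conda_env(environ=os.environ):
    """Return the name of the active conda environment, or None."""
    # conda sets CONDA_DEFAULT_ENV when an environment is activated.
    return environ.get("CONDA_DEFAULT_ENV") or None


if __name__ == "__main__":
    name = active_conda_env()
    if name == EXPECTED_ENV:
        print(f"OK: '{EXPECTED_ENV}' is active")
    else:
        print(f"Active environment is {name!r}, expected '{EXPECTED_ENV}'")
```

Save it under any file name and run it with `python` from the same shell you plan to start the reconciliation service in.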
- Download or clone an existing reconciliation service
- FAST
- GeoNames (Note: requires special setup of API user account)
- Library of Congress
- VIAF
- Wikidata (Note: newer versions of OpenRefine have built-in reconciliation to Wikidata)
- Open your shell
- `cd` to where you saved the reconciliation service
- Run it (It runs in the background. You will connect to it via OpenRefine)
$ python script_name.py
- Open OpenRefine to begin reconciliation
- Detailed instructions on running the reconciliation.
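Before adding the service in OpenRefine, it can help to confirm that the script started with `python script_name.py` is actually responding. The sketch below assumes the service answers a plain GET with a JSON service manifest, as reconciliation services typically do, and that it listens on `http://localhost:5000/reconcile`. That address is an assumption, not something these instructions specify; use the URL your service prints on startup. The helper name `fetch_manifest` is also made up for this example.

```python
import json
import urllib.request

# Assumption: the service listens here; check the script's startup output.
SERVICE_URL = "http://localhost:5000/reconcile"


def fetch_manifest(url=SERVICE_URL, timeout=5):
    """GET the service URL and return its JSON manifest as a dict."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    try:
        manifest = fetch_manifest()
    except (OSError, ValueError) as err:
        # OSError covers connection failures; ValueError covers non-JSON replies.
        print(f"Service not usable at {SERVICE_URL}: {err}")
    else:
        print("Service is up:", manifest.get("name", "(no name in manifest)"))
```

If the check succeeds, the same URL is the one to enter when adding a reconciliation service in OpenRefine.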