This repository contains a python script that performs collocation analysis and puts the results in a dataframe that can be exported to csv format. Example data is provided from the One Health dataset available on PubMed Central.
The script uses a csv file as the dataset where each row contains a new document.
nltk.tokenize
nltk.collocations
nltk.corpus
pandas