General Conference Word Frequency

My initial script, written some time ago, was rough and ugly. I updated it to use the Natural Language Toolkit (NLTK) and the CountVectorizer from scikit-learn. It's now much quicker and more accurate.

The script provides a word count for LDS General Conference talks.

Dependencies

Python 2.7
pandas
NumPy
SciPy
BeautifulSoup
NLTK
scikit-learn
re (part of the Python standard library)
requests

Usage

Download the Python script (genconf_word_count.py) and set the "url" variable to the link of the General Conference talk you want to analyze. Run the script; it outputs a CSV in which each row holds a word and the number of times it was used within the talk.
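To make the shape of that output concrete, here is a minimal sketch of the count-then-write-CSV step. It is not the script itself: it swaps in the standard library's collections.Counter for scikit-learn's CountVectorizer, skips the download step, and is written for Python 3. The function names, the sample sentence, the tokenizing rule, and the "word,count" header are all assumptions made for illustration.

```python
import csv
import io
import re
from collections import Counter


def count_words(text):
    """Lowercase the text, split it into words, and count each word."""
    # Assumption: a word is a run of letters, optionally with inner apostrophes.
    words = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())
    return Counter(words)


def write_counts_csv(counts, handle):
    """Write one "word,count" row per word, most frequent first."""
    writer = csv.writer(handle)
    writer.writerow(["word", "count"])  # hypothetical header row
    # Sort by descending count, then alphabetically to break ties.
    for word, count in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        writer.writerow([word, count])


# Hypothetical sample text standing in for a downloaded talk.
sample = "Faith is a gift. Faith grows when faith is used."
counts = count_words(sample)

# An in-memory buffer keeps the example self-contained; a real run would
# pass a file opened with open("counts.csv", "w", newline="").
buffer = io.StringIO()
write_counts_csv(counts, buffer)
print(buffer.getvalue())
```

Sorting by count puts the most-used words at the top of the CSV, which is usually what you want to look at first.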

I've filtered out "stop words" (common words to be ignored) using both the Natural Language Toolkit's and scikit-learn's "stop word" dictionaries.
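A rough sketch of combining two stop-word lists and filtering with the result is below. The two small sets are made-up stand-ins for the real NLTK and scikit-learn lists so the example runs without either library installed; the function names and sample counts are likewise assumptions, and the script may combine the lists differently.

```python
from collections import Counter

# Tiny stand-ins for the two real lists (assumptions, not the actual contents).
NLTK_LIKE_STOP_WORDS = {"a", "is", "the", "when"}
SKLEARN_LIKE_STOP_WORDS = frozenset({"a", "the", "of", "and"})


def combined_stop_words(*word_lists):
    """Return the union of several stop-word collections, lowercased."""
    combined = set()
    for words in word_lists:
        combined.update(word.lower() for word in words)
    return combined


def drop_stop_words(counts, stop_words):
    """Return a new Counter without any word found in stop_words."""
    return Counter({w: n for w, n in counts.items() if w not in stop_words})


stop_words = combined_stop_words(NLTK_LIKE_STOP_WORDS, SKLEARN_LIKE_STOP_WORDS)

# Hypothetical word counts for a short passage.
counts = Counter({"faith": 3, "is": 2, "the": 4, "gift": 1, "of": 1})
filtered = drop_stop_words(counts, stop_words)
print(filtered.most_common())
```

With the real libraries, the two inputs would be stopwords.words("english") from nltk.corpus (after downloading the NLTK stopwords corpus) and ENGLISH_STOP_WORDS from sklearn.feature_extraction.text. CountVectorizer also accepts a list through its stop_words parameter, so the combined list could be passed there instead of filtering afterwards.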


orangganjil/gen-conf-word-frequency
