This project is a bot that generates Reddit comments for several subreddits, including the following:
-
reddit.com
-
...list to be updated.
The comments are generated by using an n-gram learning model over billions of comments from Reddit dating back to 2005. The dataset was pulled from a publicly available API and filtered prior to being used with the project, which only selected certain criteria to focus on when generating comments. The several tens of GBs of training data was used to develop a varied (albeit unstructured) model for generating comments in English.
The following external libraries are required to run this project:
-
pipetools
-
tqdm
-
simplejson
More details will be made available as the project progresses.
The Reddit Commentor™ is created by Shantanu Gore and Chirag Bharadwaj. They originally made it for their final project in CS 4701: Foundations of Artificial Intelligence at Cornell University in Spring 2017.