deep_learning_class_project - Document Classification

*** Instructions for Running ***:

Put the original "train-balanced-sarcasm.csv" file in a folder named "sarcasm_data" in the parent folder of this project.
Run the main.py with pipeline(is_first_run=True, train_with_real_data=True, epochs=10)--> This will generate new files with preprocessed/cleaned comments and everything you need for data transformation + will train a basic NN model and predict on the validation data

If you pass train_with_real_data=False, you will use the validation file as training file and you have to create a small sample test file and pass it as an argument to pipeline(). For example:

pipeline(is_first_run=False, train_with_real_data=False,
         epochs=10, sample_test_file=data_prep.root_sarcasm_data_dir + "small_train.csv")

In the main.py: create_model(input_size) build a NN model. Currently it is a simple Feed Forward network with 1 hidden layer.

Also, we have a naive_bayes_pipeline that you can run to do a simple and quick test that the data are correct.

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
embedding		embedding
.gitignore		.gitignore
README.md		README.md
attention.py		attention.py
dataset_analysis.py		dataset_analysis.py
embeddings.py		embeddings.py
embs.py		embs.py
eval_methods.py		eval_methods.py
features_methods.py		features_methods.py
glove_yelp_vis.png		glove_yelp_vis.png
glove_yelp_vis_editted.png		glove_yelp_vis_editted.png
main.py		main.py
main_old.py		main_old.py
model.py		model.py
new_dataset_train_lengths.png		new_dataset_train_lengths.png
plot.png		plot.png
plot2.png		plot2.png
preprocess.py		preprocess.py
process_results.py		process_results.py
sample_yelp_reviews.py		sample_yelp_reviews.py
train_methods.py		train_methods.py
utils.py		utils.py
wordcloud_1K_reviews.png		wordcloud_1K_reviews.png
yelp_train_no_stopwords.png		yelp_train_no_stopwords.png
yelp_train_with_stopwords.png		yelp_train_with_stopwords.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

deep_learning_class_project - Document Classification

About

Releases

Packages

Contributors 3

Languages

besitocat/deep_learning_class_project

Folders and files

Latest commit

History

Repository files navigation

deep_learning_class_project - Document Classification

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages