Word2Mat

In this project, we extend the idea of representing words as vectors like Word2Vec to represent them as matrices. The code is based on the Tensorflow Word2Vec tutorial found Here

File Description

The project contains the following files:

word2mat.py: Contains the code for word2mat (documented).
word2vec_old.py: Original word2vec based on the above Git repo
word2vec_poc_nce_train.py: Trains the NCE layer using existing word2vec embeddings
wor2vec_prediction_evaluation.py: Calculates word2vec accuracy using existing word2vec embeddings and trained NCE layer weights and biases

How To Run

python word2mat.py | tee outputfile.txt This will run the code which downloads text8 data and runs the word2mat model. The output will be printed to standard output as well as in the file outputfile.txt

Word2Vec

python word2vec_old.py | tee outputfile.txt This will also download text8 data and runs the word2vec model.

Word2Vec Prediction Accuracy Evaluation

To validate the word2vec model using on the word prediction task, do the following:

python word2vec_old.py After training, the embeddings will be saved as a numpy object named w2v_embeddings.npy
python word2vec_poc_nce_train.py After NCE layer training, the NCE weights and biases will be saved in numpy objects named w2v_poc_nce_weights.npy and w2v_poc_nce_bias.npy
python wor2vec_prediction_evaluation.py This will print the accuracy of the word2vec model using the trained NCE layer.

Note: If you changed the hyper parameters in word2vec_old.py, make sure to change them in word2vec_poc_nce_train.py and wor2vec_prediction_evaluation.py as well.

Credits

This project was conducted by the following students as a part of the Machine Learning Seminar in Saarland University, in Summer Semester 2018: Ayan Majumdar, Ehtisham Ali, Mossad Helali, Shahzain Mehboob. The project was supervised by Prof. Dietrich Klakow.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
Word2Mat_Final_Report.docx.pdf		Word2Mat_Final_Report.docx.pdf
mlseminar18-master.zip		mlseminar18-master.zip
word2mat.py		word2mat.py
word2vec_old.py		word2vec_old.py
word2vec_poc_nce_train.py		word2vec_poc_nce_train.py
word2vec_prediction_evaluation.py		word2vec_prediction_evaluation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Word2Mat - A New Word Representation

File Description

How To Run

Word2Mat

Word2Vec

Word2Vec Prediction Accuracy Evaluation

Credits

About

Releases

Packages

Languages

ayanmaj92/word2mat-mlseminar18

Folders and files

Latest commit

History

Repository files navigation

Word2Mat - A New Word Representation

File Description

How To Run

Word2Mat

Word2Vec

Word2Vec Prediction Accuracy Evaluation

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages