Predicting Similar Questions

The objective of this analysis is to use different Natural Language Processing methods to predict if pairs of questions have the same meaning (semantic similarity). The data is from Quora and hosted on Kaggle as Question Pairs Dataset. There are over 400,000 lines of potential question duplicate pairs. Each line contains IDs for each question in the pair, the full text for each question, and a binary value that indicates whether the line truly contains a duplicate pair.

The sections of this analysis include:

Transforming the text
Method 1: TfidfVectorizer
Method 2: Doc2Vec
Method 3: LSTM

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
Semantic Similarity.ipynb		Semantic Similarity.ipynb
lstm.ipynb		lstm.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting Similar Questions

About

Releases

Packages

Languages

parakh-gupta/Quora_question_pair_semantic_similarity

Folders and files

Latest commit

History

Repository files navigation

Predicting Similar Questions

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages