Orthographic-Languages-Similarity-Measurements

To extract similar words between Orthographic languages along with their distance by using provided corpora with the help of Longest Common Substring (LCS) using Suffix Trees and n-gram.

For more information read the report and PPT attached with the code.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.vscode		.vscode
Dice stats		Dice stats
Filtered		Filtered
LCS Stats		LCS Stats
Miscellaneous		Miscellaneous
Porter Stemmer		Porter Stemmer
Stat Images		Stat Images
Unfiltered		Unfiltered
Words List		Words List
n-gram similarity stats		n-gram similarity stats
LCS.ipynb		LCS.ipynb
LCS.py		LCS.py
Preproceesing.py		Preproceesing.py
README.md		README.md
dice.py		dice.py
n-gram_similarity.py		n-gram_similarity.py
nlp_ppt.pdf		nlp_ppt.pdf
nlp_ppt.pptx		nlp_ppt.pptx
nlp_report.pdf		nlp_report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Orthographic-Languages-Similarity-Measurements

Also Implementation of Porter Stemmer is also there.

About

Releases

Packages

Languages

kartikey-singh/Orthographic-Languages-Similarity-Measurements

Folders and files

Latest commit

History

Repository files navigation

Orthographic-Languages-Similarity-Measurements

Also Implementation of Porter Stemmer is also there.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages