Skip to content

Latest commit

 

History

History
6 lines (4 loc) · 363 Bytes

File metadata and controls

6 lines (4 loc) · 363 Bytes

Orthographic-Languages-Similarity-Measurements

To extract similar words between Orthographic languages along with their distance by using provided corpora with the help of Longest Common Substring (LCS) using Suffix Trees and n-gram.

For more information read the report and PPT attached with the code.

Also Implementation of Porter Stemmer is also there.