ja-sentence

A lightweight sentence tokenizer for Japanese.

Installation

pip install ja-sentence

Sample

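from ja_sentence.tokenizer import tokenize

# A paragraph containing several sentences ending in "!", "?", and "…".
paragraph_str = "えー!?くれるの?本当にいいの…?嬉しい!!"

# Split the paragraph into sentences.
sentence_list = tokenize(paragraph_str)

# Print each sentence on its own line.
for sentence in sentence_list:
    print(sentence)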
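
For readers curious about what a tokenizer of this kind does, here is a minimal standalone sketch of punctuation-based sentence splitting that uses only Python's built-in `re` module. It is not the implementation used by ja-sentence. The function name `split_sentences_sketch` and the set of terminators are assumptions made for illustration, and the package's actual rules and output may differ.

```python
import re

# Hypothetical terminator set (an assumption, not taken from ja-sentence):
# the Japanese full stop, full-width and ASCII "!" and "?", and the ellipsis.
_TERMINATORS = "。!?!?…"

# Match either text followed by a run of terminators, or trailing text
# that has no terminator at all.
_SENTENCE_RE = re.compile(
    rf"[^{_TERMINATORS}]*[{_TERMINATORS}]+|[^{_TERMINATORS}]+"
)


def split_sentences_sketch(paragraph: str) -> list[str]:
    """Split a paragraph after each run of sentence-ending punctuation."""
    pieces = (match.strip() for match in _SENTENCE_RE.findall(paragraph))
    # Drop pieces that were only whitespace.
    return [piece for piece in pieces if piece]


if __name__ == "__main__":
    for sentence in split_sentences_sketch("えー!?くれるの?本当にいいの…?嬉しい!!"):
        print(sentence)
```

Treating consecutive terminators as one run keeps combinations such as "!?" and "…?" attached to the sentence they close, instead of producing empty or punctuation-only fragments.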

Other languages

JavaScript -> https://github.com/Rairye/js-sentence-tokenizers