What’s the easy way to split Japanese text?

Is there a convenient tool (website, command-line, javascript or Python library) that can parse Japanese enough to split it into individual words?

3 comments
  1. If you google “Japanese tokenizers” you’ll get a fair number of results. I personally know of cabocha (which is actually a dependency parser) but there are many.

Leave a Reply
You May Also Like