hi everybody.
I have been writing a module to bring yomichan/morphman style features to emacs ([https://github.com/dmgerman/yomikun](https://github.com/dmgerman/yomikun) \–open source)
So far I am able to do tokenization (including compounds), status tracking and dictionary. Though, it is still work in progress.
I want to add pitch accent support. But I have not been very successful at finding info on how to do it.
Does anybody know of a good source of information for how to do it?
1. Given a sentence, how the pitch accent of each term is determined
2. How/where to find the data necessary
I see there is an NHK pitch accent csv file floating around with pitch info. But I have not been able to find documentation about it.
thank you.
4 comments
Have you tried downloading an anki add on and seeing how they do it? For example you could download the migaku add on and check how it looks it up.
https://ankiweb.net/shared/info/278530045
why do you need pitch accent in emacs?
Not sure if there is already an open-source resource for determining pitch accent in sentences. [This](https://www.gavo.t.u-tokyo.ac.jp/ojad/eng/phrasing/index%5D is pretty much the best tool online to get pitch accent on a sentence level. Maybe look into whether their rules/engine is acccessible somewhere.
You can find some pitch info accent in the Kanjium GitHub repo:
https://github.com/mifunetoshiro/kanjium
It’s not comprehensive, but it’s got enough common words to be useful.