I've long been interested in comprehensible input and specifically what it is about comprehensible input that even makes in comprehensible in the first place. So I decided to combine my statistics skills and my obsession as a Japanese learner to try to find some answers. I decided to scrape https://cijapanese.com which is a comprehensible input platform for Japanese learners similar to DreamingSpanish and analyze the subtitles to look for patterns there.
You can check out the results of the interactive analysis here: https://cij-analysis.streamlit.app/
Most of the graphs are clickable and you can also get access to the code and data here: https://github.com/joshdavham/cij-analysis
Hopefully this will be interesting to some of y'all!
by joshdavham