
What is your source dictionary to compare to? Seems kind of small. Also, how are you handling inflected forms?


https://github.com/words/an-array-of-english-words

using this: it's a combo of "covers enough" for the bit and easy to use

also, since i'm tracking every word (technically a better name for this project would be The Bluesky Corpus), all inflected forms count as different words, which aligns with my thinking
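
For anyone wondering what "all inflected forms are different words" looks like in practice, here is a minimal Python sketch. It is not the project's actual code: the tiny `DICTIONARY` set is a hypothetical stand-in for the linked word list, and the tokenizer (lowercase, ASCII letters only) and the names `tokenize` and `record_seen` are assumptions made for illustration.

```python
import re

# Hypothetical stand-in for the full word list linked above.
# Note that "run", "runs", "running", and "ran" are separate entries.
DICTIONARY = {"run", "runs", "running", "ran", "word", "words"}


def tokenize(text: str) -> list[str]:
    """Lowercase the text and extract runs of ASCII letters (an assumed rule)."""
    return re.findall(r"[a-z]+", text.lower())


def record_seen(text: str, seen: set[str]) -> None:
    """Mark each dictionary word found in `text` as seen.

    No stemming or lemmatization is applied, so each inflected form
    is only counted when that exact form appears.
    """
    for token in tokenize(text):
        if token in DICTIONARY:
            seen.add(token)


seen: set[str] = set()
record_seen("She runs while I was running. Words!", seen)

# Words in the dictionary that have not shown up yet.
remaining = DICTIONARY - seen
```

With this approach, seeing "runs" and "running" does nothing for "run" or "ran"; each form has to appear on its own before it is ticked off.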



