Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's probably an artifact of them lumping together all varieties/dialects of a given language. I don't speak Spanish, but I know that the R is one of the things that's different in e.g. Argentina.


I wonder if they have a large population of African French speakers in the dataset?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: