Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think it's actually CESU-8 encoding: http://www.unicode.org/reports/tr26/

I'm guessing it's implemented like this for performance reasons and calling it utf-8 was just a marketing ploy as everyone knows we should always use utf-8 for everything...



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: