
A year or two ago I ran into the fact that classic Tcl still only supports UCS-2 as its internal text representation. In particular, Unicode characters outside the Basic Multilingual Plane (BMP), like many popular emoji, are not supported.

It can convert to and from UTF-8, but any code points outside the BMP get mangled (each byte of the UTF-8 sequence is treated as an independent code point), and there's really no way to detect from within Tcl that this happened. There's nothing you can do about it.
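
To make the mangling concrete, here's a rough Python simulation of the behavior as I've described it. This is not Tcl's actual code, and `mangle_non_bmp` and `has_non_bmp` are names I made up for illustration. It also shows why you can't detect the damage afterwards: the mangled string contains only ordinary BMP code points.

```python
def mangle_non_bmp(text: str) -> str:
    """Simulate the described failure (an assumption, not Tcl's real code):
    each code point above U+FFFF is replaced by one code point per UTF-8 byte.
    BMP characters pass through unchanged."""
    out = []
    for ch in text:
        if ord(ch) > 0xFFFF:
            # 4-byte UTF-8 sequence becomes 4 separate code points
            out.extend(chr(b) for b in ch.encode("utf-8"))
        else:
            out.append(ch)
    return "".join(out)


def has_non_bmp(text: str) -> bool:
    """True if any code point lies outside the Basic Multilingual Plane."""
    return any(ord(ch) > 0xFFFF for ch in text)


sample = "hi \U0001F600"          # U+1F600 is outside the BMP
mangled = mangle_non_bmp(sample)

print([hex(ord(c)) for c in mangled])
print(has_non_bmp(sample), has_non_bmp(mangled))
```

The only place a check like `has_non_bmp` helps is on the way in, before the text is handed to the interpreter. Once the conversion has happened, the result looks like any other string that happens to contain those four Latin-1 range characters.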

I hoped it didn't, but unfortunately it looks like Jim Tcl has this same problem: http://jim.tcl.tk/index.html/doc/www/www/documentation/utf8/

(Also, Jim Tcl was originally written by antirez!)


