
We have been complaining about emoji combining and other nonsense ever since it was introduced. Nobody listened, though, so now everyone has to sanitize.

The idea isn’t crazy, sure. It is useful. But it was implemented in the wrong place. Unicode is neither RTF nor HTML. You can’t keep stuffing everything text-related into it until it cracks. It had one job: unfuck codepages and CJK. It should have stopped when that was done, without all that klingon tolkien flags bs. Emojis could live in some escape sequence standard like they did before, and language specifics could be SGML-like tags. We don’t have “just text” with all that either way, so what was the point of spoiling the only raw text standard with it? It became a format with binary tags, complex and unpredictable as hell.
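For anyone wondering what "binary tags" and "sanitize" look like in practice, here is a minimal Python sketch. It assumes the usual encoding of the England flag (a black flag followed by invisible tag characters from the U+E0000 block) and a three-person family joined with ZWJ. The `strip_tags` helper is a hypothetical name made up for illustration, not something from a library, and it is not a complete sanitizer.

```python
# Tag characters live in U+E0000..U+E007F and render as nothing on their own.
TAG_BLOCK = range(0xE0000, 0xE0080)


def strip_tags(text: str) -> str:
    """Hypothetical helper: drop every code point in the Unicode Tags block."""
    return "".join(ch for ch in text if ord(ch) not in TAG_BLOCK)


# England flag: U+1F3F4, then tag letters "gbeng", then the cancel tag U+E007F.
england = "\U0001F3F4\U000E0067\U000E0062\U000E0065\U000E006E\U000E0067\U000E007F"

# Family emoji: man, ZWJ, woman, ZWJ, girl.
family = "\U0001F468\u200D\U0001F469\u200D\U0001F467"

print(len(england))              # 7 code points for one visible flag
print(len(family))               # 5 code points for one visible glyph
print(len(strip_tags(england)))  # 1, only the plain black flag is left
```

Note that stripping the tags also changes what the flag means, which is kind of the point: the "text" carries structure you can't see.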



> without all that klingon tolkien

Can you elaborate? I just searched and can't find anything related to Klingon or Tolkien in Unicode... I definitely agree that would go too far. But has it, am I missing something?


I was speaking from memory. It seems Klingon was rejected, and Cirth/Tengwar is a 2023 proposal, so it is fresh and up for consideration again. https://www.unicode.org/roadmaps/smp/#:~:text=160,(Tengwar)

Anyways, I stand corrected on this.



