That ship has largely sailed. UTF-8/UTF-16 will be around for a long time to come. It's encoded into data that is archived. It's built into practically everything we use today. It's reasonably space/transmission efficient. It's standardized across many locales. Of course, you can use all the bytes in memory that you want to. Some languages even do!
My mind was blown when I found out that there are invalid utf-8 sequences. I was then impressed to find out that some exploits started out on this premise against software that didn't understand/protect against this. What a mess indeed.
That ship has largely sailed. UTF-8/UTF-16 will be around for a long time to come. It's encoded into data that is archived. It's built into practically everything we use today. It's reasonably space/transmission efficient. It's standardized across many locales. Of course, you can use all the bytes in memory that you want to. Some languages even do!
My mind was blown when I found out that there are invalid utf-8 sequences. I was then impressed to find out that some exploits started out on this premise against software that didn't understand/protect against this. What a mess indeed.