To me, that's a design flaw. Would we really be any worse off if we simply decla...

ChrisSD · on April 14, 2020

Sure we could declare that but then what? Non-unicode filenames won't suddenly disappear. Operating systems won't suddenly enforce unicode. Filesystems will still allow non-unicode names.

Simply declaring it doesn't help anybody. In the meantime your application still needs to handle non-unicode filenames otherwise those malicious ones are free to be malicious.

PeterisP · on April 14, 2020

I'd assume that the proper place for defining what's a valid filename would be on the filesystem level, so a filesystem of standard ABC v123 would not allow non-unicode names; so non-unicode filenames would either get refused or modified upon copying/writing them to the filesystem.

This is not new, this would match the current behavior of the OS/filesystem enforcing other character restrictions such as when writing (for example) a file name with an asterisk or colon to a FAT32 USB flash drive.

AnIdiotOnTheNet · on April 14, 2020

If unicode had a set of "explictly this byte" codepoints, it should be simple to deal with, just pass the invalid bytes of the filename in that way.

ygra · on April 19, 2020

Unicode deals with text, so such a set of codepoints is a non-starter, anyway.

mark-r · on April 14, 2020

Once you lose the expectation of being able to work with non-unicode filenames, those files will quickly get renamed and cease to be a problem.

ChrisSD · on April 14, 2020

How can you rename them if you can only use unicode paths?

mark-r · on April 14, 2020

You would need to use some special utility created just for that purpose.

bjourne · on April 14, 2020

As long as the tool for renaming files handles non-utf8 filenames you'd be fine.