Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
mark-r
on April 30, 2024
|
parent
|
context
|
favorite
| on:
You can't just assume UTF-8
Unfortunately Windows code page 1252 has no invalid bytes, so it will always succeed. You'd better try that one last.
Dwedit
on April 30, 2024
[–]
81, 8D, 8F, 90, 9D are invalid.
layer8
on April 30, 2024
|
parent
[–]
These are actually interpreted as the corresponding C1 control codes by Windows, so arguably not invalid in practice, just formally reserved to be reassigned to other characters in the future.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: