
Take ID3 tags as an example: if you are mass-converting or sanitizing titles and similar metadata, the strings are often very short, sometimes only a single codepoint or character. You also cannot rely on any assumption about the encoding, because so many people violate the spec and put whatever they want in there, which is the whole point of wanting to sanitize the info in the first place.
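To make that concrete, here is a rough Python sketch of what a sanitizer ends up doing with such a field. The function name `decode_tag_text` and the fallback order are my own assumptions, not anything the ID3 spec prescribes: it checks for a BOM, then tries strict decodes in a fixed order and reports which one it settled on.

```python
from typing import Sequence, Tuple


def decode_tag_text(
    raw: bytes,
    fallbacks: Sequence[str] = ("utf-8", "cp1252", "latin-1"),
) -> Tuple[str, str]:
    """Return (text, encoding_used) for a raw tag field.

    Hypothetical helper: the fallback order is a guess, not a rule.
    """
    # A byte order mark is the only strong in-band hint, so check it first.
    if raw.startswith(b"\xef\xbb\xbf"):
        return raw[3:].decode("utf-8", errors="replace"), "utf-8-sig"
    if raw.startswith((b"\xff\xfe", b"\xfe\xff")):
        # The utf-16 codec reads the BOM to pick the byte order.
        return raw.decode("utf-16", errors="replace"), "utf-16"

    # Otherwise take the first strict decode that succeeds.
    for enc in fallbacks:
        try:
            return raw.decode(enc), enc
        except UnicodeDecodeError:
            continue

    # latin-1 maps every byte to a codepoint, so this never fails.
    return raw.decode("latin-1"), "latin-1"


text, used = decode_tag_text(b"\xe9")
print(repr(text), used)
```

With a one-byte field like `b"\xe9"` this picks cp1252 and gives "é", but the same byte is "й" in cp1251, and nothing in a string that short tells you which was meant. The order of the fallbacks is doing all the work.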

