I may be wrong about OP's intention, but AFAICT, because no encoding is specified, the client gets to choose. For someone whose default encoding isn't a superset of ASCII (like ISO-2022-KR), the page appears as a �.
Current practice is to include a meta tag declaring your encoding, use a Unicode BOM, or, less favorably, send the charset parameter in the Content-Type header.
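For what it's worth, here is a rough Python sketch of the three ways of declaring an encoding mentioned above. `build_response` and its parameters are names made up for illustration, not part of any library, and the sketch assumes a UTF-8 page.

```python
import codecs


def build_response(body_text, encoding="utf-8", use_bom=False):
    """Hypothetical helper: return (headers, payload) for an HTML page
    that declares its encoding instead of leaving the client to guess."""
    html = (
        "<!DOCTYPE html>\n"
        # 1. In-document declaration: a meta tag naming the encoding.
        f'<html><head><meta charset="{encoding}"><title>demo</title></head>\n'
        f"<body>{body_text}</body></html>\n"
    )
    payload = html.encode(encoding)

    # 2. Optional Unicode BOM at the very start of the byte stream
    #    (only handled for UTF-8 in this sketch).
    if use_bom and encoding.lower() == "utf-8":
        payload = codecs.BOM_UTF8 + payload

    # 3. charset parameter on the HTTP Content-Type header.
    headers = {"Content-Type": f"text/html; charset={encoding}"}
    return headers, payload


headers, payload = build_response("café", use_bom=True)
print(headers["Content-Type"])
print(payload[:3])
```

Without any of these, the client has to guess: the same UTF-8 bytes for "café" read as Latin-1 come out as "cafÃ©".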