It does not make sense to have a string without knowing what encoding it uses.

In plain text files, a special byte order mark (BOM) at the start of the file can be used to indicate which Unicode encoding the file is in. In XML there is the encoding attribute in the XML declaration at the top of the document. In HTML there are content-type meta tags, et cetera. I also found out that my blog uses UTF-8, which is what WordPress generates the pages in. This is very good, since it means I can write text in a whole lot of languages that anyone can see: עברית (Hebrew), Español (Spanish), Русский (Russian).
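
To make the byte order mark idea concrete, here is a minimal Python sketch of how a program might check the first bytes of a file for a BOM before decoding it. The helper names `sniff_bom` and `decode_with_bom` are made up for this example, and falling back to UTF-8 when no BOM is found is an assumption, not a rule. The BOM constants come from Python's standard `codecs` module.

```python
import codecs

# Order matters: the UTF-32 LE mark begins with the same two bytes as the
# UTF-16 LE mark, so the longer UTF-32 marks are checked first.
BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def sniff_bom(data: bytes):
    """Return (encoding, bom_length), or (None, 0) if no BOM is present."""
    for bom, name in BOMS:
        if data.startswith(bom):
            return name, len(bom)
    return None, 0


def decode_with_bom(data: bytes, fallback: str = "utf-8") -> str:
    """Decode bytes using the BOM if there is one, else the assumed fallback."""
    encoding, skip = sniff_bom(data)
    # Skip the BOM bytes themselves so they do not end up in the text.
    return data[skip:].decode(encoding or fallback)


# A UTF-16 (little endian) sample with a BOM in front of it.
sample = codecs.BOM_UTF16_LE + "עברית Español Русский".encode("utf-16-le")
print(sniff_bom(sample))
print(decode_with_bom(sample))
```

This is only a heuristic. Many files have no BOM at all, and then the encoding still has to come from somewhere else, such as the XML declaration, an HTML meta tag, or an HTTP header.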
Unicode and character sets
For comments, please send me an email.