Encoding Gone Bad
Table 2.5 shows an example of some “bad” encoding. (Is it true that there’s no bad encoding, just bad programmers?)
Table 2.5 Examples of Encoding Gone Bad
Original Text |
After Encoding |
Notes |
der MÜlleimer |
der Mülleimer |
Stored UTF-8 being interpreted as Latin-1. Heads Up! |
NÜrnberg |
N√˚rnberg |
Stored UTF-8 being interpreted as Latin-1. Heads Up! |
Understanding how to “get” certain characters, certain glyphs and explain how they’re represented |
Understanding how to ¿$B!F¿Bget¿$B!G¿(B certain characters, certain glyphs and explain how they_$B!G¿Bre represented |
Source file was Japanese (ISO 2022-JP) and was read in using the UTF-8 encoding. |