What are some commonly encountered problems with encodings?
Surprise – it’s UTF-8!By far the most common problem people have is that they didn’t expect the parsing process to translate their data into UTF-8. Whether this is an actual problem or merely a perceived problem is the subject of some debate. Sure, you may need to change the encoding when you output your data, but doing all your processing in Unicode will lead to less pain in the long term. The preceding section gives you plenty of ways to deal with UTF-8.