Tag Archives: Unicode

Watch Out for the Byte Order Mark (BOM) In Linux

Windows applications often (inappropriately) insert a BOM in UTF-8 encoded files, causing pain and suffering for Linux users. Here’s how to take care of that in Python. Continue reading

Posted in Python | Tagged , , , , | Leave a comment

How to Insert Unicode Characters in Ubuntu and Windows

Use the Character Map, Keyboard Shortcuts, or an application. Continue reading

Posted in Ubuntu | Tagged , , | 1 Comment

A Short Character Table

My character table contains HTML Entities, Glyphs, Names, and Descriptions for selected characters. Continue reading

Posted in Web Development | Tagged , | Leave a comment

en-dash HTML Encoding Bug Fixed With Python

The en-dash is one of the more common causes of HTML encoding bugs. Read how I ran into this bug and how I fixed it with Python. Also a few more tips in here. Continue reading

Posted in Python | Tagged , , | Leave a comment

Introduction to Character Sets and Unicode in Python

A summary of the basics about character encoding. I’ve caused my fair share of character encoding bugs, and these are my notes to prevent those problems in the future. Continue reading

Posted in Python | Tagged , | Leave a comment