Entity World

Welcome to Entity World. I hope that the links below will help to shed some light on Character Entities used in SGML, XML, and HTML.

Please contact me with any comments, questions, or suggestions.

Jeff Beck beck@ncbi.nlm.nih.gov


  • HTML 4 Character Entities: Character entities that can be used in HTML pages. These lists will show you which characters will display in your current browser and how to tag them in your HTML.
  • ISO Character Sets: These are the ISO standard character sets that are typically used in publishing. These sets are much larger than the HTML 4 sets. They are displayed using the decimal representation of each character. [Here is a list sorted by Unicode hexadecimal number (like U039B or Λ).]
  • Unicode Character Sets: These are the Unicode character sets (defined by The Unicode Standard). These pages will show you how Unicode characters will display in your browser. The missing alphabets will be added shortly.
  • Unicode Combining Characters: There are two character sets in the Unicode standard that define "combining characters." These are characters that combine with the immediately preceding character to form a complex character. The combination may be between a keyboard letter and a combining accent or between a Unicode character and a combining accent. There is not much support for these in the browsers that we have today, but you will see some of the accents combining. Use Netscape 7 for best results.
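
As a rough illustration of the decimal and hexadecimal representations and of the combining characters mentioned in the list above, here is a minimal Python sketch. It is not taken from the linked pages; the helper name numeric_references is hypothetical, and the sketch assumes only the standard html and unicodedata modules.

```python
import html
import unicodedata


def numeric_references(ch):
    """Return the (decimal, hexadecimal) numeric character references for one character.

    Hypothetical helper, written for illustration only.
    """
    cp = ord(ch)
    return "&#{};".format(cp), "&#x{:04X};".format(cp)


# Greek capital lambda: code point 039B in hexadecimal, 923 in decimal.
dec_ref, hex_ref = numeric_references("\u039B")
print(dec_ref, hex_ref)

# html.unescape resolves both numeric forms as well as the named HTML entity.
lam = html.unescape(dec_ref)
same = lam == html.unescape(hex_ref) == html.unescape("&Lambda;")
print(lam, same)

# A combining character attaches to the character immediately before it.
# Here a plain keyboard letter is followed by COMBINING ACUTE ACCENT (U+0301).
decomposed = "e" + "\u0301"

# NFC normalization folds the pair into one precomposed character where one exists.
composed = unicodedata.normalize("NFC", decomposed)
print(len(decomposed), len(composed))

# A nonzero combining class marks a character as combining; base letters return 0.
print(unicodedata.combining("\u0301"), unicodedata.combining("e"))
```

Whether the decomposed pair is drawn as a single accented letter depends on the browser and font, which is the support limitation the last list item describes.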


Web resources: