The HTML Document Character Set - iso 8859-1



(  Introduction Basic Codes Supplementary Codes Notes
    ¯
  )

 
INTRODUCTION

This is a reference for the HTML Document Character Set (liberally "borrowed" without permission from http://www.natural-innovations.com/boo/doc-charset.html). In addition to the standard ISO-8859-1 (Latin-1) character repertoire, the original author has included a selection of Unicode characters. This is designed as a look-up reference for HTML authors. If you are looking for a browser check, see Ian Graham's entity test page.

New: Characters above #160 (the "#" means the number that follows is in base ten, not eight or sixteen, for example) are sorted by function (alphabetic, diacritic, punctuation, and math/currency), and 63 GIFs are provided for the characters not in the {union set} of {ISO-8859-1} and {Macintosh} (U.S.) character sets.

If you already have the Chinese or Japanese character, but its rendering is wrong, then get the Unicode of Chinese and Japanese characters and phrases at this page. It also works for PinYin, accents and all!

If you have additional questions, see the original author's Q & A page. Due to the number of cells in this page, it may take some time to render. Be patient; don't panic!


 
BASIC CHARACTER SET
Decimal Code Char Description Entity Notes PostScript Name
� -    Unused      
	   Horizontal tab      

   Line feed      
 -    Unused      

   Carriage Return      
 -    Unused      
    Space  
  space
! ! Exclamation mark     exclam
" " Quotation mark "   quotedbl
# # Number sign     numbersign
$ $ Dollar sign     dollar
% % Percent sign     percent
& & Ampersand &   ampersand
' ' Apostrophe     quotesingle
( ( Left parenthesis     parenleft
) ) Right parenthesis     parenright
* * Asterisk     asterisk
+ + Plus sign     plus
, , Comma     comma
- - Hyphen     hyphen
. . Period (fullstop)     period
/ / Forward slash     slash
0 - 9 0 - 9 Decimal digits      
: : Colon     colon
&#59; ; Semi-colon     semicolon
&#60; < Less than &lt;   less
&#61; = Equals sign     equal
&#62; > Greater than &gt;   greater
&#63; ? Question mark     question
&#64; @ at sign     at
&#65; - &#90; A - Z Uppercase letters      
&#91; [ Left square bracket     bracketleft
&#92; \ Backward slash     backslash
&#93; ] Right square bracket     bracketright
&#94; ^ Caret     asciicircum
&#95; _ Horizontal bar     underscore
&#96; ` Grave accent     grave
&#97; - &#122; a - z Lowercase letters      
&#123; { Left curly brace     braceleft
&#124; | Vertical bar     bar
&#125; } Right curly brace     braceright
&#126; ~ Tilde     asciitilde
&#127; - &#159;   Unused (truly!)      
&#160;   Nonbreaking space &nbsp;   nbspace
nobreakspace
&#192; À Capital A, grave accent &Agrave;   Agrave
&#224; à Small a, grave accent &agrave;   agrave
&#193; Á Capital A, acute accent &Aacute;   Aacute
&#225; á Small a, acute accent &aacute;   aacute
&#194; Â Capital A, circumflex accent &Acirc;   Acircumflex
&#226; â Small a, circumflex accent &acirc;   acircumflex
&#195; Ã Capital A, tilde &Atilde;   Atilde
&#227; ã Small a, tilde &atilde;   atilde
&#196; Ä Capital A, dieresis or umlaut mark &Auml;   Adieresis
&#228; ä Small a, dieresis or umlaut mark &auml;   adieresis
&#197; Å Capital A, ring &Aring;   Aring
&#229; å Small a, ring &aring;   aring
&#198; Æ Capital AE dipthong (ligature) &AElig;   AE
&#230; æ Small ae dipthong (ligature) &aelig;   ae
&#199; Ç Capital C, cedilla &Ccedil;   Ccedilla
&#231; ç Small c, cedilla &ccedil;   ccedilla
&#208; [GIF] Capital Eth, Icelandic &ETH; c1 Eth
&#240; [GIF] Small eth, Icelandic &eth; c1 eth
&#200; È Capital E, grave accent &Egrave;   Egrave
&#232; è Small e, grave accent &egrave;   egrave
&#201; É Capital E, acute accent &Eacute;   Eacute
&#233; é Small e, acute accent &eacute;   eacute
&#202; Ê Capital E, circumflex accent &Ecirc;   Ecircumflex
&#234; ê Small e, circumflex accent &ecirc;   ecircumflex
&#203; Ë Capital E, dieresis or umlaut mark &Euml;   Edieresis
&#235; ë Small e, dieresis or umlaut mark &euml;   edieresis
&#204; Ì Capital I, grave accent &Igrave;   Igrave
&#236; ì Small i, grave accent &igrave;   igrave
&#205; Í Capital I, acute accent &Iacute;   Iacute
&#237; í Small i, acute accent &iacute   iacute
&#206; Î Capital I, circumflex accent &Icirc;   Icircumflex
&#238; î Small i, circumflex accent &icirc;   icircumflex
&#207; Ï Capital I, dieresis or umlaut mark &Iuml;   Idieresis
&#239; ï Small i, dieresis or umlaut mark &iuml;   idieresis
&#181; µ Micro sign &micro;   mu
&#209; Ñ Capital N, tilde &Ntilde;   Ntilde
&#241; ñ Small n, tilde &ntilde;   ntilde
&#210; Ò Capital O, grave accent &Ograve;   Ograve
&#242; ò Small o, grave accent &ograve;   ograve
&#211; Ó Capital O, acute accent &Oacute;   Oacute
&#243; ó Small o, acute accent &oacute;   oacute
&#212; Ô Capital O, circumflex accent &Ocirc;   Ocircumflex
&#244; ô Small o, circumflex accent &ocirc;   ocircumflex
&#213; Õ Capital O, tilde &Otilde;   Otilde
&#245; õ Small o, tilde &otilde;   otilde
&#214; Ö Capital O, dieresis or umlaut mark &Ouml;   Odieresis
&#246; ö Small o, dieresis or umlaut mark &ouml;   odieresis
&#216; Ø Capital O, slash &Oslash;   Oslash
&#248; ø Small o, slash &oslash;   oslash
&#223; ß Small sharp s, German (sz ligature) &szlig;   germandbls
&#222; [GIF] Capital THORN, Icelandic &THORN; c1 Thorn
&#254; [GIF] Small thorn, Icelandic &thorn; c1 thorn
&#217; Ù Capital U, grave accent &Ugrave;   Ugrave
&#249; ù Small u, grave accent &ugrave;   ugrave
&#218; Ú Capital U, acute accent &Uacute;   Uacute
&#250; ú Small u, acute accent &uacute;   uacute
&#219; Û Capital U, circumflex accent &Ucirc;   Ucircumflex
&#251; û Small u, circumflex accent &ucirc;   ucircumflex
&#220; Ü Capital U, dieresis or umlaut mark &Uuml;   Udieresis
&#252; ü Small u, dieresis or umlaut mark &uuml;   udieresis
&#221; [GIF] Capital Y, acute accent &Yacute; c1 Yacute
&#253; [GIF] Small y, acute accent &yacute; c1 yacute
&#255; ÿ Small y, dieresis or umlaut mark &yuml;   ydieresis
&#168; ¨ Umlaut &uml;   dieresis
&#175; ¯ Macron accent &macr;   macron
&#180; ´ Acute accent &acute;   acute
&#184; ¸ Cedilla &cedil;   cedilla
&#161; ¡ Inverted exclamation &iexcl;   exclamdown
&#191; ¿ Inverted question mark &iquest;   questiondown
&#183; · Middle dot &middot;   periodcentered
&#166; [GIF] Broken vertical bar &brvbar; c1 brokenbar
&#171; « Left angle quote &laquo;   guillemotleft
&#187; » Right angle quote &raquo;   guillemotright
&#182; Paragraph sign &para;   paragraph
&#167; § Section sign &sect;   section
&#169; © Copyright &copy;   copyright
copyrightserif
&#174; ® Registered trademark &reg;   registered
registerserif
&#185; [GIF] Superscript one &sup1; c1 onesuperior
&#178; [GIF] Superscript two &sup2; c1 twosuperior
&#179; [GIF] Superscript three &sup3; c1 threesuperior
&#173; ­ Soft hyphen &shy; e1 minus(?)
&#215; [GIF] Multiply sign &times; c1 multiply
&#247; ÷ Division sign &divide;   divide
&#188; [GIF] Fraction one-fourth &frac14; c1 onequarter
&#189; [GIF] Fraction one-half &frac12; c1 onehalf
&#190; [GIF] Fraction three-fourths &frac34; c1 threequarters
&#170; ª Feminine ordinal &ordf;   ordfeminine
&#186; º Masculine ordinal &ordm;   ordmasculine
&#172; ¬ Not sign &not;   logicalnot
&#176; ° Degree sign &deg;   degree
&#177; ± Plus or minus &plusmn;   plusminus
&#164; ¤ General currency sign &curren;   currency
&#162; ¢ Cent sign &cent;   cent
&#163; £ Pound sterling &pound;   sterling
&#165; ¥ Yen sign &yen;   yen
 
SUPPLEMENTARY CHARACTER SET
(Unicode UCS-2 is "charset=UNICODE-1-1")
Decimal Code Char Description Entity Notes PostScript Name
&#916; [GIF] delta &Delta; e0 Delta
&#402; [GIF] florin &fnof; e0 florin
&#937; [GIF] omega &Omega; e0 Omega
&#338; [GIF] OE ligature &OElig; e0 OE
&#339; [GIF] oe ligature &oelig; e0 oe
&#352; [GIF] Scaron &Scaron; e0 Scaron
&#353; [GIF] scaron &scaron; e0 scaron
&#376; [GIF] Ydieresis &Yuml; e0 Ydieresis
&#305; [GIF] dotless i &inodot; e0 dotlessi
&#710; [GIF] circumflex &circ; e0 circumflex
&#711; [GIF] caron &caron; e0 caron
&#728; [GIF] breve &breve; e0 breve
&#729; [GIF] dot accent &dot; e0 dotaccent
&#730; [GIF] ring &ring; e0 ring
&#731; [GIF] ogonek &ogon; e0 ogonek
&#732; [GIF] tilde &tilde; e0 tilde
&#733; [GIF] double acute accent &dblac; e0 hungarumlaut
&#8211; [GIF] en dash &ndash; e1 endash
&#8212; [GIF] em dash &mdash; e1 emdash
&#8224; [GIF] dagger &dagger; e0 dagger
&#8225; [GIF] double dagger &Dagger; e0 daggerdbl
&#8226; [GIF] bullet &bull; e0 bullet
&#8230; [GIF] ellipsis &hellip; e0 ellipsis
&#8216; [GIF] quote left &lsquo; e0 quoteleft
&#8217; [GIF] quote right &rsquo; e0 quoteright
&#8218; [GIF] quote single base &lsquor; e0 quotesinglbase
&#8220; [GIF] quote double left &ldquo; e0 quotedblleft
&#8221; [GIF] quote double right &rdquo; e0 quotedblright
&#8222; [GIF] quote double base &ldquor; e0 quotedblbase
&#8249; [GIF] guille single left &lsaquo; e? guilsinglleft
&#8250; [GIF] guille single right &rsaquo; e? guilsinglright
&#8482; [GIF] trademark, TM &trade; e1 trademark
trademarkserif
&#8480; [GIF] service mark, SM      
&#8471; [GIF] sound recording copyright, (P)      
&#8730; [GIF] radical &radic; e0 radical
&#8734; [GIF] infinity &infin; e0 infinity
&#8747; [GIF] integral &int; e0 integral
&#8706; [GIF] partial differential &part; e0 partialdiff
&#8773; [GIF] approximately equal &ap; e0 approxequal
&#8800; [GIF] not equal &ne; e0 notequal
&#8804; [GIF] less than or equal &le; e0 lessequal
&#8805; [GIF] greater than or equal &ge; e0 greaterequal
&#931; [GIF] summation &sum; e0 summation
&#8240; [GIF] per thousand (mille) &permil; e0 perthousand
&#8260; [GIF] fraction separator     fraction
&#8719; product     product
&#960; π pi &pi; e0 pi
&#9674; lozenge (diamond)     lozenge
&#8355; franc     franc
&#8984; [GIF] command key      
&#8997; [GIF] option key      
&#9774; [GIF] peace symbol      
&#9775; [GIF] yin yang      

 
NOTES
c1 Character not in charset="macintosh" (x-mac-roman)
c2 Character not in charset="xxxxxx" (What is Windows' charset name?)
c3 Character not in charset="yyyyyy" (What is Unix's charset name?)
.
e? Entity proposed for ISO-8879 but not yet approved
e0 Entity from ISO-8879 (not yet recognized by HTML agents)
e1 Entity not recognized by Netscape 1.1
e2 Entity not recognized by NCSA Mosaic 2.0 (I need info...)
e3 Entity not recognized by Lynx 2.4 (I need info...)
h2 Introduced in HTML 2.0 (RFC 1866)
h3 Proposed for HTML 3.0
The supplementary characters use Unicode decimal values. Although current browsers do not yet support Unicode, this is the direction in which HTML appears to be heading.

Since Unicode is designed for information exchange rather than typography, there is only one glyph per character (although the same character may appear in more than one code page). Unicode is not a font technology. Adobe has developed CID fonts to handle large character sets (such as in Far East languages); Bitstream has developed the Cyberbit font (for 8,500 Unicode characters); and Apple's QuickDraw GXfont technology supports multiple glyphs per character (such as swash capitals, superscripts and subscripts, true small caps, true ligatures, etc.). For more information about fonts on the Web, see the W3C page fonts and the Web.

 
Top of page