HTML Text Encoder

This tool allows you to type in any Unicode text or Emoji and encode it into HTML.

Escaping Options

Use numeric escape
Capitalize
Skip printable ASCII

Encoded HTML

About our HTML text encoder

In HTML, there are several ways to escape Unicode characters in your HTML files.

  • Hexadecimal numeric character reference:
    &# x20AC ;
    • Hexadecimal codepoints must be 2 characters
    • Hexadecimal codepoints are case insensitive
  • For characters in the range 0x0100 to 0xFFFF you can use the following form
    \uXXXX
    • Hexadecimal codepoints must be 4 characters
    • Hexadecimal codepoints are case insensitive
  • For characters 0x10000 and above, these are known as supplementary or astral plane codepoints. Typically you would have to encode them with a surrogate pair of codepoints, but with the newer ES6 Javascript support you can use the following escape sequence:
    \u{XXXXX}

References