🔣 Character reference conversion tool

Batch conversion of characters to Unicode code points, HTML references, JavaScript escapes, URL encoding, and UTF-8 byte sequences.

Conversion Result

Conversion results will appear here when you enter text.

Reverse conversion (code point → character)

Usage and Application Examples

  • Get numeric and character references to safely display special characters in HTML
  • Check Unicode escape sequences (⌘uXXXX) in JavaScript
  • Check the contents of URL-encoded Japanese strings
  • Examine UTF-8 byte sequences of characters and use them to parse binary data
  • Reverse character lookup from code point to enter special characters

What is Character Reference Converter?

Character reference conversion is essential for web developers, content managers, and technical writers who need to represent special characters in HTML and XML documents. This free tool converts characters into multiple reference formats: Unicode code points (U+XXXX), HTML numeric references (decimal and hexadecimal), HTML character references (named entities like &), and other encoding systems. Perfect for inserting special symbols, international characters, or reserved HTML characters without breaking your code structure.

How to Use

Enter your character or paste text into the input field. The tool instantly provides conversions in all standard formats:
• HTML numeric references in decimal format (A for 'A')
• HTML numeric references in hexadecimal format (A for 'A')
• HTML character entities for recognized symbols (& for &, © for ©)
• Unicode code points (U+0041 for 'A')
You can copy any format directly to your clipboard. The tool recognizes hundreds of named HTML entities and provides visual previews to confirm character accuracy before insertion into your documents.

Use Cases

Web developers embedding copyright symbols, mathematical operators, or currency signs in HTML can use numeric references to ensure cross-browser compatibility. Content managers preparing international marketing materials for multiple regions convert characters to HTML entities that won't corrupt when exported from different editing systems. Technical documentation writers reference this tool when including special notation symbols, mathematical formulas, or international currency markers in XML or SGML documents. Email template designers use HTML numeric references for special characters since not all email clients support Unicode equally, ensuring your message displays correctly across Gmail, Outlook, and mobile clients.

Tips & Insights

Decimal numeric references (A) are more widely supported than hexadecimal (A), though both work in modern browsers. Named entities like © are concise but only work for recognized symbols—a comprehensive entity reference list is essential. Numeric references are safer than entity names when dealing with lesser-known characters. Always test character display in your actual target environments, as rendering varies by browser, device, and system fonts.

Frequently Asked Questions

What is Unicode?

Unicode is an international standard for the unified handling of the world's characters. Each character is assigned a unique number (code point), and all characters, including Japanese, English, and pictographs, can be represented in a single system.

What is a code point?

Code points are unique numbers assigned to each character in Unicode, denoted by U+ followed by a hexadecimal number. For example, "A" is U+3042 and "A" is U+0041. This tool allows you to convert from character to code point and vice versa.

What is an HTML character reference (HTML entity)?

HTML character references are a notation for safely displaying special characters within HTML. There are two types: numeric references (あ or あ) and named references (& <, etc.). This tool allows you to check both formats at the same time.

What is URL encoding?

URL encoding (percent encoding) is a method of representing characters that cannot be used in a URL by using a percent sign and a hexadecimal number. For example, "A" becomes %E3%81%82 in UTF-8. It is used to safely handle URLs containing Japanese characters.

What is a UTF-8 byte sequence?

UTF-8 is an encoding scheme that represents Unicode as a sequence of bytes: ASCII characters are represented by 1 byte, Japanese characters are usually represented by 3 bytes, and pictographs are represented by 4 bytes. This tool displays the UTF-8 byte sequence for each character in hexadecimal (0xE3 0x81 0x82, etc.).

Is the data entered secure?

Yes, it is completely secure. All conversion processes take place within your browser (client-side) and no input data is ever sent to the server. Please use with confidence.

Can I convert special characters and emoji?

Yes, the tool handles special characters, emoji, and symbols. It will display each as a Unicode code point, HTML numeric reference (&#...;), HTML character reference (&...;), and URL encoding.

What's the difference between HTML numeric and character references?

Numeric references use the character's code point (A for 'A'), while character references use predefined names like  . Some characters have character references, but all Unicode characters have numeric references.

How do I use these conversions in my HTML/CSS/JavaScript?

You can paste numeric references (A) or character references ( ) directly into HTML. In JavaScript, use Unicode escapes like \u0041, and in CSS, use \0041 or the literal character.

Why do some characters not have HTML character references?

Not all Unicode characters have standard HTML character references defined. Common characters like accented letters and emoji typically only have numeric references; the most common 252 characters have named references.

Is there a limit to the text I can convert?

The tool is browser-based and handles most reasonable input sizes efficiently. For very large documents, conversion may take slightly longer, but it's generally fast enough for typical use cases.

Can I use URL encoding results in query parameters?

Yes, URL-encoded values work well in query parameters and form submissions. The URL encoding provided is fully compatible with web standards for special character handling in URLs.