Unicode Converter
Convert text to Unicode code points, view UTF-8/UTF-16/UTF-32 byte representations, and look up detailed character information.
Input
Character Breakdown
| Char | Unicode | Decimal | UTF-8 (Hex) | UTF-16 (Hex) | Category |
|---|---|---|---|---|---|
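The columns of this table can be reproduced with the Python standard library. The following is a minimal sketch, not the page's actual implementation; the helper name `breakdown_row` and the dictionary keys are hypothetical, and the Category column is assumed to be the two-letter Unicode general category.

```python
import unicodedata


def breakdown_row(ch: str) -> dict:
    """Build one breakdown row for a single character (hypothetical helper)."""
    cp = ord(ch)  # raises TypeError if ch is not exactly one character
    return {
        "char": ch,
        "unicode": f"U+{cp:04X}",
        "decimal": cp,
        # One hex pair per UTF-8 byte, separated by spaces.
        "utf8_hex": ch.encode("utf-8").hex(" ").upper(),
        # Big-endian UTF-16 without a BOM, grouped into 16-bit code units.
        "utf16_hex": ch.encode("utf-16-be").hex(" ", 2).upper(),
        # Assumption: "Category" means the general category, e.g. "Lu".
        "category": unicodedata.category(ch),
    }


if __name__ == "__main__":
    for ch in "A\u4e16\U0001F600":
        print(breakdown_row(ch))
```

Grouping the UTF-16 bytes in pairs keeps each 16-bit code unit together, so a surrogate pair shows up as two four-digit groups.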
Code Points
Decode from Code Points
Supports U+XXXX, decimal, or hex (0xXX) formats
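Decoding the three accepted formats could look roughly like the sketch below. The function names are hypothetical, and splitting the input on whitespace or commas is an assumption; the page does not state which separators it accepts.

```python
def parse_code_point(token: str) -> int:
    """Parse one token written as U+XXXX, 0xXX, or plain decimal."""
    t = token.strip()
    if t.upper().startswith("U+"):
        cp = int(t[2:], 16)   # U+0041 -> 0x41
    elif t.lower().startswith("0x"):
        cp = int(t, 16)       # int() accepts the 0x prefix when base is 16
    else:
        cp = int(t, 10)       # plain decimal, e.g. 65
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError(f"code point out of Unicode range: {token!r}")
    return cp


def decode_code_points(text: str) -> str:
    """Turn a list of code points into a string.

    Assumption: tokens are separated by whitespace or commas.
    """
    tokens = text.replace(",", " ").split()
    return "".join(chr(parse_code_point(tok)) for tok in tokens)


if __name__ == "__main__":
    print(decode_code_points("U+0041 65 0x41"))
```

The range check matters because `chr` itself rejects values above 0x10FFFF, and a clear error message is more useful to the caller than the built-in one.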
About Unicode
Unicode is a universal character encoding standard that assigns a unique number (code point) to every character across all languages and symbols. It supports over 140,000 characters from 150+ writing systems.
Unicode Basics
- Code Point: Unique number for each character (U+0041 = 'A')
- Planes: Unicode divided into 17 planes (0-16)
- BMP: Basic Multilingual Plane (U+0000 to U+FFFF)
- Astral Planes: Planes 1-16, U+10000 to U+10FFFF (most emoji, rare characters)
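Because each plane spans 0x10000 code points, the plane number is simply the code point shifted right by 16 bits. A minimal sketch, with hypothetical helper names:

```python
def plane_of(ch: str) -> int:
    """Return the Unicode plane (0-16) of a single character."""
    return ord(ch) >> 16  # each plane holds 0x10000 code points


def is_bmp(ch: str) -> bool:
    """True if the character lies in the Basic Multilingual Plane."""
    return ord(ch) <= 0xFFFF


if __name__ == "__main__":
    for ch in ("A", "\u4e16", "\U0001F600"):
        print(f"U+{ord(ch):04X} plane={plane_of(ch)} bmp={is_bmp(ch)}")
```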
UTF Encodings
- UTF-8: Variable 1-4 bytes, ASCII compatible
- UTF-16: Variable 2 or 4 bytes (4 bytes = a surrogate pair for code points above U+FFFF)
- UTF-32: Fixed 4 bytes per character
- Most Common: UTF-8 (web standard)
Examples:
- 'A' = U+0041 = UTF-8: 41 = UTF-16: 0041
- '世' = U+4E16 = UTF-8: E4 B8 96 = UTF-16: 4E16
- '😀' = U+1F600 = UTF-8: F0 9F 98 80 = UTF-16: D83D DE00
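The two UTF-16 units for U+1F600 come from the standard surrogate-pair arithmetic: subtract 0x10000, then split the remaining 20 bits into a high and a low half. The sketch below works this out by hand and cross-checks it against Python's built-in codec; the function name `utf16_surrogates` is hypothetical.

```python
from typing import Tuple


def utf16_surrogates(cp: int) -> Tuple[int, int]:
    """Compute the UTF-16 surrogate pair for a code point above U+FFFF."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("only code points U+10000..U+10FFFF use surrogates")
    offset = cp - 0x10000              # 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits -> high surrogate
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits -> low surrogate
    return high, low


if __name__ == "__main__":
    cp = 0x1F600
    high, low = utf16_surrogates(cp)
    print(f"U+{cp:X} -> {high:04X} {low:04X}")

    # Cross-check against the built-in big-endian UTF-16 codec.
    expected = chr(cp).encode("utf-16-be")
    assert expected == high.to_bytes(2, "big") + low.to_bytes(2, "big")
```

For U+1F600 the offset is 0xF600, whose top 10 bits are 0x3D and bottom 10 bits are 0x200, which gives D83D and DE00 as in the example above.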