What bit string is a character (or a short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ꧬhê§ | 11101010101001111000111010101100011010001110101010100111 | eaa78eac68eaa7 |
SJIS-WIN | ?§?¬h?§ | 00111111100000011001100000111111100000011100101001101000001111111000000110011000 | 3f81983f81ca683f8198 |
EUC-JP | ê§?¬hê§ | 1000111110101011101101001010000111111000001111111010001011001100011010001000111110101011101101001010000111111000 | 8fabb4a1f83fa2cc688fabb4a1f8 |
UTF-8 | ꧬhê§ | 11000011101010101100001010100111110000101000111011000010101011000110100011000011101010101100001010100111 | c3aac2a7c28ec2ac68c3aac2a7 |
UHC | ?§??h?§ | 001111111010000111010111001111110011111101101000001111111010000111010111 | 3fa1d73f3f683fa1d7 |
SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
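
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_row` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), similar to what the table shows, but the exact mappings for the legacy charsets may differ from this converter's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "\u00ea\u00a7h"  # "ê§h"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row(sample, codec)
        print(f"{label} | {binary} | {hexadecimal}")
```

For example, `encode_row("h", "utf-8")` returns `("01101000", "68")`, which matches the `68` byte that appears for `h` in every row of the table.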