To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????^ | 001111110011111100111111001111110011111101011110 | 3f3f3f3f3f5e |
SJIS-WIN | 邏頑。√@^ | 11100111101101001000101011100110101000011000000111100011100000011001011101011110 | e7b48ae6a181e381975e |
EUC-JP | 邏頑。√@^ | 1110111010110110101101001110100010001110101000011010001011100101101000011111011101011110 | eeb6b4e88ea1a2e5a1f75e |
UTF-8 | 邏頑。√@^ | 11101001100000101000111111101001101000001001000111101111101111011010000111100010100010001001101011101111101111001010000001011110 | e9828fe9a091efbda1e2889aefbca05e |
UHC | 邏頑?√@^ | 11010101101001001110100011010111001111111010000111101110101000111100000001011110 | d5a4e8d73fa1eea3c05e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)