To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
SJIS-WIN | 蔘¥醜腴?処 | 1110010011110011100000011000111110001111010110001110010001001010001111111000111110001000 | e4f3818f8f58e44a3f8f88 |
EUC-JP | 蔘¥醜腴鍿処 | 11101000111101011010000111101111101111011011100111100111101010111000111111100101101000111011110111101000 | e8f5a1efbdb9e7ab8fe5a3bde8 |
UTF-8 | 蔘¥醜腴鍿処 | 111010001001010010011000111011111011111110100101111010011000011010011100111010001000010110110100111010011000110110111111111001011000011110100110 | e89498efbfa5e9869ce885b4e98dbfe587a6 |
UHC | 蔘¥醜??? | 110111111011100010100001110011011111010111011101001111110011111100111111 | dfb8a1cdf5dd3f3f3f |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)