To which bit string is a character encoded in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ???B??? | 00111111001111110011111101000010001111110011111100111111 | 3f3f3f423f3f3f |
SJIS-WIN | 壯も?B訝?? | 10011010111000011000001011100000001111110100001011100110011000100011111100111111 | 9ae182e03f42e6623f3f |
EUC-JP | 壯も?B訝?? | 11010100111000111010010011100010001111110100001011101011110000110011111100111111 | d4e3a4e23f42ebc33f3f |
UTF-8 | 壯も뀾B訝덌풛 | 11100101101000111010111111100011100000101000001011101011100000001011111001000010111010001010100010011101111010111000110110001100111011011001001010011011 | e5a3afe38282eb80be42e8a89deb8d8ced929b |
UHC | 壯も뀾B訝덌풛 | 11101101111000001010101011100010100001011011010001000010111001001011100010001000111011111011111010011110 | ede0aae285b442e4b888efbe9e |
SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
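
A conversion like the one in the table can be sketched in Python as follows. This is a minimal illustration, not the code behind this page: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("も").items():
        print(f"{name} | {binary} | {hexadecimal} |")
```

For the single character も, this sketch yields `82e0` for SJIS-WIN, `a4e2` for EUC-JP, and `e38282` for UTF-8, the same byte sequences that appear for that character in the rows above.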