To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??Þ?? | 0011111100111111110111100011111100111111 | 3f3fde3f3f |
SJIS-WIN | 懿翁?漿野 | 100111001111001010001001101001010011111110011111111101111001011011101100 | 9cf289a53f9ff796ec |
EUC-JP | 懿翁Þ漿野 | 1101100011110100101100101010011110001111101010011011000011011110111110011100110011101110 | d8f4b2a78fa9b0def9ccee |
UTF-8 | 懿翁Þ漿野 | 1110011010000111101111111110011110111111100000011100001110011110111001101011110010111111111010011000011110001110 | e687bfe7bf81c39ee6bcbfe9878e |
UHC | 懿翁Þ漿野 | 11101011111100111110100010111010101010001010110111101101111011001110010110101111 | ebf3e8baa8adedece5af |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)