To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????^ | 001111110011111100111111001111110011111101011110 | 3f3f3f3f3f5e |
SJIS-WIN | 逋誤胸奛・^ | 11100111100110011000110011101011100010111011100111111010101000011010010101011110 | e7998ceb8bb9faa1a55e |
EUC-JP | 逋誤胸奛・^ | 111011011111100110111000111011011011011010111011100011111011100011110111100011101010010101011110 | edf9b8edb6bb8fb8f78ea55e |
UTF-8 | 逋誤胸奛・^ | 11101001100000001000101111101000101010101010010011101000100000111011100011100101101001011001101111101111101111011010010101011110 | e9808be8aaa4e883b8e5a59befbda55e |
UHC | 逋誤胸??^ | 111110001110011111101000101001101111110111011000001111110011111101011110 | f8e7e8a6fdd83f3f5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)