To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?B??B?^ | 00111111010000100011111100111111010000100011111101011110 | 3f423f3f423f5e |
SJIS-WIN | 廈B无廈B旛^ | 1001101111111010010000101001110111011001100110111111101001000010100111011101011101011110 | 9bfa429dd99bfa429dd75e |
EUC-JP | 廈B无廈B旛^ | 1101011011111100010000101101101011011011110101101111110001000010110110101101100101011110 | d6fc42dadbd6fc42dad95e |
UTF-8 | 廈B无廈B旛^ | 111001011011101110001000010000101110011010010111101000001110010110111011100010000100001011100110100101111001101101011110 | e5bb8842e697a0e5bb8842e6979b5e |
UHC | 廈B无廈B?^ | 11111001101111110100001011011001111010011111100110111111010000100011111101011110 | f9bf42d9e9f9bf423f5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)