To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???mz???mzB 0011111100111111001111110110110101111010001111110011111100111111011011010111101001000010 3f3f3f6d7a3f3f3f6d7a42
SJIS-WIN 逆??mz逆??mzB 10001011011101000011111100111111011011010111101010001011011101000011111100111111011011010111101001000010 8b743f3f6d7a8b743f3f6d7a42
EUC-JP 逆??mz逆??mzB 10110101110101010011111100111111011011010111101010110101110101010011111100111111011011010111101001000010 b5d53f3f6d7ab5d53f3f6d7a42
UTF-8 逆곭깑mz逆곭깑mzB 1110100110000000100001101110101010110011101011011110101010111001100100010110110101111010111010011000000010000110111010101011001110101101111010101011100110010001011011010111101001000010 e98086eab3adeab9916d7ae98086eab3adeab9916d7a42
UHC 逆곭깑mz逆곭깑mzB 1110011010111101100000011110011110000011100010110110110101111010111001101011110110000001111001111000001110001011011011010111101001000010 e6bd81e7838b6d7ae6bd81e7838b6d7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)