What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???z???zB 001111110011111100111111011110100011111100111111001111110111101001000010 3f3f3f7a3f3f3f7a42
SJIS-WIN ?僊?z?僊?zB 0011111110011001010000010011111101111010001111111001100101000001001111110111101001000010 3f99413f7a3f99413f7a42
EUC-JP 饍僊?z饍僊?zB 100011111110100011101101110100011010001000111111011110101000111111101000111011011101000110100010001111110111101001000010 8fe8edd1a23f7a8fe8edd1a23f7a42
UTF-8 饍僊鐥z饍僊鐥zB 111010011010010110001101111001011000001110001010111010011001000010100101011110101110100110100101100011011110010110000011100010101110100110010000101001010111101001000010 e9a58de5838ae990a57ae9a58de5838ae990a57a42
UHC 饍僊鐥z饍僊鐥zB 111000001101011111100000101110101110000011010110011110101110000011010111111000001011101011100000110101100111101001000010 e0d7e0bae0d67ae0d7e0bae0d67a42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (0x3f) in the table marks a character that the charset cannot represent.
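
The conversion shown in the table can be approximated with a short Python sketch. It is not the tool's actual implementation: the function name to_bit_strings and the mapping from the table's charset labels to Python codec names (for example SJIS-Win to cp932 and UHC to cp949) are assumptions made for illustration, and Python's codecs may treat some characters differently from the tool, so individual rows may not match the table byte for byte.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These pairings are assumptions, not something the tool documents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec
    # cannot encode, similar to the "?" entries in the table.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "饍僊鐥z饍僊鐥zB"  # the string used in the table above
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal one.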