To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ÔãÎÉÔãäúz | 110101001110001111001110110010011101010011100011111001001111101001111010 | d4e3cec9d4e3e4fa7a |
| SJIS-WIN | ????????z | 001111110011111100111111001111110011111100111111001111110011111101111010 | 3f3f3f3f3f3f3f3f7a |
| EUC-JP | ÔãÎÉÔãäúz | 10001111101010101101010010001111101010111010101010001111101010101100001010001111101010101011000110001111101010101101010010001111101010111010101010001111101010111010001110001111101010111110001001111010 | 8faad48fabaa8faac28faab18faad48fabaa8faba38fabe27a |
| UTF-8 | ÔãÎÉÔãäúz | 1100001110010100110000111010001111000011100011101100001110001001110000111001010011000011101000111100001110100100110000111011101001111010 | c394c3a3c38ec389c394c3a3c3a4c3ba7a |
| UHC | ????????z | 001111110011111100111111001111110011111100111111001111110011111101111010 | 3f3f3f3f3f3f3f3f7a |

SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
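
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function name `encode_bits` is hypothetical, and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to be what the SJIS-WIN and UHC rows show.

```python
def encode_bits(text, charset):
    """Encode text in the given charset and return (binary string, hex string).

    Unencodable characters are replaced with '?' (0x3f) instead of raising.
    """
    raw = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


# Assumed mapping from the labels used on the page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "Ôz"
    for label, codec in CHARSETS.items():
        bits, hexa = encode_bits(sample, codec)
        print(label, bits, hexa)
```

Exact bytes for the Japanese and Korean charsets may differ slightly between implementations, so the output for those rows should be checked against the table rather than assumed to match.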