To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?寥???????? 0011111110011011100011000011111100111111001111110011111100111111001111110011111100111111 3f9b8c3f3f3f3f3f3f3f3f
EUC-JP ?寥???????? 0011111111010101111011000011111100111111001111110011111100111111001111110011111100111111 3fd5ec3f3f3f3f3f3f3f3f
UTF-8 쒔寥렕렳쒀롋뤏쮱굶ㆂ 111011001001001010010100111001011010111110100101111010111010000010010101111010111010000010110011111011001001001010000000111010111010000110001011111010111010010010001111111011001010111010110001111010101011010110110110111000111000011010000010 ec9294e5afa5eba095eba0b3ec9280eba18beba48fecaeb1eab5b6e38682
UHC 쒔寥렕렳쒀롋뤏쮱굶ㆂ 1011111010101101111010001110111110001110101010101000111011000000101111101010110010001110110100011000111110111111101010001000111010110001101111101010010011110010 beade8ef8eaa8ec0beac8ed18fbfa88eb1bea4f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)