To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????U??K 001111110011111100111111001111110011111101010101001111110011111101001011 3f3f3f3f3f553f3f4b
SJIS-WIN ???領?U??K 00111111001111110011111110010111110011000011111101010101001111110011111101001011 3f3f3f97cc3f553f3f4b
EUC-JP ???領?U??K 00111111001111110011111111001110110011100011111101010101001111110011111101001011 3f3f3fcece3f553f3f4b
UTF-8 렻렋렻領렊U렻렋K 1110101110100000101110111110101110100000100010111110101110100000101110111110100110100000100110001110101110100000100010100101010111101011101000001011101111101011101000001000101101001011 eba0bbeba08beba0bbe9a098eba08a55eba0bbeba08b4b
UHC 렻렋렻領렊U렻렋K 10001110110000111000111010100010100011101100001111010110110001011000111010100001010101011000111011000011100011101010001001001011 8ec38ea28ec3d6c58ea1558ec38ea24b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)