To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???寥?Р梧??^ 00111111001111110011111110011011100011000011111110000100010100011000110011100110001111110011111101011110 3f3f3f9b8c3f84518ce63f3f5e
EUC-JP ???寥?Р梧??^ 00111111001111110011111111010101111011000011111110100111101100101011100011101000001111110011111101011110 3f3f3fd5ec3fa7b2b8e83f3f5e
UTF-8 遼꿱엩寥덆Р梧귡몚^ 111011111010011110000011111010101011111110110001111011001001011110101001111001011010111110100101111010111000110110000110110100001010000011100110101000101010011111101010101101111010000111101011101010101001101001011110 efa783eabfb1ec97a9e5afa5eb8d86d0a0e6a2a7eab7a1ebaa9a5e
UHC 遼꿱엩寥덆Р梧귡몚^ 11101001101011001011001011101000100111101000001011101000111011111000100011101001101011001011001011100111111111001000001011101001100100011000100001011110 e9acb2e89e82e8ef88e9acb2e7fc82e991885e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)