To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 玉??敖??徇 10001011110010100011111100111111100111011100001000111111001111111001110001101101 8bca3f3f9dc23f3f9c6d
EUC-JP 玉??敖??徇 10110110110011000011111100111111110110101100010000111111001111111101011111001110 b6cc3f3fdac43f3fd7ce
UTF-8 玉먲쉔敖잍퍍徇 111001111000111010001001111010111010100010110010111011001000100110010100111001101001010110010110111011001001111010001101111011011000110110001101111001011011111010000111 e78e89eba8b2ec8994e69596ec9e8ded8d8de5be87
UHC 玉먲쉔敖잍퍍徇 1110100010101100100100001110111110111101101010001110011111111001100111111110011010111011100001001110001011011111 e8ac90efbda8e7f99fe6bb84e2df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)