What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????醫??? 0011111100111111001111110011111100111111001111111110011111001110001111110011111100111111 3f3f3f3f3f3fe7ce3f3f3f
EUC-JP ??????醫??? 0011111100111111001111110011111100111111001111111110111011010000001111110011111100111111 3f3f3f3f3f3feed03f3f3f
UTF-8 說뺣뱯麟뗥선醫꾧뎄力 111011111010011010100001111010111011101010100011111010111011000110101111111011111010011110110011111010111001011110100101111011001000010010100000111010011000011010101011111010101011111010100111111010111000111010000100111011111010011010001010 efa6a1ebbaa3ebb1afefa7b3eb97a5ec84a0e986abeabea7eb8e84efa68a
UHC 說뺣뱯麟뗥선醫꾧뎄力 1110011011110010100101011110101110010011100101011110110011101000100010111110010110111100101100011110110010100010100001001110101010110101101011001110011010110011 e6f295eb9395ece88be5bcb1eca284eab5ace6b3

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
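
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page; the helper names `encode_row` and `convert` are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest equivalents of the charsets listed above, so individual byte values may differ for some vendor-specific characters. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the behavior visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the bytes again shows what survived the conversion.
    return raw.decode(codec), binary, raw.hex()


def convert(text):
    """Build one table row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexa) in convert("A").items():
        print(label, shown, binary, hexa)
```

Each byte is printed as eight bits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.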