To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄??韋??湲??筌 1001011011101111001111110011111111101000111010000011111100111111100111111101000100111111001111111110001010100011 96ef3f3fe8e83f3f9fd13f3fe2a3
EUC-JP 厄??韋??湲??筌 1100110011110001001111110011111111110000111010100011111100111111110111101101001100111111001111111110010010100101 ccf13f3ff0ea3f3fded33f3fe4a5
UTF-8 厄닿낮韋껇땻湲몃왂筌 111001011000111010000100111010111000101110111111111010111000001010101110111010011001111110001011111010101011101110000111111010111001010110111011111001101011100110110010111010111010101010000011111011001001100110000010111001111010110110001100 e58e84eb8bbfeb82aee99f8beabb87eb95bbe6b9b2ebaa83ec9982e7ad8c
UHC 厄닿낮韋껇땻湲몃왂筌 1110010011111000101101001110101010110011101101111110101011011111100000111110100010001011100100011110101010111000101110001110101110011110101101011110111110100111 e4f8b4eab3b7eadf83e88b91eab8b8eb9eb5efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)