What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 瓮??瓦??瓮??鳶 1110000101000100001111110011111110001010101000100011111100111111111000010100010000111111001111111001001111001110 e1443f3f8aa23f3fe1443f3f93ce
EUC-JP 瓮??瓦??瓮??鳶 1110000110100101001111110011111110110100101001000011111100111111111000011010010100111111001111111100011011010000 e1a53f3fb4a43f3fe1a53f3fc6d0
UTF-8 瓮뚳슘瓦뱄쉴瓮쎿솄鳶 111001111001001110101110111010111001101010110011111011001000101010011000111001111001001110100110111010111011000110000100111011001000100110110100111001111001001110101110111011001000111010111111111011001000011010000100111010011011001110110110 e793aeeb9ab3ec8a98e793a6ebb184ec89b4e793aeec8ebfec8684e9b3b6
UHC 瓮뚳슘瓦뱄쉴瓮쎿솄鳶 1110100010110111100011001110111110111101101101111110100010111111101110011110111110111101101011111110100010110111100110111110011010011001100010011110011011101001 e8b78cefbdb7e8bfb9efbdafe8b79be69989e6e9

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hex 3f).
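
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec is an acceptable stand-in for SJIS-WIN and `cp949` for UHC, and that unencodable characters are replaced with "?" as in the table; the helper name `encode_bits` and the `CHARSETS` mapping are hypothetical.

```python
def encode_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Table label -> Python codec name (assumed equivalents, see the note above).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexa = encode_bits("鳶", codec)
        print(label, binary, hexa)
```

For the single character 鳶 this should give the same bytes as the end of each Japanese and UTF-8 row above (93ce, c6d0, e9b3b6); results for other charsets may differ slightly from the page, since codec tables vary between implementations.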