Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 瓮??梧??獰? 1110000101000100001111110011111110001100111001100011111100111111111000001101011000111111 e1443f3f8ce63f3fe0d63f
EUC-JP 瓮??梧??獰? 1110000110100101001111110011111110111000111010000011111100111111111000001101100000111111 e1a53f3fb8e83f3fe0d83f
UTF-8 瓮뚳슛梧삥㎙獰좩 111001111001001110101110111010111001101010110011111011001000101010011011111001101010001010100111111011001000001010100101111000111000111010011001111001111000110110110000111011001010001010101001 e793aeeb9ab3ec8a9be6a2a7ec82a5e38e99e78db0eca2a9
UHC 瓮뚳슛梧삥㎙獰좩 11101000101101111000110011101111101111011011100011100111111111001011101111100110101001111010101111100111101111101010000101000100 e8b78cefbdb8e7fcbbe6a7abe7bea144

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
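
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC; the names `CODECS`, `to_bits` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is presumably where the 3f bytes in the table come from.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as 8 binary digits, concatenated."""
    return "".join(f"{b:08b}" for b in data)


def encode_table(text: str) -> dict:
    """Return {charset label: (bit string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        rows[label] = (to_bits(data), data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_str) in encode_table("A").items():
        print(label, bits, hex_str)
```

Calling `encode_table` with the same input as the page should produce comparable rows, although edge cases may differ wherever the two codec implementations disagree.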