To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 叱鉀ミ釁糂瞎鄙 111100101010000110001110101101101111101111000101110100001110011111010101111000101111000111100001110100001110011110111111 f2a18eb6fbc5d0e7d5e2f1e1d0e7bf
EUC-JP ?叱鉀ミ釁糂瞎鄙 00111111101111001011100010001111111000111101100010001110110100001110111011010111111001001111001111100010110100101110111011000001 3fbcb88fe3d88ed0eed7e4f3e2d2eec1
UTF-8 叱鉀ミ釁糂瞎鄙 111011101000011110011000111001011000111110110001111010011000100110000000111011111011111010010000111010011000011110000001111001111011001110000010111001111001111010001110111010011000010010011001 ee8798e58fb1e98980efbe90e98781e7b382e79e8ee98499
UHC ?叱鉀????鄙 0011111111110010111010101100101110100101001111110011111100111111001111111101111010101001 3ff2eacba53f3f3f3fdea9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)