What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 亦????キ違??冶 1001011010010010001111110011111100111111001111111000001101001100100010001110000100111111001111111001011011101000 96923f3f3f3f834c88e13f3f96e8
EUC-JP 亦????キ違??冶 1100101111110010001111110011111100111111001111111010010110101101101100001110001100111111001111111100110011101010 cbf23f3f3f3fa5adb0e33f3fccea
UTF-8 亦껋쉶栒덅キ違먯춪冶 111001001011101010100110111010101011101110001011111011001000100110110110111001101010000010010010111010111000110110000101111000111000001010101101111010011000000110010101111010111010100010101111111011001011011010101010111001011000011010110110 e4baa6eabb8bec89b6e6a092eb8d85e382ade98195eba8afecb6aae586b6
UHC 亦껋쉶栒덅キ違먯춪冶 1110011010110010100000111110110010011010100011001110001011100011100010001110100010101011101011011110101011011110100100001110110010101101100001111110010110100111 e6b283ec9a8ce2e388e8abadeade90ecad87e5a7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
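
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_report` and the mapping from the table's charset labels to Python codec names are assumptions (in particular, UHC is assumed to correspond to `cp949` and SJIS-WIN to `cp932`). Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the runs of 3f in the table above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_report("A"):
        print(label, bits, hexstr)
```

For the input "A", every charset listed here yields the same single byte (hex 41), since all of them agree on ASCII. The differences only appear for non-ASCII input.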