To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????L?????????L^ 001111110011111100111111001111110011111100111111001111110011111100111111010011000011111100111111001111110011111100111111001111110011111100111111001111110100110001011110 3f3f3f3f3f3f3f3f3f4c3f3f3f3f3f3f3f3f3f4c5e
SJIS-WIN 上眈ャ捨濵竺杓軸L上眈ャ捨濵竺杓軸L^ 10001111111000111110000110111100101011001111000010111111100011101100110011111011010011011000111010110001100011101101101110001110101100100100110010001111111000111110000110111100101011001111000010111111100011101100110011111011010011011000111010110001100011101101101110001110101100100100110001011110 8fe3e1bcacf0bf8eccfb4d8eb18edb8eb24c8fe3e1bcacf0bf8eccfb4d8eb18edb8eb24c5e
EUC-JP 上眈ャ?捨濵竺杓軸L上眈ャ?捨濵竺杓軸L^ 101111101110010111100010101111101000111010101100001111111011110011001110100011111100100110100110101111001011001110111100110111011011110010110100010011001011111011100101111000101011111010001110101011000011111110111100110011101000111111001001101001101011110010110011101111001101110110111100101101000100110001011110 bee5e2be8eac3fbcce8fc9a6bcb3bcddbcb44cbee5e2be8eac3fbcce8fc9a6bcb3bcddbcb44c5e
UTF-8 上眈ャ捨濵竺杓軸L上眈ャ捨濵竺杓軸L^ 111001001011100010001010111001111001110010001000111011111011110110101100111011101000000110111110111001101000110110101000111001101011111110110101111001111010101110111010111001101001110110010011111010001011101110111000010011001110010010111000100010101110011110011100100010001110111110111101101011001110111010000001101111101110011010001101101010001110011010111111101101011110011110101011101110101110011010011101100100111110100010111011101110000100110001011110 e4b88ae79c88efbdacee81bee68da8e6bfb5e7abbae69d93e8bbb84ce4b88ae79c88efbdacee81bee68da8e6bfb5e7abbae69d93e8bbb84c5e
UHC 上眈??捨?竺杓軸L上眈??捨?竺杓軸L^ 110111111011111011110111101011110011111100111111110111101101011100111111111101011110011111111000111101011111010111101110010011001101111110111110111101111010111100111111001111111101111011010111001111111111010111100111111110001111010111110101111011100100110001011110 dfbef7af3f3fded73ff5e7f8f5f5ee4cdfbef7af3f3fded73ff5e7f8f5f5ee4c5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
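
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_bits` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears consistent with the ISO-8859-1 row above.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hex_string) in convert("L^").items():
        print(label, binary, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.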