To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厓?┨蟻ε?腰??厓?┨蟻ε?腰??^ 1111101010001101001111111000010010110111100010110110000110000011110000110011111110001101100110000011111100111111111110101000110100111111100001001011011110001011011000011000001111000011001111111000110110011000001111110011111101011110 fa8d3f84b78b6183c33f8d983f3ffa8d3f84b78b6183c33f8d983f3f5e
EUC-JP 厓?┨蟻ε?腰??厓?┨蟻ε?腰??^ 10001111101101001100011100111111101010001011100110110101110000101010011011000101001111111011100111111000001111110011111110001111101101001100011100111111101010001011100110110101110000101010011011000101001111111011100111111000001111110011111101011110 8fb4c73fa8b9b5c2a6c53fb9f83f3f8fb4c73fa8b9b5c2a6c53fb9f83f3f5e
UTF-8 厓꿴┨蟻ε뤁腰뱀꽍厓꿴┨蟻ε뤁腰뱀꽍^ 1110010110001110100100111110101010111111101101001110001010010100101010001110100010011111101110111100111010110101111010111010010010000001111010001000010110110000111010111011000110000000111010101011110110001101111001011000111010010011111010101011111110110100111000101001010010101000111010001001111110111011110011101011010111101011101001001000000111101000100001011011000011101011101100011000000011101010101111011000110101011110 e58e93eabfb4e294a8e89fbbceb5eba481e885b0ebb180eabd8de58e93eabfb4e294a8e89fbbceb5eba481e885b0ebb180eabd8d5e
UHC 厓꿴┨蟻ε뤁腰뱀꽍厓꿴┨蟻ε뤁腰뱀꽍^ 11100100111011011011001011101001101001101011100111101011111111001010010111100101100011111011001011101001101001101011100111101100100001001001110111100100111011011011001011101001101001101011100111101011111111001010010111100101100011111011001011101001101001101011100111101100100001001001110101011110 e4edb2e9a6b9ebfca5e58fb2e9a6b9ec849de4edb2e9a6b9ebfca5e58fb2e9a6b9ec849d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)