To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??爾←?猿??筌??爾←?猿??B 111000101010001100111111001111111000111010100010100000011010100100111111100010011000111000111111001111111110001010100011001111110011111110001110101000101000000110101001001111111000100110001110001111110011111101000010 e2a33f3f8ea281a93f898e3f3fe2a33f3f8ea281a93f898e3f3f42
EUC-JP 筌??爾←?猿??筌??爾←?猿??B 111001001010010100111111001111111011110010100100101000101010101100111111101100011110111000111111001111111110010010100101001111110011111110111100101001001010001010101011001111111011000111101110001111110011111101000010 e4a53f3fbca4a2ab3fb1ee3f3fe4a53f3fbca4a2ab3fb1ee3f3f42
UTF-8 筌뤿툖爾←땟猿딆끝筌뤿툖爾←땟猿딆끝B 11100111101011011000110011101011101001001011111111101101100010001001011011100111100010001011111011100010100001101001000011101011100101011001111111100111100011001011111111101011100101001000011011101011100000011001110111100111101011011000110011101011101001001011111111101101100010001001011011100111100010001011111011100010100001101001000011101011100101011001111111100111100011001011111111101011100101001000011011101011100000011001110101000010 e7ad8ceba4bfed8896e788bee28690eb959fe78cbfeb9486eb819de7ad8ceba4bfed8896e788bee28690eb959fe78cbfeb9486eb819d42
UHC 筌뤿툖爾←땟猿딆끝筌뤿툖爾←땟猿딆끝B 11101111101001111000111111101011101110001000110111101100101100111010000111100111101101101010110111101010101110111000101011101100101100111010000111101111101001111000111111101011101110001000110111101100101100111010000111100111101101101010110111101010101110111000101011101100101100111010000101000010 efa78febb88decb3a1e7b6adeabb8aecb3a1efa78febb88decb3a1e7b6adeabb8aecb3a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)