To what bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 燿?????循??閻??要??將??循??冗??^ 1110000010100000001111110011111100111111001111110011111110001111011110100011111100111111111010001000010100111111001111111001011101110110001111110011111110011011100100100011111100111111100011110111101000111111001111111000111111100111001111110011111101011110 e0a03f3f3f3f3f8f7a3f3fe8853f3f97763f3f9b923f3f8f7a3f3f8fe73f3f5e
EUC-JP 燿?????循??閻??要??將??循??冗??^ 1110000010100010001111110011111100111111001111110011111110111101110110110011111100111111111011111110010100111111001111111100110111010111001111110011111111010101111100100011111100111111101111011101101100111111001111111011111011101001001111110011111101011110 e0a23f3f3f3f3fbddb3f3fefe53f3fcdd73f3fd5f23f3fbddb3f3fbee93f3f5e
UTF-8 燿ⓩ닁曆섋략循덌쉠閻뺟칻要띷뀞將뚩였循덌쉠冗뷜뵠^ 11100111100001111011111111100010100100111010100111101011100010111000000111101111101001101000101111101100100001001000101111101011100111101011010111100101101111101010101011101011100011011000110011101100100010011010000011101001100101101011101111101011101110101001111111101100101110011011101111101000101001101000000111101011100111011011011111101011100000001001111011100101101100001000011111101011100110101010100111101100100110001000000011100101101111101010101011101011100011011000110011101100100010011010000011100101100001101001011111101011101101111001110011101011101101011010000001011110 e787bfe293a9eb8b81efa68bec848beb9eb5e5beaaeb8d8cec89a0e996bbebba9fecb9bbe8a681eb9db7eb809ee5b087eb9aa9ec9880e5beaaeb8d8cec89a0e58697ebb79cebb5a05e
UHC 燿ⓩ닁曆섋략循덌쉠閻뺟칻要띷뀞將뚩였循덌쉠冗뷜뵠^ 11101000111111001010100011100110100010001000101011100110101101111001100011101000101101111010101111100010111000001000100011101111101111011010101011100111101000101001010111100111101011111000101111101001101010011000110111100110100001011001010111101101111000101000110011101000101111111011010011100010111000001000100011101111101111011010101011101001101101111011101011100010100101001010000001011110 e8fca8e6888ae6b798e8b7abe2e088efbdaae7a295e7af8be9a98de68595ede28ce8bfb4e2e088efbdaae9b7bae294a05e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
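
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation. The helper names `bit_strings` and `convert` are hypothetical. The codec mapping is an assumption: it takes SJIS-Win to be Python's `cp932` codec and UHC to be `cp949`. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Hypothetical helpers; the codec names are Python's, assumed to match the page's charsets.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings of `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode `text` in every charset and collect the resulting bit strings."""
    return {label: bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("燿^").items():
        print(label, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.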