To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???乙??癲??魏⑨?飮??壓??裕?? 00111111001111110011111110001001101100110011111100111111111000011001111100111111001111111110100110110000100001110100100000111111100111110101101000111111001111111001101011011000001111110011111110010111010101000011111100111111 3f3f3f89b33f3fe19f3f3fe9b087483f9f5a3f3f9ad83f3f97543f3f
EUC-JP ???乙??癲??魏??飮??壓??裕?? 001111110011111100111111101100101011010100111111001111111110001010100001001111110011111111110010101100100011111100111111110111011011101100111111001111111101010011011010001111110011111111001101101101010011111100111111 3f3f3fb2b53f3fe2a13f3ff2b23f3fddbb3f3fd4da3f3fcdb53f3f
UTF-8 捻뀁뫑乙녹퓞癲딉퐡魏⑨쭫飮뗭췀壓믩갭裕띄솾 111011111010011010100100111010111000000010000001111010111010101110010001111001001011100110011001111010111000010110111001111011011001001110011110111001111001100110110010111010111001010010001001111011011001000010100001111010011010110110001111111000101001000110101000111011001010110110101011111010011010001110101110111010111001011110101101111011001011011110000000111001011010001110010011111010111010111110101001111010101011000010101101111010001010001110010101111010111001110110000100111011001000011010111110 efa6a4eb8081ebab91e4b999eb85b9ed939ee799b2eb9489ed90a1e9ad8fe291a8ecadabe9a3aeeb97adecb780e5a393ebafa9eab0ade8a395eb9d84ec86be
UHC 捻뀁뫑乙녹퓞癲딉퐡魏⑨쭫飮뗭췀壓믩갭裕띄솾 111001101111011110110010111011001001000110110011111010111110000010110011111011001011111110001000111011111010011010001010111011111011110110001010111010101110000010101000111011111010011110011111111010111110011010001011111011001010110110011100111001001110001010010010111010111011000010111000111010111010111010110110111001111001100110110010 e6f7b2ec91b3ebe0b3ecbf88efa68aefbd8aeae0a8efa79febe68becad9ce4e292ebb0b8ebaeb6e799b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)