What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄β???ぜ???壓??毅??柔j?鈺??釗 10010110111011111000001111000000001111110011111100111111100000101011101000111111001111110011111110011010110110000011111100111111100010110100001000111111001111111000111101011111100000101000101000111111111110111100010000111111001111111111101110111011 96ef83c03f3f3f82ba3f3f3f9ad83f3f8b423f3f8f5f828a3ffbc43f3ffbbb
EUC-JP 厄β?瑗?ぜ洧??壓??毅??柔j?鈺??釗 11001100111100011010011011000010001111111000111111001100110000000011111110100100101111001000111111000111101101000011111100111111110101001101101000111111001111111011010110100011001111110011111110111101110000001010001111101010001111111000111111100011110101010011111100111111100011111110001110100110 ccf1a6c23f8fccc03fa4bc8fc7b43f3fd4da3f3fb5a33f3fbdc0a3ea3f8fe3d53f3f8fe3a6
UTF-8 厄β돦瑗띈ぜ洧곗춷壓믩베毅뽫춯柔j데鈺곌래釗 1110010110001110100001001100111010110010111010111000111110100110111001111001000110010111111010111001110110001000111000111000000110011100111001101011010010100111111010101011001110010111111011001011011010110111111001011010001110010011111010111010111110101001111010111011001010100000111001101010111110000101111010111011110110101011111011001011011010101111111001101001111110010100111011111011110110001010111010111000110110110000111010011000100010111010111010101011001110001100111010111001111010011000111010011000011110010111 e58e84ceb2eb8fa6e79197eb9d88e3819ce6b4a7eab397ecb6b7e5a393ebafa9ebb2a0e6af85ebbdabecb6afe69f94efbd8aeb8db0e988baeab38ceb9e98e98797
UHC 厄β돦瑗띈ぜ洧곗춷壓믩베毅뽫춯柔j데鈺곌래釗 1110010011111000101001011110001010001001101010101110101010111100101101101110100010101010101111001110101011111011101100001110110010101101100100111110010011100010100100101110101110111010101000111110101111110110100101101110011110101101100011001110101011110101101000111110101010110101101001011110100010101101101100001110101010110111101000011110000111110010 e4f8a5e289aaeabcb6e8aabceafbb0ecad93e4e292ebbaa3ebf696e7ad8ceaf5a3eab5a5e8adb0eab7a1e1f2

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
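
The conversion shown in the table can be reproduced roughly as follows. This is a minimal sketch in Python, not the code behind this page. The codec names in CODECS (in particular cp932 for SJIS-WIN and cp949 for UHC) are assumed equivalents, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is the same behavior visible in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("厄β").items():
        print(name, binary, hexadecimal)
```

For the first two characters of the sample input, "厄β", the UTF-8 hex string is e58e84ceb2, which matches the start of the UTF-8 row in the table.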