What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??柔??轅⑤????違??魏??筌??? 1110000110011111001111110011111110001111010111110011111100111111111001110111011010000111010001000011111100111111001111110011111110001000111000010011111100111111111010011011000000111111001111111110001010100011001111110011111100111111 e19f3f3f8f5f3f3fe77687443f3f3f3f88e13f3fe9b03f3fe2a33f3f3f
EUC-JP 癲??柔??轅????ı違??魏??筌??? 111000101010000100111111001111111011110111000000001111110011111111101101110101110011111100111111001111110011111110001111101010011100010110110000111000110011111100111111111100101011001000111111001111111110010010100101001111110011111100111111 e2a13f3fbdc03f3fedd73f3f3f3f8fa9c5b0e33f3ff2b23f3fe4a53f3f3f
UTF-8 癲얘퀡柔꾦걬轅⑤쭋列룸ı違됵쫯魏녾섭筌겹굢溜 1110011110011001101100101110110010010110100110001110110110000000101000011110011010011111100101001110101010111110101001101110101010110001101011001110100010111101100001011110001010010001101001001110110010101101100010111110111110100110100111001110101110100011101110001100010010110001111010011000000110010101111010111001000010110101111011001010101110101111111010011010110110001111111010111000010110111110111011001000010010101101111001111010110110001100111010101011001010111001111010101011010110100010111011111010011110001011 e799b2ec9698ed80a1e69f94eabea6eab1ace8bd85e291a4ecad8befa69ceba3b8c4b1e98195eb90b5ecabafe9ad8feb85beec84ade7ad8ceab2b9eab5a2efa78b
UHC 癲얘퀡柔꾦걬轅⑤쭋列룸ı違됵쫯魏녾섭筌겹굢溜 1110111110100110101111101110101010110011100101011110101011110101100001001110100110000001100101011110101010111111101010001110101110100111100001011110011011101010101101111110101110101001101001011110101011011110100010011110111110100110100001111110101011100000100001101110101010111100101101111110111110100111101100001110001110000010100010011110101011111110 efa6beeab395eaf584e98195eabfa8eba785e6eab7eba9a5eade89efa687eae086eabcb7efa7b0e38289eafe

SJIS-WIN, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP) respectively.
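
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents from Python's standard library, and the helper names `encode_bits` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which resembles the runs of `3f` in the table above, although the exact replacement behavior of the page may differ.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {name: encode_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, binary, hexadecimal)
```

Because each byte is rendered as exactly eight bits, the binary column is always four times as long as the hexadecimal column, which is a quick way to sanity-check a row.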