To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???塋??踰??魏??橈??維??怨?サ 111000011001111100111111001111110011111110011010110010000011111100111111111001101111101000111111001111111110100110110000001111110011111110011110111101000011111100111111100010001101101100111111001111111000100110000101001111111000001101010100 e19f3f3f3f9ac83f3fe6fa3f3fe9b03f3f9ef43f3f88db3f3f89853f8354
EUC-JP 癲???塋??踰??魏??橈??維??怨?サ 111000101010000100111111001111110011111111010100110010100011111100111111111011001111110000111111001111111111001010110010001111110011111111011100111101100011111100111111101100001101110100111111001111111011000111100101001111111010010110110101 e2a13f3f3fd4ca3f3fecfc3f3ff2b23f3fdcf63f3fb0dd3f3fb1e53fa5b5
UTF-8 癲앷쑴짹塋딆뭾踰㏝뇡魏껉뭣橈볥쑐維쒏룚怨ㅼサ 111001111001100110110010111011001001010110110111111011001001000110110100111011001010011110111001111001011010000110001011111010111001010010000110111010111010110110111110111010001011100010110000111000111000111110011101111010111000011110100001111010011010110110001111111010101011101110001001111010111010110110100011111001101010100110001000111010111011001110100101111011001001000110010000111001111011011010101101111011001001001010001111111010111010001110011010111001101000000010101000111000111000010110111100111000111000001010110101 e799b2ec95b7ec91b4eca7b9e5a18beb9486ebadbee8b8b0e38f9deb87a1e9ad8feabb89ebada3e6a988ebb3a5ec9190e7b6adec928feba39ae680a8e385bce382b5
UHC 癲앷쑴짹塋딆뭾踰㏝뇡魏껉뭣橈볥쑐維쒏룚怨ㅼサ 1110111110100110100111011110101010111110101010011100001010110001111001111010101110001010111011001001001010001101111010111011001010100111111010011000011110001001111010101110000010000011111010101011100110111101111010001111101010010011111010111001110010101111111010111010101110011100111001101000111110010110111010101011001110100100111011001010101110110101 efa69deabea9c2b1e7ab8aec928debb2a7e98789eae083eab9bde8fa93eb9cafebab9ce68f96eab3a4ecabb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)