What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????咀??蛟??爰?臟?咀??蛟??圓 00111111001111110011111100111111100110011111000000111111001111111110010110000000001111110011111111100000101001110011111111100100011001100011111110011001111100000011111100111111111001011000000000111111001111111001101010100010 3f3f3f3f99f03f3fe5803f3fe0a73fe4663f99f03f3fe5803f3f9aa2
EUC-JP 焌???咀?勖蛟??爰?臟?咀?勖蛟??圓 10001111110010011110100000111111001111110011111111010010111100100011111110001111101100111110110111101001111000000011111100111111111000001010100100111111111001111100011100111111110100101111001000111111100011111011001111101101111010011110000000111111001111111101010010100100 8fc9e83f3f3fd2f23f8fb3ede9e03f3fe0a93fe7c73fd2f23f8fb3ede9e03f3fd4a4
UTF-8 焌띳렰렡咀렡勖蛟렰렮爰렩臟렪咀렡勖蛟렰렮圓 111001111000010010001100111010111001110110110011111010111010000010110000111010111010000010100001111001011001001010000000111010111010000010100001111001011000101110010110111010001001101110011111111010111010000010110000111010111010000010101110111001111000100010110000111010111010000010101001111010001000011110011111111010111010000010101010111001011001001010000000111010111010000010100001111001011000101110010110111010001001101110011111111010111010000010110000111010111010000010101110111001011001110010010011 e7848ceb9db3eba0b0eba0a1e59280eba0a1e58b96e89b9feba0b0eba0aee788b0eba0a9e8879feba0aae59280eba0a1e58b96e89b9feba0b0eba0aee59c93
UHC 焌띳렰렡咀렡勖蛟렰렮爰렩臟렪咀렡勖蛟렰렮圓 111100011110000010110110111100011000111010111101100011101011001011101110101110101000111010110010111010011110110111001110111100011000111010111101100011101011101111101010101110101000111010110111111011011111010010001110101110001110111010111010100011101011001011101001111011011100111011110001100011101011110110001110101110111110101010101101 f1e0b6f18ebd8eb2eeba8eb2e9edcef18ebd8ebbeaba8eb7edf48eb8eeba8eb2e9edcef18ebd8ebbeaad

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
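
The conversion in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. The use of `errors="replace"` is also an assumption, chosen because it substitutes `?` (0x3f) for characters a charset cannot represent, which resembles the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # Characters the charset cannot represent become "?" (0x3f).
        data = text.encode(codec, errors="replace")
        # Each byte is written as eight binary digits, most significant first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("圓"):
        print(label, bits, hexa)
```

Because each byte always contributes exactly eight binary digits and two hexadecimal digits, the binary column is four times as long as the hexadecimal column for the same input.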