To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 訂竭??齬孟└雋????雍?齬孟└雋??? 1001001011111001111000101001000100111111001111111110101010010111100101101101000010000100101001001110100010110010001111110011111100111111001111111110100010110100001111111110101010010111100101101101000010000100101001001110100010110010001111110011111100111111 92f9e2913f3fea9796d084a4e8b23f3f3f3fe8b43fea9796d084a4e8b23f3f3f
EUC-JP 訂竭??齬孟└雋????雍?齬孟└雋??? 1100010011111011111000111111000100111111001111111111001111110111110011001101001010101000101001101111000010110100001111110011111100111111001111111111000010110110001111111111001111110111110011001101001010101000101001101111000010110100001111110011111100111111 c4fbe3f13f3ff3f7ccd2a8a6f0b43f3f3f3ff0b63ff3f7ccd2a8a6f0b43f3f3f
UTF-8 訂竭렭렏齬孟└雋잰렫브혁雍렏齬孟└雋잰렫뮈 111010001010100010000010111001111010101110101101111010111010000010101101111010111010000010001111111010011011110110101100111001011010110110011111111000101001010010010100111010011001101110001011111011001001111010110000111010111010000010101011111010111011100010001100111011011001100010000001111010011001101110001101111010111010000010001111111010011011110110101100111001011010110110011111111000101001010010010100111010011001101110001011111011001001111010110000111010111010000010101011111010111010111010001000 e8a882e7abadeba0adeba08fe9bdace5ad9fe29494e99b8bec9eb0eba0abebb88ced9881e99b8deba08fe9bdace5ad9fe29494e99b8bec9eb0eba0abebae88
UHC 訂竭렭렏齬孟└雋잰렫브혁雍렏齬孟└雋잰렫뮈 111011111111010011001010111001101000111010111010100011101010010111100101111000011101100011101011101001101010011011110001111001101100000011101001100011101011100110111010111010101100011111110101111010001011110010001110101001011110010111100001110110001110101110100110101001101111000111100110110000001110100110001110101110011011100110111111 eff4cae68eba8ea5e5e1d8eba6a6f1e6c0e98eb9baeac7f5e8bc8ea5e5e1d8eba6a6f1e6c0e98eb9b9bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)