To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 訂竭??齬孟?雋????雍?齬孟?雋??? 100100101111100111100010100100010011111100111111111010101001011110010110110100000011111111101000101100100011111100111111001111110011111111101000101101000011111111101010100101111001011011010000001111111110100010110010001111110011111100111111 92f9e2913f3fea9796d03fe8b23f3f3f3fe8b43fea9796d03fe8b23f3f3f
EUC-JP 訂竭??齬孟?雋????雍?齬孟?雋??? 110001001111101111100011111100010011111100111111111100111111011111001100110100100011111111110000101101000011111100111111001111110011111111110000101101100011111111110011111101111100110011010010001111111111000010110100001111110011111100111111 c4fbe3f13f3ff3f7ccd23ff0b43f3f3f3ff0b63ff3f7ccd23ff0b43f3f3f
UTF-8 訂竭렭렏齬孟웃雋잰렫브혁雍렏齬孟웃雋잰렫뮈 111010001010100010000010111001111010101110101101111010111010000010101101111010111010000010001111111010011011110110101100111001011010110110011111111011001001101110000011111010011001101110001011111011001001111010110000111010111010000010101011111010111011100010001100111011011001100010000001111010011001101110001101111010111010000010001111111010011011110110101100111001011010110110011111111011001001101110000011111010011001101110001011111011001001111010110000111010111010000010101011111010111010111010001000 e8a882e7abadeba0adeba08fe9bdace5ad9fec9b83e99b8bec9eb0eba0abebb88ced9881e99b8deba08fe9bdace5ad9fec9b83e99b8bec9eb0eba0abebae88
UHC 訂竭렭렏齬孟웃雋잰렫브혁雍렏齬孟웃雋잰렫뮈 111011111111010011001010111001101000111010111010100011101010010111100101111000011101100011101011101111111111010011110001111001101100000011101001100011101011100110111010111010101100011111110101111010001011110010001110101001011110010111100001110110001110101110111111111101001111000111100110110000001110100110001110101110011011100110111111 eff4cae68eba8ea5e5e1d8ebbff4f1e6c0e98eb9baeac7f5e8bc8ea5e5e1d8ebbff4f1e6c0e98eb9b9bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)