To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 畯???長?除???[畯???長?除???[^ 1111101101101111001111110011111100111111100100101011011100111111100011111001110000111111001111110011111101011011111110110110111100111111001111110011111110010010101101110011111110001111100111000011111100111111001111110101101101011110 fb6f3f3f3f92b73f8f9c3f3f3f5bfb6f3f3f3f92b73f8f9c3f3f3f5b5e
EUC-JP 畯???長?除???[畯???長?除???[^ 10001111110011011011101100111111001111110011111111000100101110010011111110111101111111000011111100111111001111110101101110001111110011011011101100111111001111110011111111000100101110010011111110111101111111000011111100111111001111110101101101011110 8fcdbb3f3f3fc4b93fbdfc3f3f3f5b8fcdbb3f3f3fc4b93fbdfc3f3f3f5b5e
UTF-8 畯얜렰렋長렢除쿰렰렦[畯얜렰렋長렢除쿰렰렦[^ 111001111001010110101111111011001001011010011100111010111010000010110000111010111010000010001011111010011001010110110111111010111010000010100010111010011001100110100100111011001011111110110000111010111010000010110000111010111010000010100110010110111110011110010101101011111110110010010110100111001110101110100000101100001110101110100000100010111110100110010101101101111110101110100000101000101110100110011001101001001110110010111111101100001110101110100000101100001110101110100000101001100101101101011110 e795afec969ceba0b0eba08be995b7eba0a2e999a4ecbfb0eba0b0eba0a65be795afec969ceba0b0eba08be995b7eba0a2e999a4ecbfb0eba0b0eba0a65b5e
UHC 畯얜렰렋長렢除쿰렰렦[畯얜렰렋長렢除쿰렰렦[^ 11110001111000011011111011101011100011101011110110001110101000101110110111111110100011101011001111110000101101101100010011110001100011101011110110001110101101010101101111110001111000011011111011101011100011101011110110001110101000101110110111111110100011101011001111110000101101101100010011110001100011101011110110001110101101010101101101011110 f1e1beeb8ebd8ea2edfe8eb3f0b6c4f18ebd8eb55bf1e1beeb8ebd8ea2edfe8eb3f0b6c4f18ebd8eb55b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)