To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????↑ぜ恂??齬??誼←?釉??沃 001111110011111100111111001111111000000110101010100000101011101010011100100101100011111100111111111010101001011100111111001111111000101101100010100000011010100100111111111001111101011000111111001111111001011110000000 3f3f3f3f81aa82ba9c963f3fea973f3f8b6281a93fe7d63f3f9780
EUC-JP ????↑ぜ恂??齬??誼←?釉??沃 001111110011111100111111001111111010001010101100101001001011110011010111111101100011111100111111111100111111011100111111001111111011010111000011101000101010101100111111111011101101100000111111001111111100110111100000 3f3f3f3fa2aca4bcd7f63f3ff3f73f3fb5c3a2ab3feed83f3fcde0
UTF-8 閱뤿툕璘↑ぜ恂⑸뙑齬잕퀣誼←븦釉띾㎣沃 111010011001011010110001111010111010010010111111111011011000100010010101111011111010011110101111111000101000011010010001111000111000000110011100111001101000000110000010111000101001000110111000111010111001100110010001111010011011110110101100111011001001111010010101111011011000000010100011111010001010101010111100111000101000011010010000111010111011100010100110111010011000011110001001111010111001110110111110111000111000111010100011111001101011001010000011 e996b1eba4bfed8895efa7afe28691e3819ce68182e291b8eb9991e9bdacec9e95ed80a3e8aabce28690ebb8a6e98789eb9dbee38ea3e6b283
UHC 閱뤿툕璘↑ぜ恂⑸뙑齬잕퀣誼←븦釉띾㎣沃 1110011011110011100011111110101110111000100011001110110011011110101000011110100010101010101111001110001011100001101010011110101110001100100101101110010111100001100111111110101010110011100101111110101111111110101000011110011110010101100011111110101110111000100011011110101110100111101001111110100010101010 e6f38febb88cecdea1e8aabce2e1a9eb8c96e5e19feab397ebfea1e7958febb88deba7a7e8aa

SJIS-Win, EUC-JP: classic charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
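
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names (`latin_1`, `cp932`, `euc_jp`, `utf_8`, `cp949`) are Python's standard codecs, chosen here on the assumption that they correspond to the charsets listed above (UHC taken to be CP949). The helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is assumed to match the `3f` bytes seen in the table.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: these standard codecs match the page's charsets.
CHARSETS = {
    "ISO-8859-1": "latin_1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes '?' for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexadecimal) in encode_table("ぜ").items():
        print(label, bits, hexadecimal)
```

For the single character `ぜ`, this yields `82ba` in SJIS-Win, `a4bc` in EUC-JP, and `e3819c` in UTF-8, the same byte sequences that appear for that character in the rows above.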