To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????C?????????C^ 001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110100001101011110 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f435e
SJIS-WIN 掩??揖?????C掩??揖?????C^ 10001001100001100011111100111111100101110100101100111111001111110011111100111111001111110100001110001001100001100011111100111111100101110100101100111111001111110011111100111111001111110100001101011110 89863f3f974b3f3f3f3f3f4389863f3f974b3f3f3f3f3f435e
EUC-JP 掩??揖?????C掩??揖?????C^ 10110001111001100011111100111111110011011010110000111111001111110011111100111111001111110100001110110001111001100011111100111111110011011010110000111111001111110011111100111111001111110100001101011110 b1e63f3fcdac3f3f3f3f3f43b1e63f3fcdac3f3f3f3f3f435e
UTF-8 掩뽰룊揖욘를琉꾩뒴C掩뽰룊揖욘를琉꾩뒴C^ 111001101000111010101001111010111011110110110000111010111010001110001010111001101000111110010110111011001001101010011000111010111010010110111100111011111010011110001100111010101011111010101001111010111001001010110100010000111110011010001110101010011110101110111101101100001110101110100011100010101110011010001111100101101110110010011010100110001110101110100101101111001110111110100111100011001110101010111110101010011110101110010010101101000100001101011110 e68ea9ebbdb0eba38ae68f96ec9a98eba5bcefa78ceabea9eb92b443e68ea9ebbdb0eba38ae68f96ec9a98eba5bcefa78ceabea9eb92b4435e
UHC 掩뽰룊揖욘를琉꾩뒴C掩뽰룊揖욘를琉꾩뒴C^ 111001011111001110010110111011001000111110001001111010111110011110111111111001101011100010100110111010111010010010000100111011001000101010101101010000111110010111110011100101101110110010001111100010011110101111100111101111111110011010111000101001101110101110100100100001001110110010001010101011010100001101011110 e5f396ec8f89ebe7bfe6b8a6eba484ec8aad43e5f396ec8f89ebe7bfe6b8a6eba484ec8aad435e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)