To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????m????mB 0011111100111111001111110011111101101101001111110011111100111111001111110110110101000010 3f3f3f3f6d3f3f3f3f6d42
SJIS-WIN 剪???m剪???mB 10011001100100100011111100111111001111110110110110011001100100100011111100111111001111110110110101000010 99923f3f3f6d99923f3f3f6d42
EUC-JP 剪???m剪???mB 11010001111100100011111100111111001111110110110111010001111100100011111100111111001111110110110101000010 d1f23f3f3f6dd1f23f3f3f6d42
UTF-8 剪띳렰렜m剪띳렰렜mB 111001011000100110101010111010111001110110110011111010111010000010110000111010111010000010011100011011011110010110001001101010101110101110011101101100111110101110100000101100001110101110100000100111000110110101000010 e589aaeb9db3eba0b0eba09c6de589aaeb9db3eba0b0eba09c6d42
UHC 剪띳렰렜m剪띳렰렜mB 11101110111100101011011011110001100011101011110110001110101011100110110111101110111100101011011011110001100011101011110110001110101011100110110101000010 eef2b6f18ebd8eae6deef2b6f18ebd8eae6d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)