To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????^????^B 0011111100111111001111110011111101011110001111110011111100111111001111110101111001000010 3f3f3f3f5e3f3f3f3f5e42
SJIS-WIN 疋嶝??^疋嶝??^B 100101010100010010011011110100010011111100111111010111101001010101000100100110111101000100111111001111110101111001000010 95449bd13f3f5e95449bd13f3f5e42
EUC-JP 疋嶝??^疋嶝??^B 110010011010010111010110110100110011111100111111010111101100100110100101110101101101001100111111001111110101111001000010 c9a5d6d33f3f5ec9a5d6d33f3f5e42
UTF-8 疋嶝렰렪^疋嶝렰렪^B 111001111001011010001011111001011011011010011101111010111010000010110000111010111010000010101010010111101110011110010110100010111110010110110110100111011110101110100000101100001110101110100000101010100101111001000010 e7968be5b69deba0b0eba0aa5ee7968be5b69deba0b0eba0aa5e42
UHC 疋嶝렰렪^疋嶝렰렪^B 11111001101101011101010011110001100011101011110110001110101110000101111011111001101101011101010011110001100011101011110110001110101110000101111001000010 f9b5d4f18ebd8eb85ef9b5d4f18ebd8eb85e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)