What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????衛?????^ 001111110011111100111111001111111000100101110001001111110011111100111111001111110011111101011110 3f3f3f3f89713f3f3f3f3f5e
EUC-JP ????衛????蔿^ 0011111100111111001111110011111110110001110100100011111100111111001111110011111110001111110110011100000001011110 3f3f3f3fb1d23f3f3f3f8fd9c05e
UTF-8 룶괄룶괌衛룶괄룶괌蔿^ 11101011101000111011011011101010101101001000010011101011101000111011011011101010101101001000110011101000101000011001101111101011101000111011011011101010101101001000010011101011101000111011011011101010101101001000110011101000100101001011111101011110 eba3b6eab484eba3b6eab48ce8a19beba3b6eab484eba3b6eab48ce894bf5e
UHC 룶괄룶괌衛룶괄룶괌蔿^ 100011111010101110110000111111011000111110101011101100011010000111101010110110111000111110101011101100001111110110001111101010111011000110100001111010101101100101011110 8fabb0fd8fabb1a1eadb8fabb0fd8fabb1a1ead95e
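
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's own implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_to_bits` are assumptions made for illustration, and Python's codec tables may differ from the tool's in edge cases. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the behavior visible in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "\u885b^"  # the character U+885B followed by "^"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.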

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).