What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 髦サ譫晞仆闢喊 11101001100110001011101111100110100111101001110111101001100110001011101111101000100100111001101001011110 e998bbe69e9de998bbe8939a5e
EUC-JP 髦サ譫晞仆闢喊 1111000111111000100011101011101111101011111111101101101011101011110100001011110111101111111100111101001110111111 f1f88ebbebfedaebd0bdeff3d3bf
UTF-8 髦サ譫晞仆闢喊 111010011010101110100110111011111011110110111011111010001010110110101011111001101001100110011110111001001011101110000110111010011001011110100010111001011001011010001010 e9aba6efbdbbe8adabe6999ee4bb86e997a2e5968a
UHC ???晞?闢喊 00111111001111110011111111111101111101010011111111011100101000111111100111100010 3f3f3ffdf53fdca3f9e2
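
A table like the one above can be approximated offline with a few lines of Python. This is a minimal sketch, not the tool's actual implementation: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes visible in the ISO-8859-1 and UHC rows, but edge cases may differ from the tool.

```python
# Assumed mapping from the table's labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unmappable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("あ").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.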

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).