What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 跼ァ遽梧遽型 11100110111100001010011111100111101011111000110011100110111100001010011111100111101011111000110001011110 e6f0a7e7af8ce6f0a7e7af8c5e
EUC-JP 跼ァ遽梧?遽型 11101100111100101000111010100111111011101011000110111000111010000011111111101110101100011011011110111111 ecf28ea7eeb1b8e83feeb1b7bf
UTF-8 跼ァ遽梧遽型 111010001011011110111100111011111011110110100111111010011000000110111101111001101010001010100111111011101000000110100110111010011000000110111101111001011001111010001011 e8b7bcefbda7e981bde6a2a7ee81a6e981bde59e8b
UHC ??遽梧?遽型 0011111100111111110010111110100011100111111111000011111111001011111010001111101011111110 3f3fcbe8e7fc3fcbe8fafe

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
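
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and Python's codecs may differ from this page's converter for some characters. The helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is the behavior visible in the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's labels to Python codec names.
# These are Python's nearest equivalents, not necessarily what the page uses.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("型").items():
        print(label, bits, hexstr)
```

For the single character 型, the UTF-8 result is e59e8b, which matches the last three bytes of the UTF-8 row in the table, and the ISO-8859-1 result is 3f because Latin-1 has no such character.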