To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
EUC-JP 獒????獒????B 100011111100101110111011001111110011111100111111001111111000111111001011101110110011111100111111001111110011111101000010 8fcbbb3f3f3f3f8fcbbb3f3f3f3f42
UTF-8 獒앭날燎굀獒앭날燎굀B 11100111100011011001001011101100100101011010110111101011100000101010000011101111101001111000000011101010101101011000000011100111100011011001001011101100100101011010110111101011100000101010000011101111101001111000000011101010101101011000000001000010 e78d92ec95adeb82a0efa780eab580e78d92ec95adeb82a0efa780eab58042
UHC 獒앭날燎굀獒앭날燎굀B 111010001010001110011101111001011011001110101111111010001111101110000010011010011110100010100011100111011110010110110011101011111110100011111011100000100110100101000010 e8a39de5b3afe8fb8269e8a39de5b3afe8fb826942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)