To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 蠢????砒韻?△ 11100101101111110011111100111111001111110011111111100001111001011000100101000011001111111000000110100010 e5bf3f3f3f3fe1e589433f81a2
EUC-JP 蠢????砒韻?△ 11101010110000010011111100111111001111110011111111100010111001111011000110100100001111111010001010100100 eac13f3f3f3fe2e7b1a43fa2a4
UTF-8 蠢렋履머면砒韻펙△ 111010001010000010100010111010111010000010001011111011111010011110011111111010111010100010111000111010111010100110110100111001111010000010010010111010011001111110111011111011011000111010011001111000101001011010110011 e8a0a2eba08befa79feba8b8eba9b4e7a092e99fbbed8e99e296b3
UHC 蠢렋履머면砒韻펙△ 111100011110001110001110101000101110110010101010101110001101001110111000111010011101110111110111111010101010010011000110111001011010000111100010 f1e38ea2ecaab8d3b8e9ddf7eaa4c6e5a1e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)