What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 蜊ィ闖ス闖ス逋 | 1110010110001101101010001110100010001111101111011110100010001111101111011110011110011001 | e58da8e88fbde88fbde799
EUC-JP | 蜊ィ闖ス闖ス逋 | 1110100111101101100011101010100011101111111011111000111010111101111011111110111110001110101111011110110111111001 | e9ed8ea8efef8ebdefef8ebdedf9
UTF-8 | 蜊ィ闖ス闖ス逋 | 111010001001110010001010111011111011110110101000111010011001011110010110111011111011110110111101111010011001011110010110111011111011110110111101111010011000000010001011 | e89c8aefbda8e99796efbdbde99796efbdbde9808b
UHC | ??闖?闖?逋 | 00111111001111111111011111100110001111111111011111100110001111111111100011100111 | 3f3ff7e63ff7e63ff8e7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
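
A conversion of this kind can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. It assumes that Python's standard codecs `cp932`, `euc_jp`, and `cp949` correspond to SJIS-WIN, EUC-JP, and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The name `encode_table` is hypothetical, and Python's mapping tables may differ from the converter's for some characters.

```python
# Labels follow the table; the codec names on the right are Python's
# standard-library codecs, assumed (not confirmed) to match the tool's.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.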