To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 耳鶯シ鈬ⅳ」 111100001010010110001110101010001110100111110010101111001110011111100011111110100100001110100011 f0a58ea8e9f2bce7e3fa43a3
EUC-JP ?耳鶯シ鈬?」 001111111011110010101010111100101111010010001110101111001110111011100101001111111000111010100011 3fbcaaf2f48ebceee53f8ea3
UTF-8 耳鶯シ鈬ⅳ」 111011101000000110100100111010001000000010110011111010011011011010101111111011111011110110111100111010011000100010101100111000101000010110110011111011111011110110100011 ee81a4e880b3e9b6afefbdbce988ace285b3efbda3
UHC ?耳鶯??ⅳ? 00111111111011001011110011100101101000110011111100111111101001011010010000111111 3fecbce5a33f3fa5a43f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)