To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 蠏灘、ア蟾仙廠 111001011011010110010011111001011010010010110001111001011011011110010000111001011000111110110001 e5b593e5a4b1e5b790e58fb1
EUC-JP 蠏灘、ア蟾仙廠 1110101010110111110001101110011110001110101001001000111010110001111010101011100111000000111001111011111010110011 eab7c6e78ea48eb1eab9c0e7beb3
UTF-8 蠏灘、ア蟾仙廠 111010001010000010001111111001111000000110011000111011111011110110100100111011111011110110110001111010001001111110111110111001001011101110011001111001011011101110100000 e8a08fe78198efbda4efbdb1e89fbee4bb99e5bba0
UHC ?灘??蟾仙廠 0011111111110111101010000011111100111111111000001110101011100000101110011111001111011111 3ff7a83f3fe0eae0b9f3df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)