To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 髣比シ∝宸鬮ョ 111010011001011110010100111001001011110010000001111001011001101110000010111010011010101110101110 e99794e4bc81e59b82e9abae
EUC-JP 髣比シ∝宸鬮ョ 1111000111110111110010001110011010001110101111001010001011100111110101011110001011110010101011011000111010101110 f1f7c8e68ebca2e7d5e2f2ad8eae
UTF-8 髣比シ∝宸鬮ョ 111010011010101110100011111001101010111110010100111011111011110110111100111000101000100010011101111001011010111010111000111010011010110010101110111011111011110110101110 e9aba3e6af94efbdbce2889de5aeb8e9acaeefbdae
UHC ?比?∝宸?? 00111111110111011110111100111111101000011111000011100011111001000011111100111111 3fddef3fa1f0e3e43f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)