To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 髴域≒蝨キ髴域≒蝨某 11101001100111001000100011100110100000011110000011100101100111001011011111101001100111001000100011100110100000011110000011100101100111001001011001011110 e99c88e681e0e59cb7e99c88e681e0e59c965e
EUC-JP 髴域≒蝨キ髴域≒蝨某 1111000111111100101100001110100010100010111000101110100111111100100011101011011111110001111111001011000011101000101000101110001011101001111111001100101110111111 f1fcb0e8a2e2e9fc8eb7f1fcb0e8a2e2e9fccbbf
UTF-8 髴域≒蝨キ髴域≒蝨某 111010011010101110110100111001011001111110011111111000101000100110010010111010001001110110101000111011111011110110110111111010011010101110110100111001011001111110011111111000101000100110010010111010001001110110101000111001101001111110010000 e9abb4e59f9fe28992e89da8efbdb7e9abb4e59f9fe28992e89da8e69f90
UHC ?域≒蝨??域≒蝨某 0011111111100110101101001010000111010110111000111010010000111111001111111110011010110100101000011101011011100011101001001101100110111011 3fe6b4a1d6e3a43f3fe6b4a1d6e3a4d9bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)