To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 弔?衣釜弔?衣釜^ 100100101010001000111111100010001101111110001010100110001001001010100010001111111000100011011111100010101001100001011110 92a23f88df8a9892a23f88df8a985e
EUC-JP 弔?衣釜弔?衣釜^ 110001001010010000111111101100001110000110110011111110001100010010100100001111111011000011100001101100111111100001011110 c4a43fb0e1b3f8c4a43fb0e1b3f85e
UTF-8 弔렲衣釜弔렲衣釜^ 11100101101111001001010011101011101000001011001011101000101000011010001111101001100001111001110011100101101111001001010011101011101000001011001011101000101000011010001111101001100001111001110001011110 e5bc94eba0b2e8a1a3e9879ce5bc94eba0b2e8a1a3e9879c5e
UHC 弔렲衣釜弔렲衣釜^ 1111000011000000100011101011111111101011111111011101110110111100111100001100000010001110101111111110101111111101110111011011110001011110 f0c08ebfebfdddbcf0c08ebfebfdddbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)