What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
--- | --- | --- | ---
ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e
SJIS-WIN | 陷逸庁陷逸庁^ | 11101000100111001000100011101101100100101010000111101000100111001000100011101101100100101010000101011110 | e89c88ed92a1e89c88ed92a15e
EUC-JP | 陷逸庁陷逸庁^ | 11101111111111001011000011101111110001001010001111101111111111001011000011101111110001001010001101011110 | effcb0efc4a3effcb0efc4a35e
UTF-8 | 陷逸庁陷逸庁^ | 11101001100110011011011111101001100000001011100011100101101110101000000111101001100110011011011111101001100000001011100011100101101110101000000101011110 | e999b7e980b8e5ba81e999b7e980b8e5ba815e
UHC | 陷逸?陷逸?^ | 1111100111101000111011001110111100111111111110011110100011101100111011110011111101011110 | f9e8ecef3ff9e8ecef3f5e
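
A table like the one above can be approximated locally. The following is a minimal Python sketch, not the code behind this page. The `CODECS` mapping from the page's charset labels to Python codec names and the helper `to_bits` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?`, which appears to match the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the charset labels shown on the page
# to Python codec names (an assumption, not taken from the page itself).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "陷逸庁陷逸庁^"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.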

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).