What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e
SJIS-WIN | 聿主害聿主害^ | 11100011111001001000111011100101100010100101000111100011111001001000111011100101100010100101000101011110 | e3e48ee58a51e3e48ee58a515e
EUC-JP | 聿主害聿主害^ | 11100110111001101011110011100111101100111011001011100110111001101011110011100111101100111011001001011110 | e6e6bce7b3b2e6e6bce7b3b25e
UTF-8 | 聿主害聿主害^ | 11101000100000011011111111100100101110001011101111100101101011101011001111101000100000011011111111100100101110001011101111100101101011101011001101011110 | e881bfe4b8bbe5aeb3e881bfe4b8bbe5aeb35e
UHC | 聿主害聿主害^ | 11101011110100111111000110101011111110101010101011101011110100111111000110101011111110101010101001011110 | ebd3f1abfaaaebd3f1abfaaa5e

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
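
The conversion shown in the table can be reproduced offline. The following is a minimal Python sketch, not this page's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the use of `errors="replace"` to mimic the `?` substitution seen in the ISO-8859-1 row. The helper names `encode_row` and `encode_table` are hypothetical.

```python
# Assumed mapping from the charset labels above to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string)."""
    # Characters the codec cannot represent are replaced with '?',
    # which is assumed to match the behaviour shown for ISO-8859-1.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("聿主害聿主害^").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.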