To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 俑▼?俑▼?^ 1001100011011010100000011010010100111111100110001101101010000001101001010011111101011110 98da81a53f98da81a53f5e
EUC-JP 俑▼?俑▼?^ 1101000011011100101000101010011100111111110100001101110010100010101001110011111101011110 d0dca2a73fd0dca2a73f5e
UTF-8 俑▼뇚俑▼뇚^ 11100100101111111001000111100010100101101011110011101011100001111001101011100100101111111001000111100010100101101011110011101011100001111001101001011110 e4bf91e296bceb879ae4bf91e296bceb879a5e
UHC 俑▼뇚俑▼뇚^ 11101001101101011010000111100101100001111000010111101001101101011010000111100101100001111000010101011110 e9b5a1e58785e9b5a1e587855e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)