To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 šeˆÚn}šeˆÚn{^ 10011010011001011000100011011010011011100111110110011010011001011000100011011010011011100111101101011110 9a6588da6e7d9a6588da6e7b5e
SJIS-WIN ?e??n}?e??n{^ 00111111011001010011111100111111011011100111110100111111011001010011111100111111011011100111101101011110 3f653f3f6e7d3f653f3f6e7b5e
EUC-JP ?e?Ún}?e?Ún{^ 0011111101100101001111111000111110101010111000100110111001111101001111110110010100111111100011111010101011100010011011100111101101011110 3f653f8faae26e7d3f653f8faae26e7b5e
UTF-8 šeˆÚn}šeˆÚn{^ 11000010100110100110010111000010100010001100001110011010011011100111110111000010100110100110010111000010100010001100001110011010011011100111101101011110 c29a65c288c39a6e7dc29a65c288c39a6e7b5e
UHC ?e??n}?e??n{^ 00111111011001010011111100111111011011100111110100111111011001010011111100111111011011100111101101011110 3f653f3f6e7d3f653f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)