To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 Šà‰yon}Šà‰yon{^ 100010101110000010001001011110010110111101101110011111011000101011100000100010010111100101101111011011100111101101011110 8ae089796f6e7d8ae089796f6e7b5e
SJIS-WIN ???yon}???yon{^ 001111110011111100111111011110010110111101101110011111010011111100111111001111110111100101101111011011100111101101011110 3f3f3f796f6e7d3f3f3f796f6e7b5e
EUC-JP ?à?yon}?à?yon{^ 00111111100011111010101110100010001111110111100101101111011011100111110100111111100011111010101110100010001111110111100101101111011011100111101101011110 3f8faba23f796f6e7d3f8faba23f796f6e7b5e
UTF-8 Šà‰yon}Šà‰yon{^ 110000101000101011000011101000001100001010001001011110010110111101101110011111011100001010001010110000111010000011000010100010010111100101101111011011100111101101011110 c28ac3a0c289796f6e7dc28ac3a0c289796f6e7b5e
UHC ???yon}???yon{^ 001111110011111100111111011110010110111101101110011111010011111100111111001111110111100101101111011011100111101101011110 3f3f3f796f6e7d3f3f3f796f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)