Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 æCŠŒ‰A}æCŠŒ‰A{^ 111001100100001110001010100011001000100101000001011111011110011001000011100010101000110010001001010000010111101101011110 e6438a8c89417de6438a8c89417b5e
SJIS-WIN ?C???A}?C???A{^ 001111110100001100111111001111110011111101000001011111010011111101000011001111110011111100111111010000010111101101011110 3f433f3f3f417d3f433f3f3f417b5e
EUC-JP æC???A}æC???A{^ 10001111101010011100000101000011001111110011111100111111010000010111110110001111101010011100000101000011001111110011111100111111010000010111101101011110 8fa9c1433f3f3f417d8fa9c1433f3f3f417b5e
UTF-8 æCŠŒ‰A}æCŠŒ‰A{^ 1100001110100110010000111100001010001010110000101000110011000010100010010100000101111101110000111010011001000011110000101000101011000010100011001100001010001001010000010111101101011110 c3a643c28ac28cc289417dc3a643c28ac28cc289417b5e
UHC æC???A}æC???A{^ 1010100110100001010000110011111100111111001111110100000101111101101010011010000101000011001111110011111100111111010000010111101101011110 a9a1433f3f3f417da9a1433f3f3f417b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
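
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to match what the table shows; other bytes may differ from the table where Python's mapping tables differ from the tool's.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("æ").items():
        print(label, bits, hex_string)
```

For instance, `encode_row("æ", "utf-8")` yields the two-byte sequence c3a6, while `encode_row("æ", "iso-8859-1")` yields the single byte e6, in line with the first bytes of the corresponding table rows.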