To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN □?□?碇ビ??舒 1000000110100000001111111000000110100000001111111001001011110100100000110111001000111111001111111001100010101110 81a03f81a03f92f483723f3f98ae
EUC-JP □?□?碇ビ??舒 1010001010100010001111111010001010100010001111111100010011110110101001011101001100111111001111111101000010110000 a2a23fa2a23fc4f6a5d33f3fd0b0
UTF-8 □▩□룫碇ビ룶ㄱ舒 111000101001011010100001111000101001011010101001111000101001011010100001111010111010001110101011111001111010001010000111111000111000001110010011111010111010001110110110111000111000010010110001111010001000100010010010 e296a1e296a9e296a1eba3abe7a287e38393eba3b6e384b1e88892
UHC □▩□룫碇ビ룶ㄱ舒 101000011110000010100010110011001010000111100000100011111010001011101111111011011010101111010011100011111010101110100100101000011110000010100010 a1e0a2cca1e08fa2efedabd38faba4a1e0a2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)