To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 «åX¢±«² 10001111101010111110010101011000100011111010001010110001100011111010101110110010 8fabe5588fa2b18fabb2
SJIS-WIN ???X?¢±??? 001111110011111100111111010110000011111110000001100100011000000101111101001111110011111100111111 3f3f3f583f8191817d3f3f3f
EUC-JP ??åX?¢±??? 0011111100111111100011111010101110101001010110000011111110100001111100011010000111011110001111110011111100111111 3f3f8faba9583fa1f1a1de3f3f3f
UTF-8 «åX¢±«² 11000010100011111100001010101011110000111010010101011000110000101000111111000010101000101100001010110001110000101000111111000010101010111100001010110010 c28fc2abc3a558c28fc2a2c2b1c28fc2abc2b2
UHC ???X??±??² 001111110011111100111111010110000011111100111111101000011011111000111111001111111010100111110111 3f3f3f583f3fa1be3f3fa9f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)