To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ° 10110000 b0
SJIS-WIN ° 1000000110001011 818b
EUC-JP ° 1010000111101011 a1eb
UTF-8 ° 1100001010110000 c2b0
UHC ° 1010000111000110 a1c6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)