To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ±å}v±å}vB | 101100011110010101111101011101101011000111100101011111010111011001000010 | b1e57d76b1e57d7642
SJIS-WIN | ±?}v±?}vB | 1000000101111101001111110111110101110110100000010111110100111111011111010111011001000010 | 817d3f7d76817d3f7d7642
EUC-JP | ±å}v±å}vB | 101000011101111010001111101010111010100101111101011101101010000111011110100011111010101110101001011111010111011001000010 | a1de8faba97d76a1de8faba97d7642
UTF-8 | ±å}v±å}vB | 11000010101100011100001110100101011111010111011011000010101100011100001110100101011111010111011001000010 | c2b1c3a57d76c2b1c3a57d7642
UHC | ±?}v±?}vB | 1010000110111110001111110111110101110110101000011011111000111111011111010111011001000010 | a1be3f7d76a1be3f7d7642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
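
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own code: the helper name encode_report is hypothetical, and the mapping from the page's charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?", which appears to match the "?" entries in the SJIS-WIN and UHC rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (an assumption about
    how the page handles them).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "\u00b1\u00e5}v\u00b1\u00e5}vB"  # the string shown in the table
    for label, codec in CODECS.items():
        bits, hex_string = encode_report(sample, codec)
        print(label, bits, hex_string)
```

Each byte is written as exactly eight bits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.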