To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???}v???}vB 0011111100111111001111110111110101110110001111110011111100111111011111010111011001000010 3f3f3f7d763f3f3f7d7642
SJIS-WIN ?絲?}v?絲?}vB 00111111111000110100111000111111011111010111011000111111111000110100111000111111011111010111011001000010 3fe34e3f7d763fe34e3f7d7642
EUC-JP ?絲?}v?絲?}vB 00111111111001011010111100111111011111010111011000111111111001011010111100111111011111010111011001000010 3fe5af3f7d763fe5af3f7d7642
UTF-8 綎絲짚}v綎絲짚}vB 1110011110110110100011101110011110110101101100101110110010100111100110100111110101110110111001111011011010001110111001111011010110110010111011001010011110011010011111010111011001000010 e7b68ee7b5b2eca79a7d76e7b68ee7b5b2eca79a7d7642
UHC 綎絲짚}v綎絲짚}vB 1110111111110010110111101110101011000010101001000111110101110110111011111111001011011110111010101100001010100100011111010111011001000010 eff2deeac2a47d76eff2deeac2a47d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)