What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???A}v???A}vB 00111111001111110011111101000001011111010111011000111111001111110011111101000001011111010111011001000010 3f3f3f417d763f3f3f417d7642
SJIS-WIN ???A}v???A}vB 00111111001111110011111101000001011111010111011000111111001111110011111101000001011111010111011001000010 3f3f3f417d763f3f3f417d7642
EUC-JP ?莘?A}v?莘?A}vB 0011111110001111110110001010101100111111010000010111110101110110001111111000111111011000101010110011111101000001011111010111011001000010 3f8fd8ab3f417d763f8fd8ab3f417d7642
UTF-8 얇莘렢A}v얇莘렢A}vB 11101100100101101000011111101000100011101001100011101011101000001010001001000001011111010111011011101100100101101000011111101000100011101001100011101011101000001010001001000001011111010111011001000010 ec9687e88e98eba0a2417d76ec9687e88e98eba0a2417d7642
UHC 얇莘렢A}v얇莘렢A}vB 10111110111000111110001111101110100011101011001101000001011111010111011010111110111000111110001111101110100011101011001101000001011111010111011001000010 bee3e3ee8eb3417d76bee3e3ee8eb3417d7642

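SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (3f) in the table marks a character that the target character set cannot represent.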
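
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumed to be Python's closest equivalents of the charsets listed above, and the helper names to_bits, encode_row and encode_table are made up for illustration. Characters that a charset cannot represent are replaced with "?" (3f), as in the table. Mapping tables can differ slightly between implementations, so individual bytes may not match the tool in every case.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str]:
    """Encode text with one codec; unencodable characters become '?'."""
    data = text.encode(codec, errors="replace")
    return to_bits(data), data.hex()


def encode_table(text: str) -> dict[str, tuple[str, str]]:
    """Build one (binary, hexadecimal) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("A}v").items():
        print(label, bits, hex_string)
```

For the ASCII input "A}v" every charset yields the same bytes (417d76), which matches the shared "417d76" run visible in each row of the table.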