To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???v???vB 001111110011111100111111011101100011111100111111001111110111011001000010 3f3f3f763f3f3f7642
SJIS-WIN 軟??v軟??vB 1001001111101110001111110011111101110110100100111110111000111111001111110111011001000010 93ee3f3f7693ee3f3f7642
EUC-JP 軟??v軟??vB 1100011011110000001111110011111101110110110001101111000000111111001111110111011001000010 c6f03f3f76c6f03f3f7642
UTF-8 軟㏝갈v軟㏝갈vB 111010001011101110011111111000111000111110011101111010101011000010001000011101101110100010111011100111111110001110001111100111011110101010110000100010000111011001000010 e8bb9fe38f9deab08876e8bb9fe38f9deab0887642
UHC 軟㏝갈v軟㏝갈vB 111001101110001110100111111010011011000010100101011101101110011011100011101001111110100110110000101001010111011001000010 e6e3a7e9b0a576e6e3a7e9b0a57642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)