What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 鋏・岦凡v鋏・岦凡vB 1110011111110101101001011111101010101100100101100111110101110110111001111111010110100101111110101010110010010110011111010111011001000010 e7f5a5faac967d76e7f5a5faac967d7642
EUC-JP 鋏・岦凡v鋏・岦凡vB 111011101111011110001110101001011000111110111011101100111100101111011110011101101110111011110111100011101010010110001111101110111011001111001011110111100111011001000010 eef78ea58fbbb3cbde76eef78ea58fbbb3cbde7642
UTF-8 鋏・岦凡v鋏・岦凡vB 111010011000101110001111111011111011110110100101111001011011001010100110111001011000011110100001011101101110100110001011100011111110111110111101101001011110010110110010101001101110010110000111101000010111011001000010 e98b8fefbda5e5b2a6e587a176e98b8fefbda5e5b2a6e587a17642
UHC 鋏??凡v鋏??凡vB 111110101111100100111111001111111101101111101101011101101111101011111001001111110011111111011011111011010111011001000010 faf93f3fdbed76faf93f3fdbed7642

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
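
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `encode_table` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, so some byte sequences may differ from the table for characters that the codecs treat differently. Unencodable characters are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    The codec names are assumed equivalents of the charsets in the table.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset cannot represent
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    # First half of the table's input string, written as escapes, followed by "B"
    sample = "\u92cf\uff65\u5ca6\u51e1vB"
    for name, bits, hex_string in encode_table(sample):
        print(name, bits, hex_string)
```

Each row holds the same three values as the table's Charset, binary, and hexadecimal columns, so the output can be compared against the results above one line at a time.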