To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 趙シ魏・v趙シ魏・vB 111001101110001010111100111010011011000010100101011101101110011011100010101111001110100110110000101001010111011001000010 e6e2bce9b0a576e6e2bce9b0a57642
EUC-JP 趙シ魏・v趙シ魏・vB 11101100111001001000111010111100111100101011001010001110101001010111011011101100111001001000111010111100111100101011001010001110101001010111011001000010 ece48ebcf2b28ea576ece48ebcf2b28ea57642
UTF-8 趙シ魏・v趙シ魏・vB 111010001011011010011001111011111011110110111100111010011010110110001111111011111011110110100101011101101110100010110110100110011110111110111101101111001110100110101101100011111110111110111101101001010111011001000010 e8b699efbdbce9ad8fefbda576e8b699efbdbce9ad8fefbda57642
UHC 趙?魏?v趙?魏?vB 111100001110000100111111111010101110000000111111011101101111000011100001001111111110101011100000001111110111011001000010 f0e13feae03f76f0e13feae03f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)