To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 潁〓?烏g??j????厭??矣????。 1001111111110001100000011010110000111111100010010100011110000010100001110011111100111111100000101000101000111111001111110011111100111111100010010111110100111111001111111110000111100001001111110011111100111111001111111000000101000010 9ff181ac3f894782873f3f828a3f3f3f3f897d3f3fe1e13f3f3f3f8142
EUC-JP 潁〓?烏g??j????厭??矣????。 1101111011110011101000101010111000111111101100011010100010100011111001110011111100111111101000111110101000111111001111110011111100111111101100011101111000111111001111111110001011100011001111110011111100111111001111111010000110100011 def3a2ae3fb1a8a3e73f3fa3ea3f3f3f3fb1de3f3fe2e33f3f3f3fa1a3
UTF-8 潁〓젙烏g뎡溜j퀓溜뽯젙厭묒뼍矣깍쭫溜계。 111001101011110110000001111000111000000010010011111011001010000010011001111001111000001110001111111011111011110110000111111010111000111010100001111011111010011110001011111011111011110110001010111011011000000010010011111011111010011110001011111010111011110110101111111011001010000010011001111001011000111010101101111010111010110010010010111010111011110010001101111001111001111110100011111010101011100110001101111011001010110110101011111011111010011110001011111010101011001110000100111000111000000010000010 e6bd81e38093eca099e7838fefbd87eb8ea1efa78befbd8aed8093efa78bebbdafeca099e58eadebac92ebbc8de79fa3eab98decadabefa78beab384e38082
UHC 潁〓젙烏g뎡溜j퀓溜뽯젙厭묒뼍矣깍쭫溜계。 111001111011100010100001111010111010000010010101111010001010000110100011111001111011010110110010111010101111111010100011111010101011001110001000111010101111111010010110111010111010000010010101111001101111010010010001111011001001011010010101111010111111100010110001111011111010011110011111111010101111111010110000111010001010000110100011 e7b8a1eba095e8a1a3e7b5b2eafea3eab388eafe96eba095e6f491ec9695ebf8b1efa79feafeb0e8a1a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)