Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | 潁??衣??源??筌 | 1001111111110001001111110011111110001000110111110011111100111111100011001011100100111111001111111110001010100011 | 9ff13f3f88df3f3f8cb93f3fe2a3
EUC-JP | 潁??衣??源??筌 | 1101111011110011001111110011111110110000111000010011111100111111101110001011101100111111001111111110010010100101 | def33f3fb0e13f3fb8bb3f3fe4a5
UTF-8 | 潁딉퐠衣쏙쭓源놁솇筌 | 111001101011110110000001111010111001010010001001111011011001000010100000111010001010000110100011111011001000111110011001111011001010110110010011111001101011101010010000111010111000011010000001111011001000011010000111111001111010110110001100 | e6bd81eb9489ed90a0e8a1a3ec8f99ecad93e6ba90eb8681ec8687e7ad8c
UHC | 潁딉퐠衣쏙쭓源놁솇筌 | 1110011110111000100010101110111110111101100010011110101111111101101111011110111110100111100010111110101010111001100001101110110010011001100010111110111110100111 | e7b88aefbd89ebfdbdefa78beab986ec998befa7

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
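
A conversion of this kind can be approximated in a few lines of Python. This is only a sketch, not the code behind this page. The mapping from charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the `3f` runs in the table.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what actually survived the round trip.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print only the ASCII-safe columns so the output works on any console.
    for label, (_shown, bits, hex_string) in convert("A").items():
        print(label, bits, hex_string)
```

The exact bytes for characters outside a charset's repertoire depend on the converter's replacement policy, so results from another tool may differ from this sketch in those positions.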