To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 賊?繒魄賊?繒白^ 100100011010111100111111111110111000111111101001101011101001000110101111001111111111101110001111100101001001001001011110 91af3ffb8fe9ae91af3ffb8f94925e
EUC-JP 賊?繒魄賊?繒白^ 1100001010110001001111111000111111010100110101001111001010110000110000101011000100111111100011111101010011010100110001111111001001011110 c2b13f8fd4d4f2b0c2b13f8fd4d4c7f25e
UTF-8 賊렍繒魄賊렍繒白^ 11101000101100111000101011101011101000001000110111100111101110011001001011101001101011011000010011101000101100111000101011101011101000001000110111100111101110011001001011100111100110011011110101011110 e8b38aeba08de7b992e9ad84e8b38aeba08de7b992e799bd5e
UHC 賊렍繒魄賊렍繒白^ 1110111011100100100011101010001111110001111110011101101111011110111011101110010010001110101000111111000111111001110110111101110001011110 eee48ea3f1f9dbdeeee48ea3f1f9dbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)