To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???k}???k{^ 0011111100111111001111110110101101111101001111110011111100111111011010110111101101011110 3f3f3f6b7d3f3f3f6b7b5e
SJIS-WIN 薔??k}薔??k{^ 11100101010010110011111100111111011010110111110111100101010010110011111100111111011010110111101101011110 e54b3f3f6b7de54b3f3f6b7b5e
EUC-JP 薔?祛k}薔?祛k{^ 1110100110101100001111111000111111010000110101110110101101111101111010011010110000111111100011111101000011010111011010110111101101011110 e9ac3f8fd0d76b7de9ac3f8fd0d76b7b5e
UTF-8 薔며祛k}薔며祛k{^ 1110100010010110100101001110101110101001101100001110011110100101100110110110101101111101111010001001011010010100111010111010100110110000111001111010010110011011011010110111101101011110 e89694eba9b0e7a59b6b7de89694eba9b0e7a59b6b7b5e
UHC 薔며祛k}薔며祛k{^ 1110110111111001101110001110011111001011111001000110101101111101111011011111100110111000111001111100101111100100011010110111101101011110 edf9b8e7cbe46b7dedf9b8e7cbe46b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)