What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霆?芝?依?媚芝?依? 1110100010111011001111111000111011000101001111111000100011001011001111111001101101011010100011101100010100111111100010001100101100111111 e8bb3f8ec53f88cb3f9b5a8ec53f88cb3f
EUC-JP 霆?芝?依?媚芝?依? 1111000010111101001111111011110011000111001111111011000011001101001111111101010110111011101111001100011100111111101100001100110100111111 f0bd3fbcc73fb0cd3fd5bbbcc73fb0cd3f
UTF-8 霆렕芝렫依멸媚芝렫依렛 111010011001110010000110111010111010000010010101111010001000101010011101111010111010000010101011111001001011111010011101111010111010100110111000111001011010101010011010111010001000101010011101111010111010000010101011111001001011111010011101111010111010000010011011 e99c86eba095e88a9deba0abe4be9deba9b8e5aa9ae88a9deba0abe4be9deba09b
UHC 霆렕芝렫依멸媚芝렫依렛 11101111111111011000111010101010111100101011100110001110101110011110101111101110101110001110101011011010101011001111001010111001100011101011100111101011111011101011011110111111 effd8eaaf2b98eb9ebeeb8eadaacf2b98eb9ebeeb7bf

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
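
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the tool's actual implementation. The mapping from the table's charset names to Python codec names (for example SJIS-WIN to "cp932" and UHC to "cp949") is an assumption, as are the helper names encode_row and convert. Characters that a charset cannot represent are replaced by "?" (3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (byte 0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip through this charset.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode the text in every charset and return one row per charset name."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexstr) in convert("あ").items():
        print(name, shown, bits, hexstr)
```

Byte values produced by Python's codecs may differ from the tool's output for some characters, since vendor extensions to Shift_JIS and EUC-JP vary between implementations.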