To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 頷「蜷晉竇鬩包ス 1110100011110101101000101110010110010000100111011110011111100010100001011110100110101001100101011110111110111101 e8f5a2e5909de7e285e9a995efbd
EUC-JP 頷「蜷晉竇鬩包ス 11110000111101111000111010100010111010011111000011011010111010011110001111100101111100101010101111001010111100011000111010111101 f0f78ea2e9f0dae9e3e5f2abcaf18ebd
UTF-8 頷「蜷晉竇鬩包ス 111010011010000010110111111011111011110110100010111010001001110010110111111001101001100110001001111001111010101110000111111010011010110010101001111001011000110010000101111011111011110110111101 e9a0b7efbda2e89cb7e69989e7ab87e9aca9e58c85efbdbd
UHC ???晉竇?包? 0011111100111111001111111111001011001011110101001110010000111111111110001101000000111111 3f3f3ff2cbd4e43ff8d03f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)