To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????^ 0011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f5e
SJIS-WIN 諸オ陬晟嵯陲ョ^ 11111011101010011011010111101000101000111001110111101110100011011011010111101000101000101010111001011110 fba9b5e8a39dee8db5e8a2ae5e
EUC-JP ?オ陬晟嵯陲ョ^ 0011111110001110101101011111000010100101110110101111000010111010101101111111000010100100100011101010111001011110 3f8eb5f0a5daf0bab7f0a48eae5e
UTF-8 諸オ陬晟嵯陲ョ^ 11101111101010001010001011101111101111011011010111101001100110011010110011100110100110011001111111100101101101011010111111101001100110011011001011101111101111011010111001011110 efa8a2efbdb5e999ace6999fe5b5afe999b2efbdae5e
UHC ???晟嵯??^ 00111111001111110011111111100000111110011111001110101011001111110011111101011110 3f3f3fe0f9f3ab3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)