To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 鏑?笠鏑?洌^ 1001001101001100001111111000101001111101100100110100110000111111100111111010111001011110 934c3f8a7d934c3f9fae5e
EUC-JP 鏑?笠鏑?洌^ 1100010110101101001111111011001111011110110001011010110100111111110111101011000001011110 c5ad3fb3dec5ad3fdeb05e
UTF-8 鏑쮸笠鏑쯤洌^ 11101001100011111001000111101100101011101011100011100111101011001010000011101001100011111001000111101100101011111010010011100110101101001000110001011110 e98f91ecaeb8e7aca0e98f91ecafa4e6b48c5e
UHC 鏑쮸笠鏑쯤洌^ 11101110111010111100001011101001110110001010001011101110111010111100001011101011110101101010110001011110 eeebc2e9d8a2eeebc2ebd6ac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)