Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ?科咐??鴦與 0011111110001001110010001001100111110011001111110011111111101001111100011110010001101111 3f89c899f33f3fe9f1e46f
EUC-JP ?科咐堧?鴦與 00111111101100101100101011010010111101011000111110111000101010000011111111110010111100111110011111010000 3fb2cad2f58fb8a83ff2f3e7d0
UTF-8 룴科咐堧룵鴦與 111010111010001110110100111001111010011110010001111001011001001010010000111001011010000010100111111010111010001110110101111010011011010010100110111010001000100010000111 eba3b4e7a791e59290e5a0a7eba3b5e9b4a6e88887
UHC 룴科咐堧룵鴦與 1000111110101001110011101010000111011100111110111110011011000000100011111010101011100100111011001110011010101000 8fa9cea1dcfbe6c08faae4ece6a8

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
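
The conversion shown in the table can be approximated in a few lines of Python. This is a minimal sketch, not the implementation behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from each charset label to a Python codec name (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the table. Python's codecs may map some characters differently from the converter used here.

```python
# Charset labels follow the table above. The Python codec names on the
# right are assumptions about which codec corresponds to each label.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (characters as stored, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?".
    data = text.encode(codec, errors="replace")
    # Decode again to show what the charset actually kept.
    shown = data.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("科").items():
        print(label, shown, bits, hexstr)
```

Running the script prints one row per charset in the same column order as the table: charset, character, binary bit string, hexadecimal bit string.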