Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | ??申?鬱耿? | 00111111001111111001000001011100001111111001111101010100111000111101010000111111 | 3f3f905c3f9f54e3d43f
EUC-JP | 玎?申?鬱耿? | 100011111100101111010010001111111011111110111101001111111101110110110101111001101101011000111111 | 8fcbd23fbfbd3fddb5e6d63f
UTF-8 | 玎렓申렑鬱耿렏 | 111001111000111010001110111010111010000010010011111001111001010010110011111010111010000010010001111010011010110010110001111010001000000010111111111010111010000010001111 | e78e8eeba093e794b3eba091e9acb1e880bfeba08f
UHC | 玎렓申렑鬱耿렏 | 1110111111101001100011101010100011100011111010011000111010100110111010101010011011001100111010101000111010100101 | efe98ea8e3e98ea6eaa6ccea8ea5
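
A comparable table can be produced locally. The following is a minimal Python sketch, not the converter's own implementation. The codec names on the right-hand side of `CHARSETS` are assumptions about which Python codecs correspond to the charset labels above, and `encode_report` is a hypothetical helper name. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the table, though a different implementation's mapping tables may differ for individual characters.

```python
# Charset labels from the table, mapped to Python codec names.
# The mapping is an assumption: cp932 for SJIS-WIN, cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {charset label: (binary string, hex string)} for text."""
    report = {}
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (byte 0x3f).
        raw = text.encode(codec, errors="replace")
        # Render every byte as exactly eight binary digits.
        bits = "".join(format(byte, "08b") for byte in raw)
        report[label] = (bits, raw.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_report("申").items():
        print(label, bits, hexstr)
```

For the single character 申, the UTF-8 entry comes out as e794b3 and the ISO-8859-1 entry as 3f, in line with the corresponding bytes in the table.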

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).