To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 魍域錐譌ヲ驕廬 11101001101100011000100011100110100100001000110111100110100101111010011011101001100000011001110001001001 e9b188e6908de697a6e9819c49
EUC-JP 魍域錐譌ヲ驕廬 1111001010110011101100001110100010111111111011011110101111110111100011101010011011110001111000011101011110101010 f2b3b0e8bfedebf78ea6f1e1d7aa
UTF-8 魍域錐譌ヲ驕廬 111010011010110110001101111001011001111110011111111010011000110010010000111010001010110110001100111011111011110110100110111010011010100110010101111001011011101110101100 e9ad8de59f9fe98c90e8ad8cefbda6e9a995e5bbac
UHC ?域錐??驕廬 0011111111100110101101001111010111011110001111110011111111001110111101101101010111100110 3fe6b4f5de3f3fcef6d5e6

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
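
The conversion shown in the table can be approximated locally. The following is a minimal sketch in Python, not the code behind this page. The codec names in `CODECS` are an assumed mapping from the table's labels to Python's built-in codecs, and `encode_row` is a hypothetical helper. Characters that a charset cannot represent are replaced with `?` (byte `3f`), as in the ISO-8859-1 row above. Results for individual characters may differ from the page's output where the implementations disagree.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

Using `errors="replace"` keeps the loop running for every charset, so a row is produced even when the input has no representation in that charset.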