To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 魴ー訷ゐ字魴ー 111010011011010110110000111110111010010010000010111011101000111010011010111010011011010110110000 e9b5b0fba482ee8e9ae9b5b0
EUC-JP 魴ー訷ゐ字魴ー 111100101011011110001110101100001000111111011101110101001010010011110000101110111111101011110010101101111000111010110000 f2b78eb08fddd4a4f0bbfaf2b78eb0
UTF-8 魴ー訷ゐ字魴ー 111010011010110110110100111011111011110110110000111010001010100010110111111000111000001010010000111001011010110110010111111010011010110110110100111011111011110110110000 e9adb4efbdb0e8a8b7e38290e5ad97e9adb4efbdb0
UHC ???ゐ字?? 001111110011111100111111101010101111000011101101101011100011111100111111 3f3f3faaf0edae3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)