To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 薔?魚???莊 11100101010010110011111110001011100110110011111100111111001111111110010010110101 e54b3f8b9b3f3f3fe4b5
EUC-JP 薔?魚???莊 11101001101011000011111110110101111110110011111100111111001111111110100010110111 e9ac3fb5fb3f3f3fe8b7
UTF-8 薔렡魚펼렠렩莊 111010001001011010010100111010111010000010100001111010011010110110011010111011011000111010111100111010111010000010100000111010111010000010101001111010001000111010001010 e89694eba0a1e9ad9aed8ebceba0a0eba0a9e88e8a
UHC 薔렡魚펼렠렩莊 1110110111111001100011101011001011100101111000001100011011101110100011101011000110001110101101111110110111110110 edf98eb2e5e0c6ee8eb18eb7edf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)