To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 魑カ竏ス岺オ邨 1110100110110011101101101110001010001000101111011111101010101101101101011110011110110101 e9b3b6e288bdfaadb5e7b5
EUC-JP 魑カ竏ス岺オ邨 111100101011010110001110101101101110001111101000100011101011110110001111101110111011100010001110101101011110111010110111 f2b58eb6e3e88ebd8fbbb88eb5eeb7
UTF-8 魑カ竏ス岺オ邨 111010011010110110010001111011111011110110110110111001111010101110001111111011111011110110111101111001011011001010111010111011111011110110110101111010011000001010101000 e9ad91efbdb6e7ab8fefbdbde5b2baefbdb5e982a8
UHC ????岺?邨 001111110011111100111111001111111101011010111001001111111111010110111110 3f3f3f3fd6b93ff5be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)