To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 雋樌沫迯オ菫コ 111010001011001010011110111011001001011010010111111001111000110110110101111001001011111110111010 e8b29eec9697e78db5e4bfba
EUC-JP 雋樌沫迯オ菫コ 1111000010110100110111001110111011001011111101111110110111101101100011101011010111101000110000011000111010111010 f0b4dceecbf7eded8eb5e8c18eba
UTF-8 雋樌沫迯オ菫コ 111010011001101110001011111001101010100010001100111001101011001010101011111010001011111110101111111011111011110110110101111010001000111110101011111011111011110110111010 e99b8be6a88ce6b2abe8bfafefbdb5e88fabefbdba
UHC 雋?沫??菫? 11110001111001100011111111011000110010000011111100111111110100001100101100111111 f1e63fd8c83f3fd0cb3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)