What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???D???D^ 001111110011111100111111010001000011111100111111001111110100010001011110 3f3f3f443f3f3f445e
SJIS-WIN 爲??D爲??D^ 1110000010101000001111110011111101000100111000001010100000111111001111110100010001011110 e0a83f3f44e0a83f3f445e
EUC-JP 爲??D爲??D^ 1110000010101010001111110011111101000100111000001010101000111111001111110100010001011110 e0aa3f3f44e0aa3f3f445e
UTF-8 爲렯롌D爲렯롌D^ 111001111000100010110010111010111010000010101111111010111010000110001100010001001110011110001000101100101110101110100000101011111110101110100001100011000100010001011110 e788b2eba0afeba18c44e788b2eba0afeba18c445e
UHC 爲렯롌D爲렯롌D^ 111010101101001110001110101111001000111011010010010001001110101011010011100011101011110010001110110100100100010001011110 ead38ebc8ed244ead38ebc8ed2445e

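SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are replaced with "?" (0x3f), which is why the Korean characters appear as "??" in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.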
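
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's standard codecs `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that unencodable characters are replaced with "?" as in the table. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical and chosen only for illustration.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent become "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in convert("爲렯롌D爲렯롌D^").items():
        print(label, shown, bits, hexa)
```

Each output line follows the same column order as the table: charset, character, binary bit string, hexadecimal bit string. Results for the legacy charsets may differ slightly from the table if the page uses different conversion tables than Python's codecs.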