What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42
SJIS-WIN | 狄??狄??B | 111000001011110100111111001111111110000010111101001111110011111101000010 | e0bd3f3fe0bd3f3f42
EUC-JP | 狄侄侄狄侄侄B | 1110000010111111100011111011000011111110100011111011000011111110111000001011111110001111101100001111111010001111101100001111111001000010 | e0bf8fb0fe8fb0fee0bf8fb0fe8fb0fe42
UTF-8 | 狄侄侄狄侄侄B | 11100111100010111000010011100100101111101000010011100100101111101000010011100111100010111000010011100100101111101000010011100100101111101000010001000010 | e78b84e4be84e4be84e78b84e4be84e4be8442
UHC | 狄侄侄狄侄侄B | 11101110110110101111001011101001111100101110100111101110110110101111001011101001111100101110100101000010 | eedaf2e9f2e9eedaf2e9f2e942

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
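
A conversion of this kind can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the helper name `to_bit_strings` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and a different implementation may treat unmappable characters differently. Here, characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset.

    Characters the charset cannot represent are replaced with "?".
    """
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

# Sample string taken from the UTF-8 row of the table.
sample = "狄侄侄狄侄侄B"

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings(sample, codec)
    print(label, binary, hexadecimal)
```

The same string yields a different byte sequence in each charset, which is why the bit strings in the table differ in both content and length.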