Which bit string does a character (or a short string of characters) encode to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42 |
| SJIS-WIN | 滴陳?滴陳?B | 1001001101001000100100101100001000111111100100110100100010010010110000100011111101000010 | 934892c23f934892c23f42 |
| EUC-JP | 滴陳?滴陳?B | 1100010110101001110001001100010000111111110001011010100111000100110001000011111101000010 | c5a9c4c43fc5a9c4c43f42 |
| UTF-8 | 滴陳옭滴陳옭B | 11100110101110111011010011101001100110011011001111101100100110001010110111100110101110111011010011101001100110011011001111101100100110001010110101000010 | e6bbb4e999b3ec98ade6bbb4e999b3ec98ad42 |
| UHC | 滴陳옭滴陳옭B | 11101110110110011111001011100111101111111100010011101110110110011111001011100111101111111100010001000010 | eed9f2e7bfc4eed9f2e7bfc442 |

SJIS-WIN, EUC-JP: Classic character sets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
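
The conversion shown in the table can be roughly reproduced offline. The following is a minimal sketch in Python, not the code this page runs. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_row` is a hypothetical helper name. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "滴陳옭滴陳옭B"
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.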