Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???Y???YB 001111110011111100111111010110010011111100111111001111110101100101000010 3f3f3f593f3f3f5942
SJIS-WIN 吟?ｄY吟?ｄYB 10001011111000010011111110000010100001000101100110001011111000010011111110000010100001000101100101000010 8be13f8284598be13f82845942
EUC-JP 吟?ｄY吟?ｄYB 10110110111000110011111110100011111001000101100110110110111000110011111110100011111001000101100101000010 b6e33fa3e459b6e33fa3e45942
UTF-8 吟㏘ｄY吟㏘ｄYB 111001011001000010011111111000111000111110011000111011111011110110000100010110011110010110010000100111111110001110001111100110001110111110111101100001000101100101000010 e5909fe38f98efbd8459e5909fe38f98efbd845942
UHC 吟㏘ｄY吟㏘ｄYB 111010111110000110100010111001001010001111100100010110011110101111100001101000101110010010100011111001000101100101000010 ebe1a2e4a3e459ebe1a2e4a3e45942

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
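
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents of the charset labels, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is the behavior the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = data.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Build one row per charset, keyed by the label used in the table."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in convert("吟Y").items():
        print(label, shown, bits, hexa)
```

Results for characters outside a charset's repertoire may differ slightly between implementations, since each library has its own mapping tables and replacement rules.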