To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 韈暦スェ隰暦スゥ遒 1110100011100111100101111110111110111101101010101110100010101100100101111110111110111101101010011110011110100010 e8e797efbdaae8ac97efbda9e7a2
EUC-JP 韈暦スェ隰暦スゥ遒 111100001110100111001110111100011000111010111101100011101010101011110000101011101100111011110001100011101011110110001110101010011110111010100100 f0e9cef18ebd8eaaf0aecef18ebd8ea9eea4
UTF-8 韈暦スェ隰暦スゥ遒 111010011001111110001000111001101001101010100110111011111011110110111101111011111011110110101010111010011001101010110000111001101001101010100110111011111011110110111101111011111011110110101001111010011000000110010010 e99f88e69aa6efbdbdefbdaae99ab0e69aa6efbdbdefbda9e98192
UHC ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)