To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 症贇鮨闖ヌ贇セ 1000111111000111111001101101010011101001101111011110100010001111110001111110011011010100111100011000111010111110 8fc7e6d4e9bde88fc7e6d4f18ebe
EUC-JP 症贇鮨闖ヌ贇?セ 101111101100100111101100110101101111001010111111111011111110111110001110110001111110110011010110001111111000111010111110 bec9ecd6f2bfefef8ec7ecd63f8ebe
UTF-8 症贇鮨闖ヌ贇セ 111001111001011110000111111010001011010010000111111010011010111010101000111010011001011110010110111011111011111010000111111010001011010010000111111011101000010010001001111011111011110110111110 e79787e8b487e9aea8e99796efbe87e8b487ee8489efbdbe
UHC 症贇?闖?贇?? 111100011111100011101011110010110011111111110111111001100011111111101011110010110011111100111111 f1f8ebcb3ff7e63febcb3f3f

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
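
The conversion in the table can be approximated offline with a few lines of Python. This is only a sketch, not the page's actual implementation: the function name `encode_all` and the mapping from the page's charset labels to Python codec names (in particular `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexa in encode_all("症"):
        print(label, binary, hexa)
```

For the single character 症, this gives `e79787` in UTF-8, `8fc7` in CP932 and `bec9` in EUC-JP, matching the leading bytes of the corresponding rows in the table.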