To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???O??????? 0011111100111111001111110100111100111111001111110011111100111111001111110011111100111111 3f3f3f4f3f3f3f3f3f3f3f
SJIS-WIN 症エO硝ハシ、省コ 10001111110001111111000010111100101101000100111110001111110010011100101010111100101001001000111111001000111110001011110010111010 8fc7f0bcb44f8fc9cabca48fc8f8bcba
EUC-JP 症?エO硝ハシ、省?コ 10111110110010010011111110001110101101000100111110111110110010111000111011001010100011101011110010001110101001001011111011001010001111111000111010111010 bec93f8eb44fbecb8eca8ebc8ea4beca3f8eba
UTF-8 症エO硝ハシ、省コ 11100111100101111000011111101110100000011011101111101111101111011011010001001111111001111010000110011101111011111011111010001010111011111011110110111100111011111011110110100100111001111001110010000001111011101001100110011011111011111011110110111010 e79787ee81bbefbdb44fe7a19defbe8aefbdbcefbda4e79c81ee999befbdba
UHC 症??O硝???省?? 1111000111111000001111110011111101001111111101011010011000111111001111110011111111100000111111010011111100111111 f1f83f3f4ff5a63f3f3fe0fd3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)