Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ワス疾痔嵭軸鳧 111100001011101011011100111100011000111010111101100011101011111011110010101010111000111010100100111110101011001110001110101100101110100111101000 f0badcf18ebd8ebef2ab8ea4fab38eb2e9e8
EUC-JP ?ワ?ス疾?痔嵭軸鳧 001111111000111011011100001111111000111010111101101111001100000000111111101111001010011010001111101110111101111010111100101101001111001011101010 3f8edc3f8ebdbcc03fbca68fbbdebcb4f2ea
UTF-8 ワス疾痔嵭軸鳧 111011101000000110111001111011111011111010011100111011101000010010001001111011111011110110111101111001111001011010111110111011101000011110100010111001111001011110010100111001011011010110101101111010001011101110111000111010011011001110100111 ee81b9efbe9cee8489efbdbde796beee87a2e79794e5b5ade8bbb8e9b3a7
UHC ????疾?痔?軸鳧 0011111100111111001111110011111111110010111100000011111111110110110000000011111111110101111011101101110111000000 3f3f3f3ff2f03ff6c03ff5eeddc0

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
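
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The helper name `to_bit_strings` is hypothetical, and the Python codec names (`cp932` for SJIS-win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest equivalents, so a few characters may map differently than they do here. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the table.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexa = to_bit_strings("疾", codec)
        print(label, bits, hexa)
```

For example, `to_bit_strings("疾", "utf-8")` returns the hexadecimal string `e796be`, which matches the corresponding bytes in the UTF-8 row above, while `to_bit_strings("疾", "iso-8859-1")` returns `3f` because ISO-8859-1 has no such character.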