To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 顏エ螟占アェ隍昜髫エ螟占アェ隍晏刀B 11101000111110001011010011100101101001001001000011101000101100011010101011101000101001001001110111100100111100011000000111101001100110101011010011100101101001001001000011101000101100011010101011101000101001001001110111100101100100111000000101000010 e8f8b4e5a490e8b1aae8a49de4f181e99ab4e5a490e8b1aae8a49de5938142
EUC-JP 顏エ螟占アェ隍昜?髫エ螟占アェ隍晏刀B 111100001111101010001110101101001110101010100110110000001110101010001110101100011000111010101010111100001010011011011010111001100011111111110001111110101000111010110100111010101010011011000000111010101000111010110001100011101010101011110000101001101101101011100111110001011110000101000010 f0fa8eb4eaa6c0ea8eb18eaaf0a6dae63ff1fa8eb4eaa6c0ea8eb18eaaf0a6dae7c5e142
UTF-8 顏エ螟占アェ隍昜髫エ螟占アェ隍晏刀B 11101001101000011000111111101111101111011011010011101000100111101001111111100101100011011010000011101111101111011011000111101111101111011010101011101001100110101000110111100110100110001001110011101110100000111011110011101001101010111010101111101111101111011011010011101000100111101001111111100101100011011010000011101111101111011011000111101111101111011010101011101001100110101000110111100110100110011000111111100101100010001000000001000010 e9a18fefbdb4e89e9fe58da0efbdb1efbdaae99a8de6989cee83bce9ababefbdb4e89e9fe58da0efbdb1efbdaae99a8de6998fe5888042
UHC ??螟占??隍????螟占??隍晏刀B 001111110011111111011001101011011110111110111111001111110011111111111100110110110011111100111111001111110011111111011001101011011110111110111111001111110011111111111100110110111110010011001111110100111110111101000010 3f3fd9adefbf3f3ffcdb3f3f3f3fd9adefbf3f3ffcdbe4cfd3ef42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)