To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 髫エ螟占アェ隍晏刀髫エ螟占アェ隍晏刀B 11101001100110101011010011100101101001001001000011101000101100011010101011101000101001001001110111100101100100111000000111101001100110101011010011100101101001001001000011101000101100011010101011101000101001001001110111100101100100111000000101000010 e99ab4e5a490e8b1aae8a49de59381e99ab4e5a490e8b1aae8a49de5938142
EUC-JP 髫エ螟占アェ隍晏刀髫エ螟占アェ隍晏刀B 11110001111110101000111010110100111010101010011011000000111010101000111010110001100011101010101011110000101001101101101011100111110001011110000111110001111110101000111010110100111010101010011011000000111010101000111010110001100011101010101011110000101001101101101011100111110001011110000101000010 f1fa8eb4eaa6c0ea8eb18eaaf0a6dae7c5e1f1fa8eb4eaa6c0ea8eb18eaaf0a6dae7c5e142
UTF-8 髫エ螟占アェ隍晏刀髫エ螟占アェ隍晏刀B 11101001101010111010101111101111101111011011010011101000100111101001111111100101100011011010000011101111101111011011000111101111101111011010101011101001100110101000110111100110100110011000111111100101100010001000000011101001101010111010101111101111101111011011010011101000100111101001111111100101100011011010000011101111101111011011000111101111101111011010101011101001100110101000110111100110100110011000111111100101100010001000000001000010 e9ababefbdb4e89e9fe58da0efbdb1efbdaae99a8de6998fe58880e9ababefbdb4e89e9fe58da0efbdb1efbdaae99a8de6998fe5888042
UHC ??螟占??隍晏刀??螟占??隍晏刀B 0011111100111111110110011010110111101111101111110011111100111111111111001101101111100100110011111101001111101111001111110011111111011001101011011110111110111111001111110011111111111100110110111110010011001111110100111110111101000010 3f3fd9adefbf3f3ffcdbe4cfd3ef3f3fd9adefbf3f3ffcdbe4cfd3ef42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)