What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Byte string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 髫エ螟占アェ隴帛刀髫エ螟占アェ隴帛刀B 11101001100110101011010011100101101001001001000011101000101100011010101011101000101011011001101111100101100100111000000111101001100110101011010011100101101001001001000011101000101100011010101011101000101011011001101111100101100100111000000101000010 e99ab4e5a490e8b1aae8ad9be59381e99ab4e5a490e8b1aae8ad9be5938142
EUC-JP 髫エ螟占アェ隴帛刀髫エ螟占アェ隴帛刀B 11110001111110101000111010110100111010101010011011000000111010101000111010110001100011101010101011110000101011111101011011100111110001011110000111110001111110101000111010110100111010101010011011000000111010101000111010110001100011101010101011110000101011111101011011100111110001011110000101000010 f1fa8eb4eaa6c0ea8eb18eaaf0afd6e7c5e1f1fa8eb4eaa6c0ea8eb18eaaf0afd6e7c5e142
UTF-8 髫エ螟占アェ隴帛刀髫エ螟占アェ隴帛刀B 11101001101010111010101111101111101111011011010011101000100111101001111111100101100011011010000011101111101111011011000111101111101111011010101011101001100110101011010011100101101110001001101111100101100010001000000011101001101010111010101111101111101111011011010011101000100111101001111111100101100011011010000011101111101111011011000111101111101111011010101011101001100110101011010011100101101110001001101111100101100010001000000001000010 e9ababefbdb4e89e9fe58da0efbdb1efbdaae99ab4e5b89be58880e9ababefbdb4e89e9fe58da0efbdb1efbdaae99ab4e5b89be5888042
UHC ??螟占???帛刀??螟占???帛刀B 001111110011111111011001101011011110111110111111001111110011111100111111110110111101100111010011111011110011111100111111110110011010110111101111101111110011111100111111001111111101101111011001110100111110111101000010 3f3fd9adefbf3f3f3fdbd9d3ef3f3fd9adefbf3f3f3fdbd9d3ef42

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
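
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the runs of 3f in the ISO-8859-1 and UHC rows above. Python's codecs may differ from the page's converter for some vendor-specific characters.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns every unencodable character into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary, hexadecimal.
    for label, (bits, hex_string) in encode_table("あB").items():
        print(label, bits, hex_string)
```

Every row ends in 42 because the ASCII letter B has the same single-byte encoding in all five charsets; only the preceding characters differ from one charset to the next.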