To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 氈???傳?燼逗??霆???禎?淳逗??B 1001111110000001001111110011111100111111100110010100001000111111111000001001111010010000100000000011111100111111111010001011101100111111001111110011111110010010111101010011111110001111011111101001000010000000001111110011111101000010 9f813f3f3f99423fe09e90803f3fe8bb3f3f3f92f53f8f7e90803f3f42
EUC-JP 氈???傳?燼逗??霆???禎?淳逗??B 1101110111100001001111110011111100111111110100011010001100111111110111111111111010111111111000000011111100111111111100001011110100111111001111110011111111000100111101110011111110111101110111111011111111100000001111110011111101000010 dde13f3f3fd1a33fdffebfe03f3ff0bd3f3f3fc4f73fbddfbfe03f3f42
UTF-8 氈肋렰렍傳렓燼逗렗렒霆肋렰렍禎렓淳逗렗렒B 11100110101100001000100011101111101001011001001111101011101000001011000011101011101000001000110111100101100000101011001111101011101000001001001111100111100001111011110011101001100000001001011111101011101000001001011111101011101000001001001011101001100111001000011011101111101001011001001111101011101000001011000011101011101000001000110111100111101001101000111011101011101000001001001111100110101101111011001111101001100000001001011111101011101000001001011111101011101000001001001001000010 e6b088efa593eba0b0eba08de582b3eba093e787bce98097eba097eba092e99c86efa593eba0b0eba08de7a68eeba093e6b7b3e98097eba097eba09242
UHC 氈肋렰렍傳렓燼逗렗렒霆肋렰렍禎렓淳逗렗렒B 1110111011111101110100101111000110001110101111011000111010100011111011101110111010001110101010001110001111101000110101001110100010001110101011001000111010100111111011111111110111010010111100011000111010111101100011101010001111101111111011101000111010101000111000101110100011010100111010001000111010101100100011101010011101000010 eefdd2f18ebd8ea3eeee8ea8e3e8d4e88eac8ea7effdd2f18ebd8ea3efee8ea8e2e8d4e88eac8ea742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)