To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 霆???禎?燼逗??霆???禎?燼逗??B 1110100010111011001111110011111100111111100100101111010100111111111000001001111010010000100000000011111100111111111010001011101100111111001111110011111110010010111101010011111111100000100111101001000010000000001111110011111101000010 e8bb3f3f3f92f53fe09e90803f3fe8bb3f3f3f92f53fe09e90803f3f42
EUC-JP 霆???禎?燼逗??霆???禎?燼逗??B 1111000010111101001111110011111100111111110001001111011100111111110111111111111010111111111000000011111100111111111100001011110100111111001111110011111111000100111101110011111111011111111111101011111111100000001111110011111101000010 f0bd3f3f3fc4f73fdffebfe03f3ff0bd3f3f3fc4f73fdffebfe03f3f42
UTF-8 霆肋렰렍禎렓燼逗렗렒霆肋렰렍禎렓燼逗렗렒B 11101001100111001000011011101111101001011001001111101011101000001011000011101011101000001000110111100111101001101000111011101011101000001001001111100111100001111011110011101001100000001001011111101011101000001001011111101011101000001001001011101001100111001000011011101111101001011001001111101011101000001011000011101011101000001000110111100111101001101000111011101011101000001001001111100111100001111011110011101001100000001001011111101011101000001001011111101011101000001001001001000010 e99c86efa593eba0b0eba08de7a68eeba093e787bce98097eba097eba092e99c86efa593eba0b0eba08de7a68eeba093e787bce98097eba097eba09242
UHC 霆肋렰렍禎렓燼逗렗렒霆肋렰렍禎렓燼逗렗렒B 1110111111111101110100101111000110001110101111011000111010100011111011111110111010001110101010001110001111101000110101001110100010001110101011001000111010100111111011111111110111010010111100011000111010111101100011101010001111101111111011101000111010101000111000111110100011010100111010001000111010101100100011101010011101000010 effdd2f18ebd8ea3efee8ea8e3e8d4e88eac8ea7effdd2f18ebd8ea3efee8ea8e3e8d4e88eac8ea742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)