To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????BF 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
SJIS-WIN シナシエシナシ・シナシウシナ邪社BF 1011110011000101101111001011010010111100110001011011110010100101101111001100010110111100101100111011110011000101100011101101011110001110110100000100001001000110 bcc5bcb4bcc5bca5bcc5bcb3bcc58ed78ed04246
EUC-JP シナシエシナシ・シナシウシナ邪社BF 10001110101111001000111011000101100011101011110010001110101101001000111010111100100011101100010110001110101111001000111010100101100011101011110010001110110001011000111010111100100011101011001110001110101111001000111011000101101111001101100110111100110100100100001001000110 8ebc8ec58ebc8eb48ebc8ec58ebc8ea58ebc8ec58ebc8eb38ebc8ec5bcd9bcd24246
UTF-8 シナシエシナシ・シナシウシナ邪社BF 1110111110111101101111001110111110111110100001011110111110111101101111001110111110111101101101001110111110111101101111001110111110111110100001011110111110111101101111001110111110111101101001011110111110111101101111001110111110111110100001011110111110111101101111001110111110111101101100111110111110111101101111001110111110111110100001011110100110000010101010101110011110100100101111100100001001000110 efbdbcefbe85efbdbcefbdb4efbdbcefbe85efbdbcefbda5efbdbcefbe85efbdbcefbdb3efbdbcefbe85e982aae7a4be4246
UHC ??????????????邪社BF 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111110111101111011111011110111001000100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3fdef7dee44246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)