To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???馨??艶{?艶??形??艶{????? 0011111100111111001111111000101001011101001111110011111110001001100100001000000101101111001111111000100110010000001111110011111110001100011000000011111100111111100010011001000010000001011011110011111100111111001111110011111100111111 3f3f3f8a5d3f3f8990816f3f89903f3f8c603f3f8990816f3f3f3f3f3f
EUC-JP ???馨??艶{?艶??形??艶{????琰 00111111001111110011111110110011101111100011111100111111101100011111000010100001110100000011111110110001111100000011111100111111101101111100000100111111001111111011000111110000101000011101000000111111001111110011111100111111100011111100110010110100 3f3f3fb3be3f3fb1f0a1d03fb1f03f3fb7c13f3fb1f0a1d03f3f3f3f8fccb4
UTF-8 怜붺윢馨됬컮艶{옫艶쀮뵃形루컮艶{컞怜붺윢琰 111011111010011010101100111010111011011010111010111011001001110010100010111010011010011010101000111010111001000010101100111011001011101110101110111010001000100110110110111011111011110110011011111011001001100010101011111010001000100110110110111011001000000010101110111010111011010110000011111001011011110110100010111010111010001110101000111011001011101110101110111010001000100110110110111011111011110110011011111011001011101110011110111011111010011010101100111010111011011010111010111011001001110010100010111001111001000010110000 efa6acebb6baec9ca2e9a6a8eb90acecbbaee889b6efbd9bec98abe889b6ec80aeebb583e5bda2eba3a8ecbbaee889b6efbd9becbb9eefa6acebb6baec9ca2e790b0
UHC 怜붺윢馨됬컮艶{옫艶쀮뵃形루컮艶{컞怜붺윢琰 1110011110110000100101001110011110011111101000111111101110110000100010011110011110110000100101001110011011111101101000111111101110011110101010101110011011111101100101111110111010010100100010011111101110100001101101111110011110110000100101001110011011111101101000111111101110110000100010011110011110110000100101001110011110011111101000111110011011111100 e7b094e79fa3fbb089e7b094e6fda3fb9eaae6fd97ee9489fba1b7e7b094e6fda3fbb089e7b094e79fa3e6fc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
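
A table like the one above can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. It assumes that Python's `cp932` codec corresponds to SJIS-Win and `cp949` to UHC, and that characters a charset cannot represent are replaced with "?" (byte 3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_table(text):
    """Return (charset, decoded text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, chars, bits, hexstr in encode_table("馨"):
        print(label, chars, bits, hexstr)
```

Calling `encode_table` with a single character gives one row per charset. The bit string is simply each encoded byte written as eight binary digits, so it is always four times as long as the hexadecimal string.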