To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 逆??揖??擬??沃??乙ょ?柔j?沃 10001011011101000011111100111111100101110100101100111111001111111000101101011011001111110011111110010111100000000011111100111111100010011011001110000010111001010011111110001111010111111000001010001010001111111001011110000000 8b743f3f974b3f3f8b5b3f3f97803f3f89b382e53f8f5f828a3f9780
EUC-JP 逆??揖??擬??沃??乙ょ?柔j?沃 10110101110101010011111100111111110011011010110000111111001111111011010110111100001111110011111111001101111000000011111100111111101100101011010110100100111001110011111110111101110000001010001111101010001111111100110111100000 b5d53f3fcdac3f3fb5bc3f3fcde03f3fb2b5a4e73fbdc0a3ea3fcde0
UTF-8 逆곷벡揖밧츦擬듭춷沃섅룂乙ょ춯柔j틓沃 111010011000000010000110111010101011001110110111111010111011001010100001111001101000111110010110111010111011000010100111111011001011100010100110111001101001001110101100111010111001001110101101111011001011011010110111111001101011001010000011111011001000010010000101111010111010001110000010111001001011100110011001111000111000001010000111111011001011011010101111111001101001111110010100111011111011110110001010111011011000101110010011111001101011001010000011 e98086eab3b7ebb2a1e68f96ebb0a7ecb8a6e693aceb93adecb6b7e6b283ec8485eba382e4b999e38287ecb6afe69f94efbd8aed8b93e6b283
UHC 逆곷벡揖밧츦擬듭춷沃섅룂乙ょ춯柔j틓沃 1110011010111101100000011110101110111010101001001110101111100111101110011110010110101110100111001110101111110100101101011110110010101101100100111110100010101010100110001110001110001111100000111110101111100000101010101110011110101101100011001110101011110101101000111110101010111010100000101110100010101010 e6bd81ebbaa4ebe7b9e5ae9cebf4b5ecad93e8aa98e38f83ebe0aae7ad8ceaf5a3eaba82e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)