To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 擾??援??儒???λ?揖??循?????^ 10001111111011110011111100111111100010011000011100111111001111111000111011110010001111110011111100111111100000111100100100111111100101110100101100111111001111111000111101111010001111110011111100111111001111110011111101011110 8fef3f3f89873f3f8ef23f3f3f83c93f974b3f3f8f7a3f3f3f3f3f5e
EUC-JP 擾??援??儒???λ?揖??循?????^ 10111110111100010011111100111111101100011110011100111111001111111011110011110100001111110011111100111111101001101100101100111111110011011010110000111111001111111011110111011011001111110011111100111111001111110011111101011110 bef13f3fb1e73f3fbcf43f3f3fa6cb3fcdac3f3fbddb3f3f3f3f3f5e
UTF-8 擾우엱援욃츦儒뺤냸若λ툕揖먫뮲循뗣뀍說깆퀡^ 111001101001001110111110111011001001101010110000111011001001011110110001111001101000111110110100111011001001101010000011111011001011100010100110111001011000010010010010111010111011101010100100111010111000001110111000111011111010010110110100110011101011101111101101100010001001010111100110100011111001011011101011101010001010101111101011101011101011001011100101101111101010101011101011100101111010001111101011100000001000110111101111101001101010000111101010101110011000011011101101100000001010000101011110 e693beec9ab0ec97b1e68fb4ec9a83ecb8a6e58492ebbaa4eb83b8efa5b4cebbed8895e68f96eba8abebaeb2e5beaaeb97a3eb808defa6a1eab986ed80a15e
UHC 擾우엱援욃츦儒뺤냸若λ툕揖먫뮲循뗣뀍說깆퀡^ 11101000111101101011111111101100100111101000011011101010101101011001111011100101101011101001110011101010111000111001010111101100100001101000100011100101101011101010010111101011101110001000110011101011111001111001000011101000100100101011101111100010111000001000101111100011100001011000100011100110111100101011000111101100101100111001010101011110 e8f6bfec9e86eab59ee5ae9ceae395ec8688e5aea5ebb88cebe790e892bbe2e08be38588e6f2b1ecb3955e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)