To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????猷ъ????臾??諛??榮??釗 00111111001111110011111100111111001111110011111110010111010100011000010010001100001111110011111100111111001111111110010001101011001111110011111111100110100001110011111100111111100111101100010000111111001111111111101110111011 3f3f3f3f3f3f9751848c3f3f3f3fe46b3f3fe6873f3f9ec43f3ffbbb
EUC-JP ??????猷ъ?孼??臾??諛??榮??釗 00111111001111110011111100111111001111110011111111001101101100101010011111101100001111111000111110111010110000110011111100111111111001111100110000111111001111111110101111100111001111110011111111011100110001100011111100111111100011111110001110100110 3f3f3f3f3f3fcdb2a7ec3f8fbac33f3fe7cc3f3febe73f3fdcc63f3f8fe3a6
UTF-8 捻뀀뜄梨뜹츦猷ъ뵖孼꾩슦臾쇔넇諛대궙榮붽쒀釗 1110111110100110101001001110101110000000100000001110101110011100100001001110111110100111101000101110101110011100101110011110110010111000101001101110011110001100101101111101000110001010111010111011010110010110111001011010110110111100111010101011111010101001111011001000101010100110111010001000011110111110111011001000011110010100111010111000010010000111111010001010101110011011111010111000110010000000111010101011011010011001111001101010011010101110111010111011011010111101111011001001001010000000111010011000011110010111 efa6a4eb8080eb9c84efa7a2eb9cb9ecb8a6e78cb7d18aebb596e5adbceabea9ec8aa6e887beec8794eb8487e8ab9beb8c80eab699e6a6aeebb6bdec9280e98797
UHC 捻뀀뜄梨뜹츦猷ъ뵖孼꾩슦臾쇔넇諛대궙榮붽쒀釗 1110011011110111101100101110101110001101100010001110110010110001101101101110010110101110100111001110101110100011101011001110110010010100100110001110010111101101100001001110110010011010101100001110101110101100101111001110010110000110100101111110101110110000101101001110101110000010101011101110011110110100100101001110101010111110101011001110000111110010 e6f7b2eb8d88ecb1b6e5ae9ceba3acec9498e5ed84ec9ab0ebacbce58697ebb0b4eb82aee7b494eabeace1f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)