To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????猷ъ????臾??諛??榮??陰 00111111001111110011111100111111001111110011111110010111010100011000010010001100001111110011111100111111001111111110010001101011001111110011111111100110100001110011111100111111100111101100010000111111001111111000100101000001 3f3f3f3f3f3f9751848c3f3f3f3fe46b3f3fe6873f3f9ec43f3f8941
EUC-JP ??????猷ъ?孼??臾??諛??榮??陰 001111110011111100111111001111110011111100111111110011011011001010100111111011000011111110001111101110101100001100111111001111111110011111001100001111110011111111101011111001110011111100111111110111001100011000111111001111111011000110100010 3f3f3f3f3f3fcdb2a7ec3f8fbac33f3fe7cc3f3febe73f3fdcc63f3fb1a2
UTF-8 捻뀀뜄梨뜹츦猷ъ뵖孼꾩슦臾쇔넇諛대궙榮붽쑈陰 1110111110100110101001001110101110000000100000001110101110011100100001001110111110100111101000101110101110011100101110011110110010111000101001101110011110001100101101111101000110001010111010111011010110010110111001011010110110111100111010101011111010101001111011001000101010100110111010001000011110111110111011001000011110010100111010111000010010000111111010001010101110011011111010111000110010000000111010101011011010011001111001101010011010101110111010111011011010111101111011001001000110001000111010011001100110110000 efa6a4eb8080eb9c84efa7a2eb9cb9ecb8a6e78cb7d18aebb596e5adbceabea9ec8aa6e887beec8794eb8487e8ab9beb8c80eab699e6a6aeebb6bdec9188e999b0
UHC 捻뀀뜄梨뜹츦猷ъ뵖孼꾩슦臾쇔넇諛대궙榮붽쑈陰 1110011011110111101100101110101110001101100010001110110010110001101101101110010110101110100111001110101110100011101011001110110010010100100110001110010111101101100001001110110010011010101100001110101110101100101111001110010110000110100101111110101110110000101101001110101110000010101011101110011110110100100101001110101010111110101001001110101111100100 e6f7b2eb8d88ecb1b6e5ae9ceba3acec9498e5ed84ec9ab0ebacbce58697ebb0b4eb82aee7b494eabea4ebe4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)