What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???松ο?碎??癰???蘖???↑ぜ碎?? 001111110011111100111111100011111011110010000011110011010011111111100001111010100011111100111111111000011001111000111111001111110011111110011111010100000011111100111111001111111000000110101010100000101011101011100001111010100011111100111111 3f3f3f8fbc83cd3fe1ea3f3fe19e3f3f3f9f503f3f3f81aa82bae1ea3f3f
EUC-JP ???松ο?碎??癰???蘖???↑ぜ碎?? 001111110011111100111111101111101011111010100110110011110011111111100010111011000011111100111111111000011111111000111111001111110011111111011101101100010011111100111111001111111010001010101100101001001011110011100010111011000011111100111111 3f3f3fbebea6cf3fe2ec3f3fe1fe3f3f3fddb13f3f3fa2aca4bce2ec3f3f
UTF-8 捻꿔룗松ο쭏碎쇱춷癰궽쇱젷蘖뽮퉫璘↑ぜ碎띤렧 1110111110100110101001001110101010111111100101001110101110100011100101111110011010011101101111101100111010111111111011001010110110001111111001111010001010001110111011001000011110110001111011001011011010110111111001111001100110110000111010101011011010111101111011001000011110110001111011001010000010110111111010001001100010010110111010111011110110101110111011011000100110101011111011111010011110101111111000101000011010010001111000111000000110011100111001111010001010001110111010111001110110100100111010111010000010100111 efa6a4eabf94eba397e69dbecebfecad8fe7a28eec87b1ecb6b7e799b0eab6bdec87b1eca0b7e89896ebbdaeed89abefa7afe28691e3819ce7a28eeb9da4eba0a7
UHC 捻꿔룗松ο쭏碎쇱춷癰궽쇱젷蘖뽮퉫璘↑ぜ碎띤렧 1110011011110111101100101110001110001111100100111110000111100110101001011110111110100111100010001110000111101111101111001110110010101101100100111110100010111001100000101100111010111100111011001010000010101011111001011110111010010110111010101011100110000011111011001101111010100001111010001010101010111100111000011110111110110110111011011000111010110110 e6f7b2e38f93e1e6a5efa788e1efbcecad93e8b982cebceca0abe5ee96eab983ecdea1e8aabce1efb6ed8eb6

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
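
The conversion in the table above can be approximated offline with a minimal Python sketch. The codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the charsets on this page, and the helper names `encode_row` and `convert` are hypothetical. Characters a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec)  # what the text looks like after the round trip
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return shown, bits, raw.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("松").items():
        print(label, shown, bits, hexstr)
```

For the single character 松, this yields `8fbc` in SJIS-WIN, `bebe` in EUC-JP, and `e69dbe` in UTF-8, the same byte sequences that appear for that character in the table rows above.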