To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???塋??搖η?彦??節??業??塋??^ 0011111100111111001111111001101011001000001111110011111110011101100010101000001111000101001111111001010101000110001111110011111110010000110111110011111100111111100010111100011000111111001111111001101011001000001111110011111101011110 3f3f3f9ac83f3f9d8a83c53f95463f3f90df3f3f8bc63f3f9ac83f3f5e
EUC-JP ???塋??搖η?彦??節??業??塋??^ 0011111100111111001111111101010011001010001111110011111111011001111010101010011011000111001111111100100110100111001111110011111111000000111000010011111100111111101101101100100000111111001111111101010011001010001111110011111101011110 3f3f3fd4ca3f3fd9eaa6c73fc9a73f3fc0e13f3fb6c83f3fd4ca3f3f5e
UTF-8 寧륅슈塋뤸샄搖η쳪彦뤹쵍節당쳪業띌뼢塋뤸땹^ 111011111010011010101010111010111010010110000101111011001000101010001000111001011010000110001011111010111010010010111000111011001000001110000100111001101001000010010110110011101011011111101100101100111010101011100101101111011010011011101011101001001011100111101100101101011000110111100111101011111000000011101011100010111011100111101100101100111010101011100110101001011010110111101011100111011000110011101011101111001010001011100101101000011000101111101011101001001011100011101011100101011011100101011110 efa6aaeba585ec8a88e5a18beba4b8ec8384e69096ceb7ecb3aae5bda6eba4b9ecb58de7af80eb8bb9ecb3aae6a5adeb9d8cebbca2e5a18beba4b8eb95b95e
UHC 寧륅슈塋뤸샄搖η쳪彦뤹쵍節당쳪業띌뼢塋뤸땹^ 11100111101011001000111111101111101111011011010011100111101010111000111111100110100110001011011011101000111101001010010111100111101010111000111111100101111010011000111111100111101011001000111111101111101111011011010011100111101010111000111111100101111101101011011011101001100101101010010111100111101010111000111111100110100010111000111101011110 e7ac8fefbdb4e7ab8fe698b6e8f4a5e7ab8fe5e98fe7ac8fefbdb4e7ab8fe5f6b6e996a5e7ab8fe68b8f5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
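
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs `cp932` and `cp949` are acceptable stand-ins for SJIS-WIN and UHC. The helper name `to_bit_strings` and the `CHARSETS` mapping are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table above.

```python
# Map the charset labels used in the table to Python codec names.
# The cp932 and cp949 choices are assumptions about what SJIS-WIN and UHC mean here.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes '?' for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ^"  # one Japanese character followed by an ASCII caret
    print("Charset", "Bit string (binary)", "Bit string (hexadecimal)")
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.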