What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 瑤??澳?????押??節??央??澳??^ 11101010101000100011111100111111111000000101001100111111001111110011111100111111001111111000100110011111001111110011111110010000110111110011111100111111100010011001101100111111001111111110000001010011001111110011111101011110 eaa23f3fe0533f3f3f3f3f899f3f3f90df3f3f899b3f3fe0533f3f5e
EUC-JP 瑤??澳??縕??押??節??央??澳??^ 111101001010010000111111001111111101111110110100001111110011111110001111110101001100001000111111001111111011001010100001001111110011111111000000111000010011111100111111101100011111101100111111001111111101111110110100001111110011111101011110 f4a43f3fdfb43f3f8fd4c23f3fb2a13f3fc0e13f3fb1fb3f3fdfb43f3f5e
UTF-8 瑤놅슨澳묈돭縕띹젒押낂웺節쇠젒央계쾷澳묈뀉^ 11100111100100011010010011101011100001101000010111101100100010101010100011100110101111101011001111101011101011001000100011101011100011111010110111100111101110001001010111101011100111011011100111101100101000001001001011100110100010101011110011101011100000101000001011101100100110111011101011100111101011111000000011101100100001111010000011101100101000001001001011100101101001001010111011101010101100111000010011101100101111101011011111100110101111101011001111101011101011001000100011101011100000001000100101011110 e791a4eb8685ec8aa8e6beb3ebac88eb8fade7b895eb9db9eca092e68abceb8282ec9bbae7af80ec87a0eca092e5a4aeeab384ecbeb7e6beb3ebac88eb80895e
UHC 瑤놅슨澳묈돭縕띹젒押낂웺節쇠젒央계쾷澳묈뀉^ 11101000111111011000011011101111101111011011110011100111111111101001000111100101100010011011000011101000101100101000110111101000101000001001000111100100111000111000010111101001100111111000011011101111101111011011110011101000101000001001000111100100111001111011000011101000101100101000110111100111111111101001000111100101100001011000010101011110 e8fd86efbdbce7fe91e589b0e8b28de8a091e4e385e99f86efbdbce8a091e4e7b0e8b28de7fe91e585855e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
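
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (in particular UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("^").items():
        print(label, binary, hexadecimal)
```

Calling `convert("瑤")`, for example, returns one (binary, hexadecimal) pair per charset, which can be compared with the leading bytes of each row in the table.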