To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遙??節??擾??謠?ゴ裔??偃??橈?じ^ 11101010101000010011111100111111100100001101111100111111001111111000111111101111001111110011111111100110100011110011111110000011010100111110010111100001001111110011111110011000111011100011111100111111100111101111010000111111100000101011011001011110 eaa13f3f90df3f3f8fef3f3fe68f3f8353e5e13f3f98ee3f3f9ef43f82b65e
EUC-JP 遙??節??擾??謠?ゴ裔??偃??橈?じ^ 11110100101000110011111100111111110000001110000100111111001111111011111011110001001111110011111111101011111011110011111110100101101101001110101011100011001111110011111111010000111100000011111100111111110111001111011000111111101001001011100001011110 f4a33f3fc0e13f3fbef13f3febef3fa5b4eae33f3fd0f03f3fdcf63fa4b85e
UTF-8 遙닺궡節길뎴擾욥쐠謠면ゴ裔댐쉽偃뉔샒橈녽じ^ 11101001100000011001100111101011100010111011101011101010101101101010000111100111101011111000000011101010101110001011100011101011100011101011010011100110100100111011111011101100100110101010010111101100100100001010000011101000101011001010000011101011101010011011010011100011100000101011010011101000101000111001010011101011100011001001000011101100100010011011110111100101100000011000001111101011100010011001010011101100100000111001001011100110101010011000100011101011100001011011110111100011100000011001100001011110 e98199eb8bbaeab6a1e7af80eab8b8eb8eb4e693beec9aa5ec90a0e8aca0eba9b4e382b4e8a394eb8c90ec89bde58183eb8994ec8392e6a988eb85bde381985e
UHC 遙닺궡節길뎴擾욥쐠謠면ゴ裔댐쉽偃뉔샒橈녽じ^ 11101001101010111011010011101000100000101011010011101111101111011011000111100110100010011000011111101000111101101011111111101001100111001000011011101001101010101011100011101001101010111011010011100111111000001011010011101111101111011011000111100101111001111000011111101001100110001011111111101000111110101000011011101001101010101011100001011110 e9abb4e882b4efbdb1e68987e8f6bfe99c86e9aab8e9abb4e7e0b4efbdb1e5e787e998bfe8fa86e9aab85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)