To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 橈??渦??節??汚??橈??渦??節??汚??^ 100111101111010000111111001111111000100101010001001111110011111110010000110111110011111100111111100010011001100000111111001111111001111011110100001111110011111110001001010100010011111100111111100100001101111100111111001111111000100110011000001111110011111101011110 9ef43f3f89513f3f90df3f3f89983f3f9ef43f3f89513f3f90df3f3f89983f3f5e
EUC-JP 橈??渦??節??汚??橈??渦??節??汚??^ 110111001111011000111111001111111011000110110010001111110011111111000000111000010011111100111111101100011111100000111111001111111101110011110110001111110011111110110001101100100011111100111111110000001110000100111111001111111011000111111000001111110011111101011110 dcf63f3fb1b23f3fc0e13f3fb1f83f3fdcf63f3fb1b23f3fc0e13f3fb1f83f3f5e
UTF-8 橈꾬쉬渦쒐럷節븝쉬汚뜻굵橈꾬쉬渦쒐럷節븝쉬汚뜻굵^ 11100110101010011000100011101010101111101010110011101100100010011010110011100110101110001010011011101100100100101001000011101011100111111011011111100111101011111000000011101011101110001001110111101100100010011010110011100110101100011001101011101011100111001011101111101010101101011011010111100110101010011000100011101010101111101010110011101100100010011010110011100110101110001010011011101100100100101001000011101011100111111011011111100111101011111000000011101011101110001001110111101100100010011010110011100110101100011001101011101011100111001011101111101010101101011011010101011110 e6a988eabeacec89ace6b8a6ec9290eb9fb7e7af80ebb89dec89ace6b19aeb9cbbeab5b5e6a988eabeacec89ace6b8a6ec9290eb9fb7e7af80ebb89dec89ace6b19aeb9cbbeab5b55e
UHC 橈꾬쉬渦쒐럷節븝쉬汚뜻굵橈꾬쉬渦쒐럷節븝쉬汚뜻굵^ 11101000111110101000010011101111101111011010110011101000101111101001110011100111100011101001011011101111101111011011101011101111101111011010110011100111111111011011011011100110101100011011110111101000111110101000010011101111101111011010110011101000101111101001110011100111100011101001011011101111101111011011101011101111101111011010110011100111111111011011011011100110101100011011110101011110 e8fa84efbdace8be9ce78e96efbdbaefbdace7fdb6e6b1bde8fa84efbdace8be9ce78e96efbdbaefbdace7fdb6e6b1bd5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)