To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遙??橈??吾??鈺?タ齬??魚??絶??^ 111010101010000100111111001111111001111011110100001111110011111110001100111000010011111100111111111110111100010000111111100000110101111011101010100101110011111100111111100010111001101100111111001111111001000011100010001111110011111101011110 eaa13f3f9ef43f3f8ce13f3ffbc43f835eea973f3f8b9b3f3f90e23f3f5e
EUC-JP 遙??橈??吾??鈺?タ齬??魚??絶??^ 11110100101000110011111100111111110111001111011000111111001111111011100011100011001111110011111110001111111000111101010100111111101001011011111111110011111101110011111100111111101101011111101100111111001111111100000011100100001111110011111101011110 f4a33f3fdcf63f3fb8e33f3f8fe3d53fa5bff3f73f3fb5fb3f3fc0e43f3f5e
UTF-8 遙욘깲橈꾣굢吾들퐥鈺썽タ齬좈쐞魚됭뱤絶쏉쉈^ 11101001100000011001100111101100100110101001100011101010101110011011001011100110101010011000100011101010101111101010001111101010101101011010001011100101100100001011111011101011100100111010010011101101100100001010010111101001100010001011101011101100100011011011110111100011100000101011111111101001101111011010110011101100101000101000100011101100100100001001111011101001101011011001101011101011100100001010110111101011101100011010010011100111101101011011011011101100100011111000100111101100100010011000100001011110 e98199ec9a98eab9b2e6a988eabea3eab5a2e590beeb93a4ed90a5e988baec8dbde382bfe9bdaceca288ec909ee9ad9aeb90adebb1a4e7b5b6ec8f89ec89885e
UHC 遙욘깲橈꾣굢吾들퐥鈺썽タ齬좈쐞魚됭뱤絶쏉쉈^ 11101001101010111011111111100110100000111010000011101000111110101000010011100110100000101000100111100111111011101011010111101001101111011000111011101000101011011011110111101001101010111011111111100101111000011010000011101001100111001000010011100101111000001000100111101000100100111000101011101111101111101001101111101111101111011010010101011110 e9abbfe683a0e8fa84e68289e7eeb5e9bd8ee8adbde9abbfe5e1a0e99c84e5e089e8938aefbe9befbda55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)