To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 橈??油?????瑤???〓?檍??柚?h 100111101111010000111111001111111001011011111011001111110011111100111111001111110011111111101010101000100011111100111111001111111000000110101100001111111001111011111000001111110011111110010111010011010011111101101000 9ef43f3f96fb3f3f3f3f3feaa23f3f3f81ac3f9ef83f3f974d3f68
EUC-JP 橈??油?????瑤??洹〓?檍??柚?h 1101110011110110001111110011111111001100111111010011111100111111001111110011111100111111111101001010010000111111001111111000111111000111101110101010001010101110001111111101110011111010001111110011111111001101101011100011111101101000 dcf63f3fccfd3f3f3f3f3ff4a43f3f8fc7baa2ae3fdcfa3f3fcdae3f68
UTF-8 橈볥굝油꾢래類욌짎瑤녹깪洹〓눀檍용갭柚엎h 11100110101010011000100011101011101100111010010111101010101101011001110111100110101100101011100111101010101111101010001011101011100111101001100011101111101001111001000011101100100110101000110011101100101001111000111011100111100100011010010011101011100001011011100111101010101110011010101011100110101101001011100111100011100000001001001111101011100010001000000011100110101010101000110111101100100110101010100111101010101100001010110111100110100111111001101011101100100101111000111001101000 e6a988ebb3a5eab59de6b2b9eabea2eb9e98efa790ec9a8ceca78ee791a4eb85b9eab9aae6b4b9e38093eb8880e6aa8dec9aa9eab0ade69f9aec978e68
UHC 橈볥굝油꾢래類욌짎瑤녹깪洹〓눀檍용갭柚엎h 1110100011111010100100111110101110000010100001011110101011111010100001001110010110110111101000011110101110111010100111101110101110100011100110101110100011111101101100111110110010000011100110101110101010110111101000011110101110000111101000011110010111100101101111111110101110110000101110001110101011110110101111101111111001101000 e8fa93eb8285eafa84e5b7a1ebba9eeba39ae8fdb3ec839aeab7a1eb87a1e5e5bfebb0b8eaf6befe68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)