Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鴦??????徇?.猷?????淫ф?音??^ 111010011111000100111111001111110011111100111111001111110011111110011100011011010011111110000001010001001001011101010001001111110011111100111111001111110011111110001000111110101000010010000110001111111000100110111001001111110011111101011110 e9f13f3f3f3f3f3f9c6d3f814497513f3f3f3f3f88fa84863f89b93f3f5e
EUC-JP 鴦??????徇?.猷?????淫ф?音??^ 111100101111001100111111001111110011111100111111001111110011111111010111110011100011111110100001101001011100110110110010001111110011111100111111001111110011111110110000111111001010011111100110001111111011001010111011001111110011111101011110 f2f33f3f3f3f3f3fd7ce3fa1a5cdb23f3f3f3f3fb0fca7e63fb2bb3f3f5e
UTF-8 鴦꾆뀀룱烈쀫쓧徇쒒.猷잛뒙曆욁굤淫ф쾬音깅뙉^ 111010011011010010100110111010101011111010000110111010111000000010000000111010111010001110110001111011111010011010011111111011001000000010101011111011001001001110100111111001011011111010000111111011001001001010010010111011111011110010001110111001111000110010110111111011001001111010011011111010111001001010011001111011111010011010001011111011001001101010000001111010101011010110100100111001101011011110101011110100011000010011101100101111101010110011101001100111111011001111101010101110011000010111101011100110011000100101011110 e9b4a6eabe86eb8080eba3b1efa69fec80abec93a7e5be87ec9292efbc8ee78cb7ec9e9beb9299efa68bec9a81eab5a4e6b7abd184ecbeace99fb3eab985eb99895e
UHC 鴦꾆뀀룱烈쀫쓧徇쒒.猷잛뒙曆욁굤淫ф쾬音깅뙉^ 111001001110110010000100110011101011001011101011100011111010011011100110111011111001011111101011100111011000100011100010110111111001110011101001101000111010111011101011101000111001111111101100100010101001011011100110101101111001111011100011100000101000101011101011111000101010110011100110101100101000001111101011111001011011000111101011100011001000111001011110 e4ec84ceb2eb8fa6e6ef97eb9d88e2df9ce9a3aeeba39fec8a96e6b79ee3828aebe2ace6b283ebe5b1eb8c8e5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (3f).
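
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's standard codec names, and pairing them with the table's labels is an assumption. The helper names `encode_row` and `convert` are hypothetical. Unencodable characters are replaced with "?" to mirror the 3f bytes seen above.

```python
# Table label -> Python codec name (this pairing is an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" turns each unencodable character into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


# Print one row per charset, in the same column order as the table.
for label, (shown, bits, hexed) in convert("鴦^").items():
    print(label, shown, bits, hexed)
```

Passing a longer string to `convert` yields one row per charset in the same column order as the table: charset, character, binary, hexadecimal.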