To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???鍮??節??倭???檍??諭??違??B 0011111100111111001111111110100001001010001111110011111110010000110111110011111100111111100110000110000000111111001111110011111110011110111110000011111100111111100101110100000000111111001111111000100011100001001111110011111101000010 3f3f3fe84a3f3f90df3f3f98603f3f3f9ef83f3f97403f3f88e13f3f42
EUC-JP ???鍮??節??倭???檍??諭??違??B 0011111100111111001111111110111110101011001111110011111111000000111000010011111100111111110011111100000100111111001111110011111111011100111110100011111100111111110011011010000100111111001111111011000011100011001111110011111101000010 3f3f3fefab3f3fc0e13f3fcfc13f3f3fdcfa3f3fcda13f3fb0e33f3f42
UTF-8 略노쵐鍮뽩츦節뤵렆倭먈귦벃檍용챷諭썹춯違꾨쳥B 11101111101001011011011011101011100001011011100011101100101101011001000011101001100011011010111011101011101111011010100111101100101110001010011011100111101011111000000011101011101001001011010111101011101000001000011011100101100000001010110111101011101010001000100011101010101101111010011011101011101100101000001111100110101010101000110111101100100110101010100111101100101100011011011111101000101010111010110111101100100011011011100111101100101101101010111111101001100000011001010111101010101111101010100011101100101100111010010101000010 efa5b6eb85b8ecb590e98daeebbda9ecb8a6e7af80eba4b5eba086e580adeba888eab7a6ebb283e6aa8dec9aa9ecb1b7e8abadec8db9ecb6afe98195eabea8ecb3a542
UHC 略노쵐鍮뽩츦節뤵렆倭먈귦벃檍용챷諭썹춯違꾨쳥B 111001011011001010110011111010111010110010010010111010111011100110010110111001011010111010011100111011111011110110001111111000111000111010100000111010001101111010111000110100011000001011101101100100111010100111100101111001011011111111101011101010101000010011101011101100011011110111100111101011011000110011101010110111101000010011101011101010111000101001000010 e5b2b3ebac92ebb996e5ae9cefbd8fe38ea0e8deb8d182ed93a9e5e5bfebaa84ebb1bde7ad8ceade84ebab8a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)