To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 趙????企?絲??趙????企?絲??^ 111001101110001000111111001111110011111100111111100010101110100100111111111000110100111000111111001111111110011011100010001111110011111100111111001111111000101011101001001111111110001101001110001111110011111101011110 e6e23f3f3f3f8ae93fe34e3f3fe6e23f3f3f3f8ae93fe34e3f3f5e
EUC-JP 趙????企?絲??趙????企?絲??^ 111011001110010000111111001111110011111100111111101101001110101100111111111001011010111100111111001111111110110011100100001111110011111100111111001111111011010011101011001111111110010110101111001111110011111101011110 ece43f3f3f3fb4eb3fe5af3f3fece43f3f3f3fb4eb3fe5af3f3f5e
UTF-8 趙얜렰렋林企렕絲렜빳趙얜렰렋林企렕絲렜빳^ 11101000101101101001100111101100100101101001110011101011101000001011000011101011101000001000101111101111101001111011010011100100101111001000000111101011101000001001010111100111101101011011001011101011101000001001110011101011101110011011001111101000101101101001100111101100100101101001110011101011101000001011000011101011101000001000101111101111101001111011010011100100101111001000000111101011101000001001010111100111101101011011001011101011101000001001110011101011101110011011001101011110 e8b699ec969ceba0b0eba08befa7b4e4bc81eba095e7b5b2eba09cebb9b3e8b699ec969ceba0b0eba08befa7b4e4bc81eba095e7b5b2eba09cebb9b35e
UHC 趙얜렰렋林企렕絲렜빳趙얜렰렋林企렕絲렜빳^ 1111000011100001101111101110101110001110101111011000111010100010111011001111011111010000111010101000111010101010110111101110101010001110101011101011101110100101111100001110000110111110111010111000111010111101100011101010001011101100111101111101000011101010100011101010101011011110111010101000111010101110101110111010010101011110 f0e1beeb8ebd8ea2ecf7d0ea8eaadeea8eaebba5f0e1beeb8ebd8ea2ecf7d0ea8eaadeea8eaebba55e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
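
The conversion shown in the table can be approximated offline with a few lines of Python. This is a minimal sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `to_bits` and `encode_row` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the rows above.

```python
# Charset label from the table -> Python codec name.
# These codec choices are assumptions of this sketch.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as 8 binary digits and concatenate them."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (displayed text, binary string, hex string) for one charset."""
    # errors="replace" substitutes "?" for each unencodable character.
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = data.decode(codec, errors="replace")
    return shown, to_bits(data), data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row("趙^", codec)
        print(label, shown, bits, hexed)
```

Each hexadecimal digit corresponds to four binary digits, so the binary column is always four times as long as the hexadecimal column.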