To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????}B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
SJIS-WIN ????????????而????????}B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100011101010011100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f8ea73f3f3f3f3f3f3f3f7d42
EUC-JP ????????????而????????}B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111101111001010100100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3fbca93f3f3f3f3f3f3f3f7d42
UTF-8 溜삳젗溜븍젻溜뷰슴溜쀫졋而댁쓻溜띾졋栒붾젺}B 1110111110100111100010111110110010000010101100111110110010100000100101111110111110100111100010111110101110111000100011011110110010100000101110111110111110100111100010111110101110110111101100001110110010001010101101001110111110100111100010111110110010000000101010111110110010100001100010111110100010000000100011001110101110001100100000011110110010010011101110111110111110100111100010111110101110011101101111101110110010100001100010111110011010100000100100101110101110110110101111101110110010100000101110100111110101000010 efa78bec82b3eca097efa78bebb88deca0bbefa78bebb7b0ec8ab4efa78bec80abeca18be8808ceb8c81ec93bbefa78beb9dbeeca18be6a092ebb6beeca0ba7d42
UHC 溜삳젗溜븍젻溜뷰슴溜쀫졋而댁쓻溜띾졋栒붾젺}B 1110101011111110101110111110101110100000100100111110101011111110101110101110101110100000101011101110101011111110101110101110010010111101101111111110101011111110100101111110101110100000101110101110110010111011101101001110110010011101100101101110101011111110100011011110101110100000101110101110001011100011100101001110101110100000101011010111110101000010 eafebbeba093eafebaeba0aeeafebae4bdbfeafe97eba0baecbbb4ec9d96eafe8deba0bae2e394eba0ad7d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)