To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 逾賀ウ後欧諤ィ閭嘲逾賀ウ後欧諤ィ閭嘴^ 111001111010010110001001111010101011001110001100111000111000100110100010111001101000000010101000111010001000001110011010011111011110011110100101100010011110101010110011100011001110001110001001101000101110011010000000101010001110100010000011100110100111101101011110 e7a589eab38ce389a2e680a8e8839a7de7a589eab38ce389a2e680a8e8839a7b5e
EUC-JP 逾賀ウ後欧諤ィ閭嘲逾賀ウ後欧諤ィ閭嘴^ 11101110101001111011001011101100100011101011001110111000111001011011001010100100111010111110000010001110101010001110111111100011110100111101111011101110101001111011001011101100100011101011001110111000111001011011001010100100111010111110000010001110101010001110111111100011110100111101110001011110 eea7b2ec8eb3b8e5b2a4ebe08ea8efe3d3deeea7b2ec8eb3b8e5b2a4ebe08ea8efe3d3dc5e
UTF-8 逾賀ウ後欧諤ィ閭嘲逾賀ウ後欧諤ィ閭嘴^ 11101001100000001011111011101000101100111000000011101111101111011011001111100101101111101000110011100110101011001010011111101000101010111010010011101111101111011010100011101001100101101010110111100101100110001011001011101001100000001011111011101000101100111000000011101111101111011011001111100101101111101000110011100110101011001010011111101000101010111010010011101111101111011010100011101001100101101010110111100101100110001011010001011110 e980bee8b380efbdb3e5be8ce6aca7e8aba4efbda8e996ade598b2e980bee8b380efbdb3e5be8ce6aca7e8aba4efbda8e996ade598b45e
UHC 逾賀?後???閭嘲逾賀?後???閭嘴^ 1110101110110101111110011100010100111111111111011010110100111111001111110011111111010101111011111111000010111111111010111011010111111001110001010011111111111101101011010011111100111111001111111101010111101111111101101010010001011110 ebb5f9c53ffdad3f3f3fd5eff0bfebb5f9c53ffdad3f3f3fd5eff6a45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)