What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 縞?ず?縞?ず?n}縞?ず?縞?ず?n{^ 1000111011001000001111111000001010111000001111111000111011001000001111111000001010111000001111110110111001111101100011101100100000111111100000101011100000111111100011101100100000111111100000101011100000111111011011100111101101011110 8ec83f82b83f8ec83f82b83f6e7d8ec83f82b83f8ec83f82b83f6e7b5e
EUC-JP 縞?ず?縞?ず?n}縞?ず?縞?ず?n{^ 1011110011001010001111111010010010111010001111111011110011001010001111111010010010111010001111110110111001111101101111001100101000111111101001001011101000111111101111001100101000111111101001001011101000111111011011100111101101011110 bcca3fa4ba3fbcca3fa4ba3f6e7dbcca3fa4ba3fbcca3fa4ba3f6e7b5e
UTF-8 縞덞ず렩縞덞ず렓n}縞덞ず렩縞덞ず렓n{^ 1110011110111000100111101110101110001101100111101110001110000001100110101110101110100000101010011110011110111000100111101110101110001101100111101110001110000001100110101110101110100000100100110110111001111101111001111011100010011110111010111000110110011110111000111000000110011010111010111010000010101001111001111011100010011110111010111000110110011110111000111000000110011010111010111010000010010011011011100111101101011110 e7b89eeb8d9ee3819aeba0a9e7b89eeb8d9ee3819aeba0936e7de7b89eeb8d9ee3819aeba0a9e7b89eeb8d9ee3819aeba0936e7b5e
UHC 縞덞ず렩縞덞ず렓n}縞덞ず렩縞덞ず렓n{^ 11111011110101101011010011111011101010101011101010001110101101111111101111010110101101001111101110101010101110101000111010101000011011100111110111111011110101101011010011111011101010101011101010001110101101111111101111010110101101001111101110101010101110101000111010101000011011100111101101011110 fbd6b4fbaaba8eb7fbd6b4fbaaba8ea86e7dfbd6b4fbaaba8eb7fbd6b4fbaaba8ea86e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
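
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that characters a charset cannot represent are replaced with `?` (byte 3f), which matches the 3f bytes in the table. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's row labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hex bit string.
    for name, (bits, hex_string) in convert("縞ずn}").items():
        print(name, bits, hex_string)
```

ASCII characters such as `n` and `}` produce the same bytes in all five charsets, while a character missing from a charset collapses to a single `?`, which is why the ISO-8859-1 row consists mostly of 3f.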