To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 狎????????泣??邑????????^ 11100000101111100011111100111111001111110011111100111111001111110011111100111111100010111000001100111111001111111001011101010111001111110011111100111111001111110011111100111111001111110011111101011110 e0be3f3f3f3f3f3f3f3f8b833f3f97573f3f3f3f3f3f3f3f5e
EUC-JP 狎????????泣??邑??絪?????^ 111000001100000000111111001111110011111100111111001111110011111100111111001111111011010111100011001111110011111111001101101110000011111100111111100011111101001111101100001111110011111100111111001111110011111101011110 e0c03f3f3f3f3f3f3f3fb5e33f3fcdb83f3f8fd3ec3f3f3f3f3f5e
UTF-8 狎띠떜杻듯걫捻뀀챷泣졾넼邑뀀뼲絪붺솾紐꾩뒥^ 11100111100010111000111011101011100111011010000011101011100101101001110011101111101001111000100011101011100100111010111111101010101100011010101111101111101001101010010011101011100000001000000011101100101100011011011111100110101100111010001111101100101000011011111011101011100001001011110011101001100000101001000111101011100000001000000011101011101111001011001011100111101101011010101011101011101101101011101011101100100001101011111011101111101001111000111111101010101111101010100111101011100100101010010101011110 e78b8eeb9da0eb969cefa788eb93afeab1abefa6a4eb8080ecb1b7e6b3a3eca1beeb84bce98291eb8080ebbcb2e7b5aaebb6baec86beefa78feabea9eb92a55e
UHC 狎띠떜杻듯걫捻뀀챷泣졾넼邑뀀뼲絪붺솾紐꾩뒥^ 11100100111001001011011011101100100010111011001011101010111101001011010111101101100000011001010011100110111101111011001011101011101010101000010011101011111010001010000011100101100001101011011011101011111010011011001011101011100101101011010111101100110111111001010011100111100110011011001011101011101010101000010011101100100010101010000001011110 e4e4b6ec8bb2eaf4b5ed8194e6f7b2ebaa84ebe8a0e586b6ebe9b2eb96b5ecdf94e799b2ebaa84ec8aa05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)