To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 兀??凹?????予?????掩??凹??^ 100110010101100100111111001111111000100110011010001111110011111100111111001111110011111110010111010111000011111100111111001111110011111100111111100010011000011000111111001111111000100110011010001111110011111101011110 99593f3f899a3f3f3f3f3f975c3f3f3f3f3f89863f3f899a3f3f5e
EUC-JP 兀??凹?????予?????掩??凹??^ 110100011011101000111111001111111011000111111010001111110011111100111111001111110011111111001101101111010011111100111111001111110011111100111111101100011110011000111111001111111011000111111010001111110011111101011110 d1ba3f3fb1fa3f3f3f3f3fcdbd3f3f3f3f3fb1e63f3fb1fa3f3f5e
UTF-8 兀볥젘凹싨븥杻쇤뙕予좄킇溜붼뙕掩롫뼹凹싨슑^ 11100101100001011000000011101011101100111010010111101100101000001001100011100101100001111011100111101100100010111010100011101011101110001010010111101111101001111000100011101100100001111010010011101011100110011001010111100100101110101000100011101100101000101000010011101101100000101000011111101111101001111000101111101011101101101011110011101011100110011001010111100110100011101010100111101011101000011010101111101011101111001011100111100101100001111011100111101100100010111010100011101100100010101001000101011110 e58580ebb3a5eca098e587b9ec8ba8ebb8a5efa788ec87a4eb9995e4ba88eca284ed8287efa78bebb6bceb9995e68ea9eba1abebbcb9e587b9ec8ba8ec8a915e
UHC 兀볥젘凹싨븥杻쇤뙕予좄킇溜붼뙕掩롫뼹凹싨슑^ 11101000101101001001001111101011101000001001010011101000111010101001101011100110100101011000111011101010111101001011110011101001100011001001101011100101111110001010000011101000101101001001001111101010111111101001010011101001100011001001101011100101111100111000111011101011100101101011110011101000111010101001101011100110100110101010000001011110 e8b493eba094e8ea9ae6958eeaf4bce98c9ae5f8a0e8b493eafe94e98c9ae5f38eeb96bce8ea9ae69aa05e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (3f) in the table.
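
The table above can be approximated offline with a short Python sketch. It is not the converter's own implementation: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, and Python's codecs may differ from the converter on a few vendor-specific characters. Unencodable characters are replaced with "?", matching the 3f bytes seen in the table.

```python
# Hypothetical helper: encode a string in several charsets and show the
# resulting bytes as a binary string and as a hexadecimal string.

# (label shown in the table, Python codec name assumed to correspond to it)
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return a list of (label, binary string, hex string) rows for text."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the codec lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("兀^"):
        print(label, bits, hexstr)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.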