What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????喩??俉??誼f?喩??哀??? 00111111001111110011111100111111001111110011111110011010011001110011111100111111111110100110000100111111001111111000101101100010100000101000011000111111100110100110011100111111001111111000100010100011001111110011111100111111 3f3f3f3f3f3f9a673f3ffa613f3f8b6282863f9a673f3f88a33f3f3f
EUC-JP 孼?????喩??俉??誼f?喩??哀??? 10001111101110101100001100111111001111110011111100111111001111111101001111001000001111110011111110001111101100011011101100111111001111111011010111000011101000111110011000111111110100111100100000111111001111111011000010100101001111110011111100111111 8fbac33f3f3f3f3fd3c83f3f8fb1bb3f3fb5c3a3e63fd3c83f3fb0a53f3f3f
UTF-8 孼띠뢿吏섉젔喩쎼렋俉뤿뗀誼f븭喩붾눂哀넘뗫깻 111001011010110110111100111010111001110110100000111010111010001010111111111011111010011110011110111011001000010010001001111011001010000010010100111001011001011010101001111011001000111010111100111010111010000010001011111001001011111110001001111010111010010010111111111010111001011110000000111010001010101010111100111011111011110110000110111010111011100010101101111001011001011010101001111010111011011010111110111010111000100010000010111001011001001110000000111010111000010010011000111010111001011110101011111010101011100110111011 e5adbceb9da0eba2bfefa79eec8489eca094e596a9ec8ebceba08be4bf89eba4bfeb9780e8aabcefbd86ebb8ade596a9ebb6beeb8882e59380eb8498eb97abeab9bb
UHC 孼띠뢿吏섉젔喩쎼렋俉뤿뗀誼f븭喩붾눂哀넘뗫깻 1110010111101101101101101110110010001111100000101110110010100111100110001110011010100000100100101110101011100111100110111110001110001110101000101110011111101011100011111110101110110110101111101110101111111110101000111110011010010101100101101110101011100111100101001110101110000111101000111110010011101110101100111101000110001011111010111011001010100010 e5edb6ec8f82eca798e6a092eae79be38ea2e7eb8febb6beebfea3e69596eae794eb87a3e4eeb3d18bebb2a2

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
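
The binary and hexadecimal columns of the table can be approximated with a short Python sketch. It is not the tool's own implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is replacing unencodable characters with "?" (0x3f), which the runs of `3f` in the table suggest. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f);
    this replacement policy is an assumption about the tool's behavior.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hex bit string.
    for label, (binary, hexadecimal) in encode_table("あ").items():
        print(label, binary, hexadecimal)
```

For example, `encode_row("喩", "utf-8")` yields the hex string `e596a9`, which matches the bytes shown for that character in the UTF-8 row, while `encode_row("あ", "iso-8859-1")` yields `3f` because ISO-8859-1 cannot represent the character.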