To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲??唯?????癲??應??罐???臾??^ 1110000110011111001111110011111110010111010000100011111100111111001111110011111100111111111000011001111100111111001111111001110011100100001111110011111111100011101000110011111100111111001111111110010001101011001111110011111101011110 e19f3f3f97423f3f3f3f3fe19f3f3f9ce43f3fe3a33f3f3fe46b3f3f5e
EUC-JP 癲??唯?????癲??應??罐???臾??^ 1110001010100001001111110011111111001101101000110011111100111111001111110011111100111111111000101010000100111111001111111101100011100110001111110011111111100110101001010011111100111111001111111110011111001100001111110011111101011110 e2a13f3fcda33f3f3f3f3fe2a13f3fd8e63f3fe6a53f3f3fe7cc3f3f5e
UTF-8 癲용뿨唯녽굲紐뚰뭽癲됱띂應섌뒽罐六쀧독臾됯도^ 11100111100110011011001011101100100110101010100111101011101111111010100011100101100101001010111111101011100001011011110111101010101101011011001011101111101001111000111111101011100110101011000011101011101011011011110111100111100110011011001011101011100100001011000111101011100111011000001011100110100001111000100111101100100001001000110011101011100100101011110111100111101111011001000011101111101001111001000111101100100000001010011111101011100011111000010111101000100001111011111011101011100100001010111111101011100011111000010001011110 e799b2ec9aa9ebbfa8e594afeb85bdeab5b2efa78feb9ab0ebadbde799b2eb90b1eb9d82e68789ec848ceb92bde7bd90efa791ec80a7eb8f85e887beeb90afeb8f845e
UHC 癲용뿨唯녽굲紐뚰뭽癲됱띂應섌뒽罐六쀧독臾됯도^ 111011111010011010111111111010111001011110101000111010101110011010000110111010011000001010010101111010111010101010001100111011011001001010001100111011111010011010001001111011001000110110111101111010111110101110011000111010011000101010110011110011101011100011101011101110111001011111100111101101011011011011101011101011001000100111101010101101011011010101011110 efa6bfeb97a8eae686e98295ebaa8ced928cefa689ec8dbdebeb98e98ab3ceb8ebbb97e7b5b6ebac89eab5b55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)