To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 跌ク螻櫁゚懃矯襄丞穀跌ク豺。褄臥矯雉 1110011011101001101110001110010110110001100111101110100011011111100111001110011110001011101110001110010111110101100011111110010110001101100100101110011011101001101110001110011010110111101000011110010111101011100010011110011110001011101110001110100010110011 e6e9b8e5b19ee8df9ce78bb8e5f58fe58d92e6e9b8e6b7a1e5eb89e78bb8e8b3
EUC-JP 跌ク螻櫁゚懃矯襄丞穀跌ク豺。褄臥矯雉 111011001110101110001110101110001110101010110011110111001110101010001110110111111101100011101001101101101011101011101010111101111011111011100111101110011111001011101100111010111000111010111000111011001011100110001110101000011110101011101101101100101110100110110110101110101111000010110101 eceb8eb8eab3dcea8edfd8e9b6baeaf7bee7b9f2eceb8eb8ecb98ea1eaedb2e9b6baf0b5
UTF-8 跌ク螻櫁゚懃矯襄丞穀跌ク豺。褄臥矯雉 111010001011011110001100111011111011110110111000111010001001111010111011111001101010101110000001111011111011111010011111111001101000011110000011111001111001111110101111111010001010010110000100111001001011100010011110111001111010100110000000111010001011011110001100111011111011110110111000111010001011000110111010111011111011110110100001111010001010010010000100111010001000011110100101111001111001111110101111111010011001101110001001 e8b78cefbdb8e89ebbe6ab81efbe9fe68783e79fafe8a584e4b89ee7a980e8b78cefbdb8e8b1baefbda1e8a484e887a5e79fafe99b89
UHC 跌????懃矯襄丞穀跌?豺??臥矯雉 1111001011110110001111110011111100111111001111111101000011000100110011101110110011100101110100011110001110101010110011011101101011110010111101100011111111100011110011110011111100111111111010001100001011001110111011001111011011001011 f2f63f3f3f3fd0c4ceece5d1e3aacddaf2f63fe3cf3f3fe8c2ceecf6cb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)