What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 穽?旨葛?程?郵?陰◇穽?旨葛?程?郵?陰●^ 11100010011101100011111110001110011111001000101010001011001111111001001011110110001111111001011101011000001111111000100101000001100000011001111011100010011101100011111110001110011111001000101010001011001111111001001011110110001111111001011101011000001111111000100101000001100000011001110001011110 e2763f8e7c8a8b3f92f63f97583f8941819ee2763f8e7c8a8b3f92f63f97583f8941819c5e
EUC-JP 穽?旨葛?程?郵?陰◇穽?旨葛?程?郵?陰●^ 11100011110101110011111110111011110111011011001111101011001111111100010011111000001111111100110110111001001111111011000110100010101000011111111011100011110101110011111110111011110111011011001111101011001111111100010011111000001111111100110110111001001111111011000110100010101000011111110001011110 e3d73fbbddb3eb3fc4f83fcdb93fb1a2a1fee3d73fbbddb3eb3fc4f83fcdb93fb1a2a1fc5e
UTF-8 穽렗旨葛밞程렣郵렮陰◇穽렗旨葛밞程렣郵렮陰●^ 11100111101010011011110111101011101000001001011111100110100101111010100011101000100100011001101111101011101100001001111011100111101010001000101111101011101000001010001111101001100000111011010111101011101000001010111011101001100110011011000011100010100101111000011111100111101010011011110111101011101000001001011111100110100101111010100011101000100100011001101111101011101100001001111011100111101010001000101111101011101000001010001111101001100000111011010111101011101000001010111011101001100110011011000011100010100101111000111101011110 e7a9bdeba097e697a8e8919bebb09ee7a88beba0a3e983b5eba0aee999b0e29787e7a9bdeba097e697a8e8919bebb09ee7a88beba0a3e983b5eba0aee999b0e2978f5e
UHC 穽렗旨葛밞程렣郵렮陰◇穽렗旨葛밞程렣郵렮陰●^ 111011111111000010001110101011001111001010101001110010101110011110111001111000011110111111101111100011101011010011101001111010001000111010111011111010111110010010100001110111101110111111110000100011101010110011110010101010011100101011100111101110011110000111101111111011111000111010110100111010011110100010001110101110111110101111100100101000011101110001011110 eff08eacf2a9cae7b9e1efef8eb4e9e88ebbebe4a1deeff08eacf2a9cae7b9e1efef8eb4e9e88ebbebe4a1dc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
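
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and the Python codecs may differ from the page's converter for a few characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one row per charset, in the same column order as the table.
sample = "◇^"
for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_to_bits(sample, codec)
    print(label, sample, binary, hexadecimal)
```

For the sample "◇^", the hexadecimal column should end in the same bytes as the corresponding rows of the table (for example `e297875e` for UTF-8), since "^" is 0x5e in every charset listed.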