What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?音?????悠??恂れ??λ?竊 001111110011111100111111100010111000001110000001101010000011111110001001101110010011111100111111001111110011111100111111100101110100100100111111001111111001110010010110100000101110101000111111001111111000001111001001001111111110001010000110 3f3f3f8b8381a83f89b93f3f3f3f3f97493f3f9c9682ea3f3f83c93fe286
EUC-JP ???泣→?音??孼??悠??恂れ??λ?竊 0011111100111111001111111011010111100011101000101010101000111111101100101011101100111111001111111000111110111010110000110011111100111111110011011010101000111111001111111101011111110110101001001110110000111111001111111010011011001011001111111110001111100110 3f3f3fb5e3a2aa3fb2bb3f3f8fbac33f3fcdaa3f3fd7f6a4ec3f3fa6cb3fe3e6
UTF-8 捻꿔끇泣→쨫音좎쭍孼꾩뮇悠뷴톹恂れ뫊若λ겭竊 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010000110100100101110110010101000101010111110100110011111101100111110110010100010100011101110110010101101100011011110010110101101101111001110101010111110101010011110101110101110100001111110011010000010101000001110101110110111101101001110110110000110101110011110011010000001100000101110001110000010100011001110101110101011100010101110111110100101101101001100111010111011111010101011001010101101111001111010101110001010 efa6a4eabf94eb8187e6b3a3e28692eca8abe99fb3eca28eecad8de5adbceabea9ebae87e682a0ebb7b4ed86b9e68182e3828cebab8aefa5b4cebbeab2ade7ab8a
UHC 捻꿔끇泣→쨫音좎쭍孼꾩뮇悠뷴톹恂れ뫊若λ겭竊 1110011011110111101100101110001110000101101110111110101111101000101000011110011010100100100001011110101111100101101000001110110010100111100001101110010111101101100001001110110010010010100101101110101011101101101110101110010110110111100011011110001011100001101010101110110010010001101011001110010110101110101001011110101110000001101110111110111110111100 e6f7b2e385bbebe8a1e6a485ebe5a0eca786e5ed84ec9296eaedbae5b78de2e1aaec91ace5aea5eb81bbefbc

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
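
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical, the codec names are Python's own, and mapping UHC to `cp949` is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to match the 3f runs in the table above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hex string.
    for label, (bits, hex_string) in encode_table("音").items():
        print(label, bits, hex_string)
```

For the single character 音, this sketch yields 89b9 for SJIS-WIN, b2bb for EUC-JP, and e99fb3 for UTF-8, the same byte sequences that appear for that character in the table rows above. ISO-8859-1 cannot represent it and yields 3f.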