To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????塋k??????????塋k?塋 001111110011111100111111001111110011111100111111100110101100100010000010100010110011111100111111001111110011111100111111001111110011111100111111001111110011111110011010110010001000001010001011001111111001101011001000 3f3f3f3f3f3f9ac8828b3f3f3f3f3f3f3f3f3f3f9ac8828b3f9ac8
EUC-JP ??????塋k??????????塋k?塋 001111110011111100111111001111110011111100111111110101001100101010100011111010110011111100111111001111110011111100111111001111110011111100111111001111110011111111010100110010101010001111101011001111111101010011001010 3f3f3f3f3f3fd4caa3eb3f3f3f3f3f3f3f3f3f3fd4caa3eb3fd4ca
UTF-8 療귥뼶溜곕젡塋k뙎溜곕젚療귥뼶溜곕젡塋k떩塋 111011111010011110000001111010101011011110100101111010111011110010110110111011111010011110001011111010101011001110010101111011001010000010100001111001011010000110001011111011111011110110001011111010111001100110001110111011111010011110001011111010101011001110010101111011001010000010011010111011111010011110000001111010101011011110100101111010111011110010110110111011111010011110001011111010101011001110010101111011001010000010100001111001011010000110001011111011111011110110001011111010111001011010101001111001011010000110001011 efa781eab7a5ebbcb6efa78beab395eca0a1e5a18befbd8beb998eefa78beab395eca09aefa781eab7a5ebbcb6efa78beab395eca0a1e5a18befbd8beb96a9e5a18b
UHC 療귥뼶溜곕젡塋k뙎溜곕젚療귥뼶溜곕젡塋k떩塋 1110100011111110100000101110110010010110101110011110101011111110101100001110101110100000100110101110011110101011101000111110101110001100100100111110101011111110101100001110101110100000100101101110100011111110100000101110110010010110101110011110101011111110101100001110101110100000100110101110011110101011101000111110101110001011101110111110011110101011 e8fe82ec96b9eafeb0eba09ae7aba3eb8c93eafeb0eba096e8fe82ec96b9eafeb0eba09ae7aba3eb8bbbe7ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)