To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 靖?鷹?淫?璃乙?靖?鷹?淫?璃淫?^ 1001011011110101001111111001000111101001001111111000100011111010001111111001011110011110100010011011001100111111100101101111010100111111100100011110100100111111100010001111101000111111100101111001111010001000111110100011111101011110 96f53f91e93f88fa3f979e89b33f96f53f91e93f88fa3f979e88fa3f5e
EUC-JP 靖?鷹?淫?璃乙?靖?鷹?淫?璃淫?^ 1100110011110111001111111100001011101011001111111011000011111100001111111100110111111110101100101011010100111111110011001111011100111111110000101110101100111111101100001111110000111111110011011111111010110000111111000011111101011110 ccf73fc2eb3fb0fc3fcdfeb2b53fccf73fc2eb3fb0fc3fcdfeb0fc3f5e
UTF-8 靖렋鷹렭淫브璃乙렊靖렋鷹렭淫브璃淫렢^ 11101001100111011001011011101011101000001000101111101001101101111011100111101011101000001010110111100110101101111010101111101011101110001000110011100111100100101000001111100100101110011001100111101011101000001000101011101001100111011001011011101011101000001000101111101001101101111011100111101011101000001010110111100110101101111010101111101011101110001000110011100111100100101000001111100110101101111010101111101011101000001010001001011110 e99d96eba08be9b7b9eba0ade6b7abebb88ce79283e4b999eba08ae99d96eba08be9b7b9eba0ade6b7abebb88ce79283e6b7abeba0a25e
UHC 靖렋鷹렭淫브璃乙렊靖렋鷹렭淫브璃淫렢^ 11101111111111101000111010100010111010111110110110001110101110101110101111100010101110101110101011010111111000111110101111100000100011101010000111101111111111101000111010100010111010111110110110001110101110101110101111100010101110101110101011010111111000111110101111100010100011101011001101011110 effe8ea2ebed8ebaebe2baead7e3ebe08ea1effe8ea2ebed8ebaebe2baead7e3ebe28eb35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)