What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????m}????????m{^ 001111110011111100111111001111110011111100111111001111110011111101101101011111010011111100111111001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f3f3f3f3f3f6d7d3f3f3f3f3f3f3f3f6d7b5e
SJIS-WIN 頂戡??沚基??m}頂戡??沚基??m{^ 1001001010111000100111010100000100111111001111111001111110001101100010101110111000111111001111110110110101111101100100101011100010011101010000010011111100111111100111111000110110001010111011100011111100111111011011010111101101011110 92b89d413f3f9f8d8aee3f3f6d7d92b89d413f3f9f8d8aee3f3f6d7b5e
EUC-JP 頂戡??沚基??m}頂戡??沚基??m{^ 1100010010111010110110011010001000111111001111111101110111101101101101001111000000111111001111110110110101111101110001001011101011011001101000100011111100111111110111011110110110110100111100000011111100111111011011010111101101011110 c4bad9a23f3fddedb4f03f3f6d7dc4bad9a23f3fddedb4f03f3f6d7b5e
UTF-8 頂戡렰렕沚基렰렓m}頂戡렰렕沚基렰렓m{^ 1110100110100000100000101110011010001000101000011110101110100000101100001110101110100000100101011110011010110010100110101110010110011111101110101110101110100000101100001110101110100000100100110110110101111101111010011010000010000010111001101000100010100001111010111010000010110000111010111010000010010101111001101011001010011010111001011001111110111010111010111010000010110000111010111010000010010011011011010111101101011110 e9a082e688a1eba0b0eba095e6b29ae59fbaeba0b0eba0936d7de9a082e688a1eba0b0eba095e6b29ae59fbaeba0b0eba0936d7b5e
UHC 頂戡렰렕沚基렰렓m}頂戡렰렕沚基렰렓m{^ 11110000101000101100101011110001100011101011110110001110101010101111001010101111110100001111000110001110101111011000111010101000011011010111110111110000101000101100101011110001100011101011110110001110101010101111001010101111110100001111000110001110101111011000111010101000011011010111101101011110 f0a2caf18ebd8eaaf2afd0f18ebd8ea86d7df0a2caf18ebd8eaaf2afd0f18ebd8ea86d7b5e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
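
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names CHARSETS, to_hex and to_bits are hypothetical, and the mapping from the table's charset labels to Python codec names (SJIS-win to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the unencodable characters appear in the table.

```python
# Hypothetical mapping from the charset labels in the table to Python codec
# names. The SJIS-win -> cp932 and UHC -> cp949 pairings are assumptions.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_hex(text: str, codec: str) -> str:
    """Encode text with the given codec and return the bytes as hex digits.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    return text.encode(codec, errors="replace").hex()


def to_bits(text: str, codec: str) -> str:
    """Encode text with the given codec and return the bytes as a bit string."""
    data = text.encode(codec, errors="replace")
    # Each byte is rendered as exactly eight binary digits.
    return "".join(format(byte, "08b") for byte in data)


if __name__ == "__main__":
    sample = "頂m}"
    for label, codec in CHARSETS.items():
        print(label, to_bits(sample, codec), to_hex(sample, codec))
```

Calling to_hex("頂", "utf-8") gives e9a082, which matches the first three bytes of the UTF-8 row above, and to_hex("頂", "iso-8859-1") gives 3f because ISO-8859-1 has no code for that character.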