To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 蒸基??沚基??n}蒸基??沚基??n{^ 1000111111110110100010101110111000111111001111111001111110001101100010101110111000111111001111110110111001111101100011111111011010001010111011100011111100111111100111111000110110001010111011100011111100111111011011100111101101011110 8ff68aee3f3f9f8d8aee3f3f6e7d8ff68aee3f3f9f8d8aee3f3f6e7b5e
EUC-JP 蒸基??沚基??n}蒸基??沚基??n{^ 1011111011111000101101001111000000111111001111111101110111101101101101001111000000111111001111110110111001111101101111101111100010110100111100000011111100111111110111011110110110110100111100000011111100111111011011100111101101011110 bef8b4f03f3fddedb4f03f3f6e7dbef8b4f03f3fddedb4f03f3f6e7b5e
UTF-8 蒸基렰렣沚基렰렓n}蒸基렰렣沚基렰렓n{^ 1110100010010010101110001110010110011111101110101110101110100000101100001110101110100000101000111110011010110010100110101110010110011111101110101110101110100000101100001110101110100000100100110110111001111101111010001001001010111000111001011001111110111010111010111010000010110000111010111010000010100011111001101011001010011010111001011001111110111010111010111010000010110000111010111010000010010011011011100111101101011110 e892b8e59fbaeba0b0eba0a3e6b29ae59fbaeba0b0eba0936e7de892b8e59fbaeba0b0eba0a3e6b29ae59fbaeba0b0eba0936e7b5e
UHC 蒸基렰렣沚基렰렓n}蒸基렰렣沚基렰렓n{^ 11110001111110101101000011110001100011101011110110001110101101001111001010101111110100001111000110001110101111011000111010101000011011100111110111110001111110101101000011110001100011101011110110001110101101001111001010101111110100001111000110001110101111011000111010101000011011100111101101011110 f1fad0f18ebd8eb4f2afd0f18ebd8ea86e7df1fad0f18ebd8eb4f2afd0f18ebd8ea86e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)