To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ?基??趙基??n}?基??趙基??n{^ 001111111000101011101110001111110011111111100110111000101000101011101110001111110011111101101110011111010011111110001010111011100011111100111111111001101110001010001010111011100011111100111111011011100111101101011110 3f8aee3f3fe6e28aee3f3f6e7d3f8aee3f3fe6e28aee3f3f6e7b5e
EUC-JP 塼基??趙基??n}塼基??趙基??n{^ 10001111101110001011100110110100111100000011111100111111111011001110010010110100111100000011111100111111011011100111110110001111101110001011100110110100111100000011111100111111111011001110010010110100111100000011111100111111011011100111101101011110 8fb8b9b4f03f3fece4b4f03f3f6e7d8fb8b9b4f03f3fece4b4f03f3f6e7b5e
UTF-8 塼基렰렯趙基렰렮n}塼基렰렯趙基렰렮n{^ 1110010110100001101111001110010110011111101110101110101110100000101100001110101110100000101011111110100010110110100110011110010110011111101110101110101110100000101100001110101110100000101011100110111001111101111001011010000110111100111001011001111110111010111010111010000010110000111010111010000010101111111010001011011010011001111001011001111110111010111010111010000010110000111010111010000010101110011011100111101101011110 e5a1bce59fbaeba0b0eba0afe8b699e59fbaeba0b0eba0ae6e7de5a1bce59fbaeba0b0eba0afe8b699e59fbaeba0b0eba0ae6e7b5e
UHC 塼基렰렯趙基렰렮n}塼基렰렯趙基렰렮n{^ 11101110111101001101000011110001100011101011110110001110101111001111000011100001110100001111000110001110101111011000111010111011011011100111110111101110111101001101000011110001100011101011110110001110101111001111000011100001110100001111000110001110101111011000111010111011011011100111101101011110 eef4d0f18ebd8ebcf0e1d0f18ebd8ebb6e7deef4d0f18ebd8ebcf0e1d0f18ebd8ebb6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)