What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陷溢噬墲ゐ譽キ妺厭陷溢噬墲ゐ譽キ妺閲^ 1110100010011100100010001110110010011010100000111111101010011110100000101110111011100110101000111011011111111010101001011000100101111101111010001001110010001000111011001001101010000011111110101001111010000010111011101110011010100011101101111111101010100101100010010111101101011110 e89c88ec9a83fa9e82eee6a3b7faa5897de89c88ec9a83fa9e82eee6a3b7faa5897b5e
EUC-JP 陷溢噬墲ゐ譽キ妺厭陷溢噬墲ゐ譽キ妺閲^ 1110111111111100101100001110111011010011111000111000111110111000110011101010010011110000111011001010010110001110101101111000111110111001101101111011000111011110111011111111110010110000111011101101001111100011100011111011100011001110101001001111000011101100101001011000111010110111100011111011100110110111101100011101110001011110 effcb0eed3e38fb8cea4f0eca58eb78fb9b7b1deeffcb0eed3e38fb8cea4f0eca58eb78fb9b7b1dc5e
UTF-8 陷溢噬墲ゐ譽キ妺厭陷溢噬墲ゐ譽キ妺閲^ 11101001100110011011011111100110101110101010001011100101100110011010110011100101101000101011001011100011100000101001000011101000101011011011110111101111101111011011011111100101101001101011101011100101100011101010110111101001100110011011011111100110101110101010001011100101100110011010110011100101101000101011001011100011100000101001000011101000101011011011110111101111101111011011011111100101101001101011101011101001100101101011001001011110 e999b7e6baa2e599ace5a2b2e38290e8adbdefbdb7e5a6bae58eade999b7e6baa2e599ace5a2b2e38290e8adbdefbdb7e5a6bae996b25e
UHC 陷溢??ゐ譽??厭陷溢??ゐ譽???^ 11111001111010001110110011101110001111110011111110101010111100001110011111100010001111110011111111100110111101001111100111101000111011001110111000111111001111111010101011110000111001111110001000111111001111110011111101011110 f9e8ecee3f3faaf0e7e23f3fe6f4f9e8ecee3f3faaf0e7e23f3f3f5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
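
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation (which is not shown on this page). The mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_to_bits` and `convert`. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the ISO-8859-1 row above shows; individual byte values may still differ from the tool's output for some characters.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    sample = "あ^"
    for label, (bits, hex_string) in convert(sample).items():
        print(label, sample, bits, hex_string)
```

For example, `encode_to_bits("^", "utf-8")` returns `("01011110", "5e")`, which matches the final byte of every row in the table above.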