To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 依???財?億???依???財?億???^ 100010001100101100111111001111110011111110001101111000000011111110001001101011010011111100111111001111111000100011001011001111110011111100111111100011011110000000111111100010011010110100111111001111110011111101011110 88cb3f3f3f8de03f89ad3f3f3f88cb3f3f3f8de03f89ad3f3f3f5e
EUC-JP 依???財?億???依???財?億???^ 101100001100110100111111001111110011111110111010111000100011111110110010101011110011111100111111001111111011000011001101001111110011111100111111101110101110001000111111101100101010111100111111001111110011111101011110 b0cd3f3f3fbae23fb2af3f3f3fb0cd3f3f3fbae23fb2af3f3f3f5e
UTF-8 依머븝췹財렰億쇳렍렣依머븝췹財렰億쇳렍렡^ 11100100101111101001110111101011101010001011100011101011101110001001110111101100101101111011100111101000101100101010000111101011101000001011000011100101100001001000010011101100100001111011001111101011101000001000110111101011101000001010001111100100101111101001110111101011101010001011100011101011101110001001110111101100101101111011100111101000101100101010000111101011101000001011000011100101100001001000010011101100100001111011001111101011101000001000110111101011101000001010000101011110 e4be9deba8b8ebb89decb7b9e8b2a1eba0b0e58484ec87b3eba08deba0a3e4be9deba8b8ebb89decb7b9e8b2a1eba0b0e58484ec87b3eba08deba0a15e
UHC 依머븝췹財렰億쇳렍렣依머븝췹財렰億쇳렍렡^ 1110101111101110101110001101001110111010111011111100001111101111111011101010111110001110101111011110010111100010101111001110110110001110101000111000111010110100111010111110111010111000110100111011101011101111110000111110111111101110101011111000111010111101111001011110001010111100111011011000111010100011100011101011001001011110 ebeeb8d3baefc3efeeaf8ebde5e2bced8ea38eb4ebeeb8d3baefc3efeeaf8ebde5e2bced8ea38eb25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)