To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟????????域??有←?惟??沃??已 1000110011100101001111110011111100111111001111110011111100111111001111110011111110001000111001100011111100111111100101110100110010000001101010010011111110001000110100100011111100111111100101111000000000111111001111111001101111011111 8ce53f3f3f3f3f3f3f3f88e63f3f974c81a93f88d23f3f97803f3f9bdf
EUC-JP 悟??佾??洧??域??有←?惟??沃??已 101110001110011100111111001111111000111110110000111110110011111100111111100011111100011110110100001111110011111110110000111010000011111100111111110011011010110110100010101010110011111110110000110101000011111100111111110011011110000000111111001111111101011011100001 b8e73f3f8fb0fb3f3f8fc7b43f3fb0e83f3fcdada2ab3fb0d43f3fcde03f3fd6e1
UTF-8 悟귣쓷佾붹룚洧덀걶域㏐쑴有←뙠惟곗뵰沃쇰뿰已 111001101000001010011111111010101011011110100011111011001001001110110111111001001011110110111110111010111011011010111001111010111010001110011010111001101011010010100111111010111000110110000000111010101011000110110110111001011001111110011111111000111000111110010000111011001001000110110100111001101001110010001001111000101000011010010000111010111001100110100000111001101000001110011111111010101011001110010111111010111011010110110000111001101011001010000011111011001000011110110000111010111011111110110000111001011011011110110010 e6829feab7a3ec93b7e4bdbeebb6b9eba39ae6b4a7eb8d80eab1b6e59f9fe38f90ec91b4e69c89e28690eb99a0e6839feab397ebb5b0e6b283ec87b0ebbfb0e5b7b2
UHC 悟귣쓷佾붹룚洧덀걶域㏐쑴有←뙠惟곗뵰沃쇰뿰已 1110011111110110100000101110101110011101100101001110110011101011100101001110011010001111100101101110101011111011100010001110001110000001100111001110011010110100101001111110101010111110101010011110101011110011101000011110011110001100101001011110101011101110101100001110110010010100101011101110100010101010101111001110101110010111101100001110110010101011 e7f682eb9d94eceb94e68f96eafb88e3819ce6b4a7eabea9eaf3a1e78ca5eaeeb0ec94aee8aabceb97b0ecab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)