To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 閻わ?塋??張ョ?諺??節??業??塋??^ 11101000100001011000001011101101001111111001101011001000001111110011111110010010101000111000001110000111001111111000110010111111001111110011111110010000110111110011111100111111100010111100011000111111001111111001101011001000001111110011111101011110 e88582ed3f9ac83f3f92a383873f8cbf3f3f90df3f3f8bc63f3f9ac83f3f5e
EUC-JP 閻わ?塋??張ョ?諺?˚節??業??塋??^ 111011111110010110100100111011110011111111010100110010100011111100111111110001001010010110100101111001110011111110111000110000010011111110001111101000101011011011000000111000010011111100111111101101101100100000111111001111111101010011001010001111110011111101011110 efe5a4ef3fd4ca3f3fc4a5a5e73fb8c13f8fa2b6c0e13f3fb6c83f3fd4ca3f3f5e
UTF-8 閻わ슈塋뤸샃張ョ쳪諺든˚節당쳪業등눏塋뤸럦^ 111010011001011010111011111000111000001010001111111011001000101010001000111001011010000110001011111010111010010010111000111011001000001110000011111001011011110010110101111000111000001110100111111011001011001110101010111010001010101110111010111010111001001110100000110010111001101011100111101011111000000011101011100010111011100111101100101100111010101011100110101001011010110111101011100100111011000111101011100010001000111111100101101000011000101111101011101001001011100011101011100111111010011001011110 e996bbe3828fec8a88e5a18beba4b8ec8383e5bcb5e383a7ecb3aae8abbaeb93a0cb9ae7af80eb8bb9ecb3aae6a5adeb93b1eb888fe5a18beba4b8eb9fa65e
UHC 閻わ슈塋뤸샃張ョ쳪諺든˚節당쳪業등눏塋뤸럦^ 11100111101000101010101011101111101111011011010011100111101010111000111111100110100110001011010111101101111001011010101111100111101010111000111111100101111011001011010111100111101000101010101011101111101111011011010011100111101010111000111111100101111101101011010111101110100001111010101111100111101010111000111111100110100011101000100101011110 e7a2aaefbdb4e7ab8fe698b5ede5abe7ab8fe5ecb5e7a2aaefbdb4e7ab8fe5f6b5ee87abe7ab8fe68e895e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)