What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?伊逗???虞?寃?淨?伊逗???虞?寃?^ 100111111100010000111111100010001100100110010000100000000011111100111111001111111000101111110001001111111001101110000011001111111001111111000100001111111000100011001001100100001000000000111111001111110011111110001011111100010011111110011011100000110011111101011110 9fc43f88c990803f3f3f8bf13f9b833f9fc43f88c990803f3f3f8bf13f9b833f5e
EUC-JP 淨?伊逗???虞?寃?淨?伊逗???虞?寃?^ 110111101100011000111111101100001100101110111111111000000011111100111111001111111011011011110011001111111101010111100011001111111101111011000110001111111011000011001011101111111110000000111111001111110011111110110110111100110011111111010101111000110011111101011110 dec63fb0cbbfe03f3f3fb6f33fd5e33fdec63fb0cbbfe03f3f3fb6f33fd5e33f5e
UTF-8 淨렠伊逗썬亐렕虞렧寃넸淨렠伊逗썬亐렕虞렧寃넵^ 11100110101101111010100011101011101000001010000011100100101111001000101011101001100000001001011111101100100011011010110011100100101110101001000011101011101000001001010111101000100110011001111011101011101000001010011111100101101011111000001111101011100001001011100011100110101101111010100011101011101000001010000011100100101111001000101011101001100000001001011111101100100011011010110011100100101110101001000011101011101000001001010111101000100110011001111011101011101000001010011111100101101011111000001111101011100001001011010101011110 e6b7a8eba0a0e4bc8ae98097ec8dace4ba90eba095e8999eeba0a7e5af83eb84b8e6b7a8eba0a0e4bc8ae98097ec8dace4ba90eba095e8999eeba0a7e5af83eb84b55e
UHC 淨렠伊逗썬亐렕虞렧寃넸淨렠伊逗썬亐렕虞렧寃넵^ 111011111110010010001110101100011110110010100101110101001110100010111101111000111110101010100111100011101010101011101001111001011000111010110110111010101011001010110011110111101110111111100100100011101011000111101100101001011101010011101000101111011110001111101010101001111000111010101010111010011110010110001110101101101110101010110010101100111101110001011110 efe48eb1eca5d4e8bde3eaa78eaae9e58eb6eab2b3deefe48eb1eca5d4e8bde3eaa78eaae9e58eb6eab2b3dc5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
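
The kind of conversion shown in the table can be roughly reproduced offline. The following is a minimal Python sketch, not the page's actual implementation, which is unknown. The names `CHARSETS` and `encode_bits` are hypothetical. Mapping SJIS-WIN to Python's `cp932` codec and UHC to `cp949` is an assumption about which codecs correspond most closely. Characters that a charset cannot represent are replaced with "?" (byte 3f), which resembles the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "淨^"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexa = encode_bits(sample, codec)
        print(label, binary, hexa)
```

Each output line lists the charset label followed by the binary and hexadecimal forms of the encoded bytes, in the same column order as the table.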