To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????[???????????[^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 淨?依?衣???怨峯?[淨?依?衣???怨峯?[^ 1001111111000100001111111000100011001011001111111000100011011111001111110011111100111111100010011000010110010101111101010011111101011011100111111100010000111111100010001100101100111111100010001101111100111111001111110011111110001001100001011001010111110101001111110101101101011110 9fc43f88cb3f88df3f3f3f898595f53f5b9fc43f88cb3f88df3f3f3f898595f53f5b5e
EUC-JP 淨?依?衣???怨峯?[淨?依?衣???怨峯?[^ 1101111011000110001111111011000011001101001111111011000011100001001111110011111100111111101100011110010111001010111101110011111101011011110111101100011000111111101100001100110100111111101100001110000100111111001111110011111110110001111001011100101011110111001111110101101101011110 dec63fb0cd3fb0e13f3f3fb1e5caf73f5bdec63fb0cd3fb0e13f3f3fb1e5caf73f5b5e
UTF-8 淨렠依렋衣쭹렩렰怨峯긺[淨렠依렋衣쭹렩렰怨峯긺[^ 111001101011011110101000111010111010000010100000111001001011111010011101111010111010000010001011111010001010000110100011111011001010110110111001111010111010000010101001111010111010000010110000111001101000000010101000111001011011001110101111111010101011100010111010010110111110011010110111101010001110101110100000101000001110010010111110100111011110101110100000100010111110100010100001101000111110110010101101101110011110101110100000101010011110101110100000101100001110011010000000101010001110010110110011101011111110101010111000101110100101101101011110 e6b7a8eba0a0e4be9deba08be8a1a3ecadb9eba0a9eba0b0e680a8e5b3afeab8ba5be6b7a8eba0a0e4be9deba08be8a1a3ecadb9eba0a9eba0b0e680a8e5b3afeab8ba5b5e
UHC 淨렠依렋衣쭹렩렰怨峯긺[淨렠依렋衣쭹렩렰怨峯긺[^ 1110111111100100100011101011000111101011111011101000111010100010111010111111110111000010111001111000111010110111100011101011110111101010101100111101110011100111101100011110011101011011111011111110010010001110101100011110101111101110100011101010001011101011111111011100001011100111100011101011011110001110101111011110101010110011110111001110011110110001111001110101101101011110 efe48eb1ebee8ea2ebfdc2e78eb78ebdeab3dce7b1e75befe48eb1ebee8ea2ebfdc2e78eb78ebdeab3dce7b1e75b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)