To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遵??穽?遵??咀魄遵??穽?遵??咀白^ 10001111100001010011111100111111111000100111011000111111100011111000010100111111001111111001100111110000111010011010111010001111100001010011111100111111111000100111011000111111100011111000010100111111001111111001100111110000100101001001001001011110 8f853f3fe2763f8f853f3f99f0e9ae8f853f3fe2763f8f853f3f99f094925e
EUC-JP 遵??穽?遵??咀魄遵??穽?遵??咀白^ 10111101111001010011111100111111111000111101011100111111101111011110010100111111001111111101001011110010111100101011000010111101111001010011111100111111111000111101011100111111101111011110010100111111001111111101001011110010110001111111001001011110 bde53f3fe3d73fbde53f3fd2f2f2b0bde53f3fe3d73fbde53f3fd2f2c7f25e
UTF-8 遵몃ㄼ穽렯遵모갊咀魄遵몃ㄼ穽렯遵모갊咀白^ 11101001100000011011010111101011101010101000001111100011100001001011110011100111101010011011110111101011101000001010111111101001100000011011010111101011101010101010100011101010101100001000101011100101100100101000000011101001101011011000010011101001100000011011010111101011101010101000001111100011100001001011110011100111101010011011110111101011101000001010111111101001100000011011010111101011101010101010100011101010101100001000101011100101100100101000000011100111100110011011110101011110 e981b5ebaa83e384bce7a9bdeba0afe981b5ebaaa8eab08ae59280e9ad84e981b5ebaa83e384bce7a9bdeba0afe981b5ebaaa8eab08ae59280e799bd5e
UHC 遵몃ㄼ穽렯遵모갊咀魄遵몃ㄼ穽렯遵모갊咀白^ 1111000111100101101110001110101110100100101011001110111111110000100011101011110011110001111001011011100011110000101100001010011111101110101110101101101111011110111100011110010110111000111010111010010010101100111011111111000010001110101111001111000111100101101110001111000010110000101001111110111010111010110110111101110001011110 f1e5b8eba4aceff08ebcf1e5b8f0b0a7eebadbdef1e5b8eba4aceff08ebcf1e5b8f0b0a7eebadbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)