To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 賊??耿?耿??耿?鋼賊??耿?耿??耿?鋼^ 100100011010111100111111001111111110001111010100001111111110001111010100001111110011111111100011110101000011111110001101011111001001000110101111001111110011111111100011110101000011111111100011110101000011111100111111111000111101010000111111100011010111110001011110 91af3f3fe3d43fe3d43f3fe3d43f8d7c91af3f3fe3d43fe3d43f3fe3d43f8d7c5e
EUC-JP 賊?饔耿?耿?饔耿?鋼賊?饔耿?耿?饔耿?鋼^ 1100001010110001001111111000111111101000111011111110011011010110001111111110011011010110001111111000111111101000111011111110011011010110001111111011100111011101110000101011000100111111100011111110100011101111111001101101011000111111111001101101011000111111100011111110100011101111111001101101011000111111101110011101110101011110 c2b13f8fe8efe6d63fe6d63f8fe8efe6d63fb9ddc2b13f8fe8efe6d63fe6d63f8fe8efe6d63fb9dd5e
UTF-8 賊렠饔耿렋耿㉢饔耿렋鋼賊렠饔耿렋耿㉢饔耿렋鋼^ 11101000101100111000101011101011101000001010000011101001101001011001010011101000100000001011111111101011101000001000101111101000100000001011111111100011100010011010001011101001101001011001010011101000100000001011111111101011101000001000101111101001100010111011110011101000101100111000101011101011101000001010000011101001101001011001010011101000100000001011111111101011101000001000101111101000100000001011111111100011100010011010001011101001101001011001010011101000100000001011111111101011101000001000101111101001100010111011110001011110 e8b38aeba0a0e9a594e880bfeba08be880bfe389a2e9a594e880bfeba08be98bbce8b38aeba0a0e9a594e880bfeba08be880bfe389a2e9a594e880bfeba08be98bbc5e
UHC 賊렠饔耿렋耿㉢饔耿렋鋼賊렠饔耿렋耿㉢饔耿렋鋼^ 111011101110010010001110101100011110100010111101110011001110101010001110101000101100110011101010101010001011001111101000101111011100110011101010100011101010001011001011101111001110111011100100100011101011000111101000101111011100110011101010100011101010001011001100111010101010100010110011111010001011110111001100111010101000111010100010110010111011110001011110 eee48eb1e8bdccea8ea2cceaa8b3e8bdccea8ea2cbbceee48eb1e8bdccea8ea2cceaa8b3e8bdccea8ea2cbbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)