To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨??耿?耿??耿?鋼淨??耿?耿??耿?鋼^ 100111111100010000111111001111111110001111010100001111111110001111010100001111110011111111100011110101000011111110001101011111001001111111000100001111110011111111100011110101000011111111100011110101000011111100111111111000111101010000111111100011010111110001011110 9fc43f3fe3d43fe3d43f3fe3d43f8d7c9fc43f3fe3d43fe3d43f3fe3d43f8d7c5e
EUC-JP 淨?饔耿?耿?饔耿?鋼淨?饔耿?耿?饔耿?鋼^ 1101111011000110001111111000111111101000111011111110011011010110001111111110011011010110001111111000111111101000111011111110011011010110001111111011100111011101110111101100011000111111100011111110100011101111111001101101011000111111111001101101011000111111100011111110100011101111111001101101011000111111101110011101110101011110 dec63f8fe8efe6d63fe6d63f8fe8efe6d63fb9dddec63f8fe8efe6d63fe6d63f8fe8efe6d63fb9dd5e
UTF-8 淨렠饔耿렋耿㉢饔耿렋鋼淨렠饔耿렋耿㉢饔耿렋鋼^ 11100110101101111010100011101011101000001010000011101001101001011001010011101000100000001011111111101011101000001000101111101000100000001011111111100011100010011010001011101001101001011001010011101000100000001011111111101011101000001000101111101001100010111011110011100110101101111010100011101011101000001010000011101001101001011001010011101000100000001011111111101011101000001000101111101000100000001011111111100011100010011010001011101001101001011001010011101000100000001011111111101011101000001000101111101001100010111011110001011110 e6b7a8eba0a0e9a594e880bfeba08be880bfe389a2e9a594e880bfeba08be98bbce6b7a8eba0a0e9a594e880bfeba08be880bfe389a2e9a594e880bfeba08be98bbc5e
UHC 淨렠饔耿렋耿㉢饔耿렋鋼淨렠饔耿렋耿㉢饔耿렋鋼^ 111011111110010010001110101100011110100010111101110011001110101010001110101000101100110011101010101010001011001111101000101111011100110011101010100011101010001011001011101111001110111111100100100011101011000111101000101111011100110011101010100011101010001011001100111010101010100010110011111010001011110111001100111010101000111010100010110010111011110001011110 efe48eb1e8bdccea8ea2cceaa8b3e8bdccea8ea2cbbcefe48eb1e8bdccea8ea2cceaa8b3e8bdccea8ea2cbbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)