To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???要?????撓η????要?????撓η?^ 00111111001111110011111110010111011101100011111100111111001111110011111100111111100111011001101010000011110001010011111100111111001111110011111110010111011101100011111100111111001111110011111100111111100111011001101010000011110001010011111101011110 3f3f3f97763f3f3f3f3f9d9a83c53f3f3f3f97763f3f3f3f3f9d9a83c53f5e
EUC-JP 縕??要??縕??撓η?縕??要??縕??撓η?^ 100011111101010011000010001111110011111111001101110101110011111100111111100011111101010011000010001111110011111111011001111110101010011011000111001111111000111111010100110000100011111100111111110011011101011100111111001111111000111111010100110000100011111100111111110110011111101010100110110001110011111101011110 8fd4c23f3fcdd73f3f8fd4c23f3fd9faa6c73f8fd4c23f3fcdd73f3f8fd4c23f3fd9faa6c73f5e
UTF-8 縕됵슴要랃쉠縕됵슴撓η킀縕됵슴要랃쉠縕됵슴撓η킀^ 1110011110111000100101011110101110010000101101011110110010001010101101001110100010100110100000011110101110011110100000111110110010001001101000001110011110111000100101011110101110010000101101011110110010001010101101001110011010010010100100111100111010110111111011011000001010000000111001111011100010010101111010111001000010110101111011001000101010110100111010001010011010000001111010111001111010000011111011001000100110100000111001111011100010010101111010111001000010110101111011001000101010110100111001101001001010010011110011101011011111101101100000101000000001011110 e7b895eb90b5ec8ab4e8a681eb9e83ec89a0e7b895eb90b5ec8ab4e69293ceb7ed8280e7b895eb90b5ec8ab4e8a681eb9e83ec89a0e7b895eb90b5ec8ab4e69293ceb7ed82805e
UHC 縕됵슴要랃쉠縕됵슴撓η킀縕됵슴要랃쉠縕됵슴撓η킀^ 11101000101100101000100111101111101111011011111111101001101010011000110111101111101111011010101011101000101100101000100111101111101111011011111111101000111101011010010111100111101101001000110111101000101100101000100111101111101111011011111111101001101010011000110111101111101111011010101011101000101100101000100111101111101111011011111111101000111101011010010111100111101101001000110101011110 e8b289efbdbfe9a98defbdaae8b289efbdbfe8f5a5e7b48de8b289efbdbfe9a98defbdaae8b289efbdbfe8f5a5e7b48d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)