What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 澳??瞬??餓??}澳??瞬??餓??{^ 111000000101001100111111001111111000111101110101001111110011111110001001111011000011111100111111011111011110000001010011001111110011111110001111011101010011111100111111100010011110110000111111001111110111101101011110 e0533f3f8f753f3f89ec3f3f7de0533f3f8f753f3f89ec3f3f7b5e
EUC-JP 澳??瞬??餓??}澳??瞬??餓??{^ 110111111011010000111111001111111011110111010110001111110011111110110010111011100011111100111111011111011101111110110100001111110011111110111101110101100011111100111111101100101110111000111111001111110111101101011110 dfb43f3fbdd63f3fb2ee3f3f7ddfb43f3fbdd63f3fb2ee3f3f7b5e
UTF-8 澳묊슢瞬썹맏餓쇿븚}澳묊슢瞬썹맏餓쇿븚{^ 111001101011111010110011111010111010110010001010111011001000101010100010111001111001111010101100111011001000110110111001111010111010011110001111111010011010010010010011111011001000011110111111111010111011100010011010011111011110011010111110101100111110101110101100100010101110110010001010101000101110011110011110101011001110110010001101101110011110101110100111100011111110100110100100100100111110110010000111101111111110101110111000100110100111101101011110 e6beb3ebac8aec8aa2e79eacec8db9eba78fe9a493ec87bfebb89a7de6beb3ebac8aec8aa2e79eacec8db9eba78fe9a493ec87bfebb89a7b5e
UHC 澳묊슢瞬썹맏餓쇿븚}澳묊슢瞬썹맏餓쇿븚{^ 111001111111111010010001111001111001101010101110111000101110101110111101111001111011100010111010111001001011101110011001111001011001010110000110011111011110011111111110100100011110011110011010101011101110001011101011101111011110011110111000101110101110010010111011100110011110010110010101100001100111101101011110 e7fe91e79aaee2ebbde7b8bae4bb99e595867de7fe91e79aaee2ebbde7b8bae4bb99e595867b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
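
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, the codec names are Python's standard-library names for each charset, and it assumes that characters a charset cannot represent are replaced with `?` (byte 0x3f), which is what the ISO-8859-1 row above suggests.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Assumption: unencodable characters become '?' (0x3f), mirroring the table.
    """
    data = text.encode(codec, errors="replace")
    # Each byte is written as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Print one row per charset for a short sample input.
for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("A{^", codec)
    print(label, binary, hexadecimal)
```

Passing a string that contains characters outside a charset's repertoire should yield `3f` bytes in that charset's row, in the same way as the `?` runs in the table.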