To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????}???????????{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 澳??梧??要??嚥?}澳??梧??要??嚥?{^ 111000000101001100111111001111111000110011100110001111110011111110010111011101100011111100111111100110101000101100111111011111011110000001010011001111110011111110001100111001100011111100111111100101110111011000111111001111111001101010001011001111110111101101011110 e0533f3f8ce63f3f97763f3f9a8b3f7de0533f3f8ce63f3f97763f3f9a8b3f7b5e
EUC-JP 澳??梧??要??嚥?}澳??梧??要??嚥?{^ 110111111011010000111111001111111011100011101000001111110011111111001101110101110011111100111111110100111110101100111111011111011101111110110100001111110011111110111000111010000011111100111111110011011101011100111111001111111101001111101011001111110111101101011110 dfb43f3fb8e83f3fcdd73f3fd3eb3f7ddfb43f3fb8e83f3fcdd73f3fd3eb3f7b5e
UTF-8 澳뉛숴梧잞쉽要뺝맃嚥쨕}澳뉛숴梧잞쉽要뺝맃嚥쨕{^ 111001101011111010110011111010111000100110011011111011001000100010110100111001101010001010100111111011001001111010011110111011001000100110111101111010001010011010000001111010111011101010011101111010111010011110000011111001011001101010100101111011001010100010010101011111011110011010111110101100111110101110001001100110111110110010001000101101001110011010100010101001111110110010011110100111101110110010001001101111011110100010100110100000011110101110111010100111011110101110100111100000111110010110011010101001011110110010101000100101010111101101011110 e6beb3eb899bec88b4e6a2a7ec9e9eec89bde8a681ebba9deba783e59aa5eca8957de6beb3eb899bec88b4e6a2a7ec9e9eec89bde8a681ebba9deba783e59aa5eca8957b5e
UHC 澳뉛숴梧잞쉽要뺝맃嚥쨕}澳뉛숴梧잞쉽要뺝맃嚥쨕{^ 1110011111111110100001111110111110111101101001001110011111111100100111111110111110111101101100011110100110101001100101011110010110010000100111011110011010111111101001000110101101111101111001111111111010000111111011111011110110100100111001111111110010011111111011111011110110110001111010011010100110010101111001011001000010011101111001101011111110100100011010110111101101011110 e7fe87efbda4e7fc9fefbdb1e9a995e5909de6bfa46b7de7fe87efbda4e7fc9fefbdb1e9a995e5909de6bfa46b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)