What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 澳??罌??窈??[澳??罌??窈??[^ 111000000101001100111111001111111110001110100000001111110011111111100010011101110011111100111111010110111110000001010011001111110011111111100011101000000011111100111111111000100111011100111111001111110101101101011110 e0533f3fe3a03f3fe2773f3f5be0533f3fe3a03f3fe2773f3f5b5e
EUC-JP 澳??罌??窈??[澳??罌??窈??[^ 110111111011010000111111001111111110011010100010001111110011111111100011110110000011111100111111010110111101111110110100001111110011111111100110101000100011111100111111111000111101100000111111001111110101101101011110 dfb43f3fe6a23f3fe3d83f3f5bdfb43f3fe6a23f3fe3d83f3f5b5e
UTF-8 澳묌죱罌녘젒窈뚦∏[澳묌죱罌녘젒窈뚦∏[^ 111001101011111010110011111010111010110010001100111011001010001110110001111001111011110110001100111010111000010110011000111011001010000010010010111001111010101010001000111010111001101010100110111000101000100010001111010110111110011010111110101100111110101110101100100011001110110010100011101100011110011110111101100011001110101110000101100110001110110010100000100100101110011110101010100010001110101110011010101001101110001010001000100011110101101101011110 e6beb3ebac8ceca3b1e7bd8ceb8598eca092e7aa88eb9aa6e2888f5be6beb3ebac8ceca3b1e7bd8ceb8598eca092e7aa88eb9aa6e2888f5b5e
UHC 澳묌죱罌녘젒窈뚦∏[澳묌죱罌녘젒窈뚦∏[^ 111001111111111010010001111010011010000110001100111001011010001010110011111010001010000010010001111010011010000110001100111001011010001010110011010110111110011111111110100100011110100110100001100011001110010110100010101100111110100010100000100100011110100110100001100011001110010110100010101100110101101101011110 e7fe91e9a18ce5a2b3e8a091e9a18ce5a2b35be7fe91e9a18ce5a2b3e8a091e9a18ce5a2b35b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
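
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the ISO-8859-1, SJIS-WIN, and EUC-JP rows. Other implementations may handle unmappable characters differently, so the output is not guaranteed to match the table byte for byte.

```python
# Table labels mapped to Python codec names.
# This mapping is an assumption, not taken from the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ[^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.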