To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 澳??孃??窈??N}澳??孃??窈??N{^ 1110000001010011001111110011111110011011011011110011111100111111111000100111011100111111001111110100111001111101111000000101001100111111001111111001101101101111001111110011111111100010011101110011111100111111010011100111101101011110 e0533f3f9b6f3f3fe2773f3f4e7de0533f3f9b6f3f3fe2773f3f4e7b5e
EUC-JP 澳??孃??窈??N}澳??孃??窈??N{^ 1101111110110100001111110011111111010101110100000011111100111111111000111101100000111111001111110100111001111101110111111011010000111111001111111101010111010000001111110011111111100011110110000011111100111111010011100111101101011110 dfb43f3fd5d03f3fe3d83f3f4e7ddfb43f3fd5d03f3fe3d83f3f4e7b5e
UTF-8 澳롩죺孃곮젍窈붹떅N}澳롩죺孃곮젍窈붹떅N{^ 1110011010111110101100111110101110100001101010011110110010100011101110101110010110101101100000111110101010110011101011101110110010100000100011011110011110101010100010001110101110110110101110011110101110010110100001010100111001111101111001101011111010110011111010111010000110101001111011001010001110111010111001011010110110000011111010101011001110101110111011001010000010001101111001111010101010001000111010111011011010111001111010111001011010000101010011100111101101011110 e6beb3eba1a9eca3bae5ad83eab3aeeca08de7aa88ebb6b9eb96854e7de6beb3eba1a9eca3bae5ad83eab3aeeca08de7aa88ebb6b9eb96854e7b5e
UHC 澳롩죺孃곮젍窈붹떅N}澳롩죺孃곮젍窈붹떅N{^ 1110011111111110100011101110100110100001100101001110010110111110100000011110100010100000100011101110100110100001100101001110011010001011100110110100111001111101111001111111111010001110111010011010000110010100111001011011111010000001111010001010000010001110111010011010000110010100111001101000101110011011010011100111101101011110 e7fe8ee9a194e5be81e8a08ee9a194e68b9b4e7de7fe8ee9a194e5be81e8a08ee9a194e68b9b4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)