To what bit string is a character (or several characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ???窈??偃??n}???窈??偃??n{^ 001111110011111100111111111000100111011100111111001111111001100011101110001111110011111101101110011111010011111100111111001111111110001001110111001111110011111110011000111011100011111100111111011011100111101101011110 3f3f3fe2773f3f98ee3f3f6e7d3f3f3fe2773f3f98ee3f3f6e7b5e
EUC-JP ???窈??偃??n}???窈??偃??n{^ 001111110011111100111111111000111101100000111111001111111101000011110000001111110011111101101110011111010011111100111111001111111110001111011000001111110011111111010000111100000011111100111111011011100111101101011110 3f3f3fe3d83f3fd0f03f3f6e7d3f3f3fe3d83f3fd0f03f3f6e7b5e
UTF-8 療딅젽窈뚮젚偃띿괩n}療딅젽窈뚮젚偃띿괩n{^ 1110111110100111100000011110101110010100100001011110110010100000101111011110011110101010100010001110101110011010101011101110110010100000100110101110010110000001100000111110101110011101101111111110101010110100101010010110111001111101111011111010011110000001111010111001010010000101111011001010000010111101111001111010101010001000111010111001101010101110111011001010000010011010111001011000000110000011111010111001110110111111111010101011010010101001011011100111101101011110 efa781eb9485eca0bde7aa88eb9aaeeca09ae58183eb9dbfeab4a96e7defa781eb9485eca0bde7aa88eb9aaeeca09ae58183eb9dbfeab4a96e7b5e
UHC 療딅젽窈뚮젚偃띿괩n}療딅젽窈뚮젚偃띿괩n{^ 1110100011111110100010101110101110100000101011111110100110100001100011001110101110100000100101101110010111100111100011011110110010110001101010000110111001111101111010001111111010001010111010111010000010101111111010011010000110001100111010111010000010010110111001011110011110001101111011001011000110101000011011100111101101011110 e8fe8aeba0afe9a18ceba096e5e78decb1a86e7de8fe8aeba0afe9a18ceba096e5e78decb1a86e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
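
The conversion in the table can be approximated with Python's standard codecs. This is only a minimal sketch, not this page's actual implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the use of `errors="replace"` to produce "?" for characters a charset cannot represent. The function name `encode_table` is made up for illustration.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("n}"):
        print(label, bits, hex_string)
```

For ASCII-only input such as `n}`, every charset listed here yields the same bytes (`6e7d`), which is why the tail of each row in the table is identical. The rows differ only for non-ASCII characters.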