To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雍????????艤??荏??曜??雍??旬 11101000101101000011111100111111001111110011111100111111001111110011111100111111111001000111111000111111001111111000100101100000001111110011111110010111011010100011111100111111111010001011010000111111001111111000111101111011 e8b43f3f3f3f3f3f3f3fe47e3f3f89603f3f976a3f3fe8b43f3f8f7b
EUC-JP 雍????????艤??荏??曜??雍??旬 11110000101101100011111100111111001111110011111100111111001111110011111100111111111001111101111100111111001111111011000111000001001111110011111111001101110010110011111100111111111100001011011000111111001111111011110111011100 f0b63f3f3f3f3f3f3f3fe7df3f3fb1c13f3fcdcb3f3ff0b63f3fbddc
UTF-8 雍됱뼶溜깅퉬溜긺즸艤뚪퐥荏⑹쐧曜쒕젌雍됱뼵旬 111010011001101110001101111010111001000010110001111010111011110010110110111011111010011110001011111010101011100110000101111011011000100110101100111011111010011110001011111010101011100010111010111011001010011010111000111010001000100110100100111010111001101010101010111011011001000010100101111010001000110110001111111000101001000110111001111011001001000010100111111001101001101110011100111011001001001010010101111011001010000010001100111010011001101110001101111010111001000010110001111010111011110010110101111001101001011110101100 e99b8deb90b1ebbcb6efa78beab985ed89acefa78beab8baeca6b8e889a4eb9aaaed90a5e88d8fe291b9ec90a7e69b9cec9295eca08ce99b8deb90b1ebbcb5e697ac
UHC 雍됱뼶溜깅퉬溜긺즸艤뚪퐥荏⑹쐧曜쒕젌雍됱뼵旬 1110100010111100100010011110110010010110101110011110101011111110101100011110101110111001100001001110101011111110101100011110011110100011100010101110101111111010100011001110100110111101100011101110110011111011101010011110110010011100100011001110100011111000100111001110101110100000100011011110100010111100100010011110110010010110101110001110001011100010 e8bc89ec96b9eafeb1ebb984eafeb1e7a38aebfa8ce9bd8eecfba9ec9c8ce8f89ceba08de8bc89ec96b8e2e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)