To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 藥???уⅹ???n}藥???уⅹ???n{^ 1110010101011010001111110011111100111111100001001000010111111010010010010011111100111111001111110110111001111101111001010101101000111111001111110011111110000100100001011111101001001001001111110011111100111111011011100111101101011110 e55a3f3f3f8485fa493f3f3f6e7de55a3f3f3f8485fa493f3f3f6e7b5e
EUC-JP 藥???у????n}藥???у????n{^ 111010011011101100111111001111110011111110100111111001010011111100111111001111110011111101101110011111011110100110111011001111110011111100111111101001111110010100111111001111110011111100111111011011100111101101011110 e9bb3f3f3fa7e53f3f3f3f6e7de9bb3f3f3fa7e53f3f3f3f6e7b5e
UTF-8 藥썲쁿歷уⅹ嶪뤹텥n}藥썲쁿歷уⅹ嶪뤹텥n{^ 111010001001011110100101111011001000110110110010111011001000000110111111111011111010011010001100110100011000001111100010100001011011100111100101101101101010101011101011101001001011100111101101100001011010010101101110011111011110100010010111101001011110110010001101101100101110110010000001101111111110111110100110100011001101000110000011111000101000010110111001111001011011011010101010111010111010010010111001111011011000010110100101011011100111101101011110 e897a5ec8db2ec81bfefa68cd183e285b9e5b6aaeba4b9ed85a56e7de897a5ec8db2ec81bfefa68cd183e285b9e5b6aaeba4b9ed85a56e7b5e
UHC 藥썲쁿歷уⅹ嶪뤹텥n}藥썲쁿歷уⅹ嶪뤹텥n{^ 1110010110110111101111011110010110011000100001101110011010111000101011001110010110100101101010101110010111110101100011111110011110110110100110100110111001111101111001011011011110111101111001011001100010000110111001101011100010101100111001011010010110101010111001011111010110001111111001111011011010011010011011100111101101011110 e5b7bde59886e6b8ace5a5aae5f58fe7b69a6e7de5b7bde59886e6b8ace5a5aae5f58fe7b69a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)