To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 烏?????烏??而????????壹??^ 1000100101000111001111110011111100111111001111110011111110001001010001110011111100111111100011101010011100111111001111110011111100111111001111110011111100111111001111111001101011100011001111110011111101011110 89473f3f3f3f3f89473f3f8ea73f3f3f3f3f3f3f3f9ae33f3f5e
EUC-JP 烏?????烏??而????????壹??^ 1011000110101000001111110011111100111111001111110011111110110001101010000011111100111111101111001010100100111111001111110011111100111111001111110011111100111111001111111101010011100101001111110011111101011110 b1a83f3f3f3f3fb1a83f3fbca93f3f3f3f3f3f3f3fd4e53f3f5e
UTF-8 烏녻몳溜롫젨烏놃렔而ㅸ죫吳쏅젍溜좄죫壹⑹빱^ 11100111100000111000111111101011100001011011101111101011101010101011001111101111101001111000101111101011101000011010101111101100101000001010100011100111100000111000111111101011100001101000001111101011101000001001010011101000100000001000110011100011100001011011100011101100101000111010101111100101100100001011001111101100100011111000010111101100101000001000110111101111101001111000101111101100101000101000010011101100101000111010101111100101101000111011100111100010100100011011100111101011101110011011000101011110 e7838feb85bbebaab3efa78beba1abeca0a8e7838feb8683eba094e8808ce385b8eca3abe590b3ec8f85eca08defa78beca284eca3abe5a3b9e291b9ebb9b15e
UHC 烏녻몳溜롫젨烏놃렔而ㅸ죫吳쏅젍溜좄죫壹⑹빱^ 11101000101000011000011011101000100100011001101111101010111111101000111011101011101000001010000011101000101000011000011011101101100011101010100111101100101110111010010011101000101000011000011011100111111011111001101111101011101000001000111011101010111111101010000011101000101000011000011011101100111011001010100111101100101110111010010001011110 e8a186e8919beafe8eeba0a0e8a186ed8ea9ecbba4e8a186e7ef9beba08eeafea0e8a186ececa9ecbba45e

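SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are replaced with "?" (0x3f) in the table above.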
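
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's actual implementation. The codec names are Python's standard ones, and the mapping from the page's labels to them (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. The helper names `to_bits` and `convert` are hypothetical. Unencodable characters are replaced with "?", which mirrors the 3f bytes seen above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Windows Shift_JIS variant (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code (CP949)
}


def to_bits(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in the table."""
    return {label: to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("烏^").items():
        print(label, binary, hexa)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.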