To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 荳茨セ∝ョ滄ゥ手「ア荳願アシ螳滄ゥ手「ー^ 11100100101110001000100011101111101111101000000111100101101011101001111111101001101010011000111011101000101000101011000111100100101110001000101011101000101100011011110011100101101011101001111111101001101010011000111011101000101000101011000001011110 e4b888efbe81e5ae9fe9a98ee8a2b1e4b88ae8b1bce5ae9fe9a98ee8a2b05e
EUC-JP 荳茨セ∝ョ滄ゥ手「ア荳願アシ螳滄ゥ手「ー^ 1110100010111010101100001111000110001110101111101010001011100111100011101010111011011110111010111000111010101001101111001110101010001110101000101000111010110001111010001011101010110100111010101000111010110001100011101011110011101010101100001101111011101011100011101010100110111100111010101000111010100010100011101011000001011110 e8bab0f18ebea2e78eaedeeb8ea9bcea8ea28eb1e8bab4ea8eb18ebceab0deeb8ea9bcea8ea28eb05e
UTF-8 荳茨セ∝ョ滄ゥ手「ア荳願アシ螳滄ゥ手「ー^ 11101000100011011011001111101000100011001010100011101111101111011011111011100010100010001001110111101111101111011010111011100110101110111000010011101111101111011010100111100110100010011000101111101111101111011010001011101111101111011011000111101000100011011011001111101001101000011001100011101111101111011011000111101111101111011011110011101000100111101011001111100110101110111000010011101111101111011010100111100110100010011000101111101111101111011010001011101111101111011011000001011110 e88db3e88ca8efbdbee2889defbdaee6bb84efbda9e6898befbda2efbdb1e88db3e9a198efbdb1efbdbce89eb3e6bb84efbda9e6898befbda2efbdb05e
UHC 荳茨?∝?滄?手??荳願??螳滄?手??^ 11010100111001011110110110111100001111111010000111110000001111111111001111100111001111111110001010100010001111110011111111010100111001011110101011000011001111110011111111010011110110011111001111100111001111111110001010100010001111110011111101011110 d4e5edbc3fa1f03ff3e73fe2a23f3fd4e5eac33f3fd3d9f3e73fe2a23f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)