To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 荳茨セ∝ョ滄ゥ手「ア荳茨セ∝ョ滄ゥ手「ー^ 11100100101110001000100011101111101111101000000111100101101011101001111111101001101010011000111011101000101000101011000111100100101110001000100011101111101111101000000111100101101011101001111111101001101010011000111011101000101000101011000001011110 e4b888efbe81e5ae9fe9a98ee8a2b1e4b888efbe81e5ae9fe9a98ee8a2b05e
EUC-JP 荳茨セ∝ョ滄ゥ手「ア荳茨セ∝ョ滄ゥ手「ー^ 1110100010111010101100001111000110001110101111101010001011100111100011101010111011011110111010111000111010101001101111001110101010001110101000101000111010110001111010001011101010110000111100011000111010111110101000101110011110001110101011101101111011101011100011101010100110111100111010101000111010100010100011101011000001011110 e8bab0f18ebea2e78eaedeeb8ea9bcea8ea28eb1e8bab0f18ebea2e78eaedeeb8ea9bcea8ea28eb05e
UTF-8 荳茨セ∝ョ滄ゥ手「ア荳茨セ∝ョ滄ゥ手「ー^ 11101000100011011011001111101000100011001010100011101111101111011011111011100010100010001001110111101111101111011010111011100110101110111000010011101111101111011010100111100110100010011000101111101111101111011010001011101111101111011011000111101000100011011011001111101000100011001010100011101111101111011011111011100010100010001001110111101111101111011010111011100110101110111000010011101111101111011010100111100110100010011000101111101111101111011010001011101111101111011011000001011110 e88db3e88ca8efbdbee2889defbdaee6bb84efbda9e6898befbda2efbdb1e88db3e88ca8efbdbee2889defbdaee6bb84efbda9e6898befbda2efbdb05e
UHC 荳茨?∝?滄?手??荳茨?∝?滄?手??^ 11010100111001011110110110111100001111111010000111110000001111111111001111100111001111111110001010100010001111110011111111010100111001011110110110111100001111111010000111110000001111111111001111100111001111111110001010100010001111110011111101011110 d4e5edbc3fa1f03ff3e73fe2a23f3fd4e5edbc3fa1f03ff3e73fe2a23f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)