To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 逋イ竏ス岦應セ晧ョ凡逋イ竏ス岦應セ晧ョ本^ 111001111001100110110010111000101000100010111101111110101010110010011100111001001011111010011101111011001010111010010110011111011110011110011001101100101110001010001000101111011111101010101100100111001110010010111110100111011110110010101110100101100111101101011110 e799b2e288bdfaac9ce4be9decae967de799b2e288bdfaac9ce4be9decae967b5e
EUC-JP 逋イ竏ス岦應セ晧ョ凡逋イ竏ス岦應セ晧ョ本^ 11101101111110011000111010110010111000111110100010001110101111011000111110111011101100111101100011100110100011101011111011011010111011101000111010101110110010111101111011101101111110011000111010110010111000111110100010001110101111011000111110111011101100111101100011100110100011101011111011011010111011101000111010101110110010111101110001011110 edf98eb2e3e88ebd8fbbb3d8e68ebedaee8eaecbdeedf98eb2e3e88ebd8fbbb3d8e68ebedaee8eaecbdc5e
UTF-8 逋イ竏ス岦應セ晧ョ凡逋イ竏ス岦應セ晧ョ本^ 11101001100000001000101111101111101111011011001011100111101010111000111111101111101111011011110111100101101100101010011011100110100001111000100111101111101111011011111011100110100110011010011111101111101111011010111011100101100001111010000111101001100000001000101111101111101111011011001011100111101010111000111111101111101111011011110111100101101100101010011011100110100001111000100111101111101111011011111011100110100110011010011111101111101111011010111011100110100111001010110001011110 e9808befbdb2e7ab8fefbdbde5b2a6e68789efbdbee699a7efbdaee587a1e9808befbdb2e7ab8fefbdbde5b2a6e68789efbdbee699a7efbdaee69cac5e
UHC 逋????應?晧?凡逋????應?晧?本^ 1111100011100111001111110011111100111111001111111110101111101011001111111111101111000101001111111101101111101101111110001110011100111111001111110011111100111111111010111110101100111111111110111100010100111111110111001110001001011110 f8e73f3f3f3febeb3ffbc53fdbedf8e73f3f3f3febeb3ffbc53fdce25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)