To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???揖η????齬??洵ヤ?濡リ???? 00111111001111110011111110010111010010111000001111000101001111110011111100111111001111111110101010010111001111110011111110011111101010111000001110000100001111111001010001000111100000111000101000111111001111110011111100111111 3f3f3f974b83c53f3f3f3fea973f3f9fab83843f9447838a3f3f3f3f
EUC-JP ???揖η????齬??洵ヤ?濡リ???? 00111111001111110011111111001101101011001010011011000111001111110011111100111111001111111111001111110111001111110011111111011110101011011010010111100100001111111100011110101000101001011110101000111111001111110011111100111111 3f3f3fcdaca6c73f3f3f3ff3f73f3fdeada5e43fc7a8a5ea3f3f3f3f
UTF-8 嶺뚮슢揖η뙼硫대뙑齬잕퀣洵ヤ틛濡リ컳閱곗냄 1110111110100110101010111110101110011010101011101110110010001010101000101110011010001111100101101100111010110111111010111001100110111100111011111010011110001110111010111000110010000000111010111001100110010001111010011011110110101100111011001001111010010101111011011000000010100011111001101011010010110101111000111000001110100100111011011000101110011011111001101011111110100001111000111000001110101010111011001011101110110011111010011001011010110001111010101011001110010111111010111000001110000100 efa6abeb9aaeec8aa2e68f96ceb7eb99bcefa78eeb8c80eb9991e9bdacec9e95ed80a3e6b4b5e383a4ed8b9be6bfa1e383aaecbbb3e996b1eab397eb8384
UHC 嶺뚮슢揖η뙼硫대뙑齬잕퀣洵ヤ틛濡リ컳閱곗냄 111001111010110110001100111010111001101010101110111010111110011110100101111001111000110010111111111010111010100110110100111010111000110010010110111001011110000110011111111010101011001110010111111000101110011110101011111001001011101010001000111010111010000110101011111010101011000010011001111001101111001110110000111011001011001110111111 e7ad8ceb9aaeebe7a5e78cbfeba9b4eb8c96e5e19feab397e2e7abe4ba88eba1abeab099e6f3b0ecb3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)