To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8宜??癒?エ碍??柚??猷??沃??逾 1110000110011111001111111000001001010111100010110101100000111111001111111001011011111100001111111000001101000111100010100101011000111111001111111001011101001101001111110011111110010111010100010011111100111111100101111000000000111111001111111110011110100101 e19f3f82578b583f3f96fc3f83478a563f3f974d3f3f97513f3f97803f3fe7a5
EUC-JP 癲?8宜??癒?エ碍??柚??猷??沃??逾 1110001010100001001111111010001110111000101101011011100100111111001111111100110011111110001111111010010110101000101100111011011100111111001111111100110110101110001111110011111111001101101100100011111100111111110011011110000000111111001111111110111010100111 e2a13fa3b8b5b93f3fccfe3fa5a8b3b73f3fcdae3f3fcdb23f3fcde03f3feea7
UTF-8 癲쒕8宜룩맱癒뀁エ碍⑸씭柚쏙쬅猷몃샍沃쇰ㅉ逾 111001111001100110110010111011001001001010010101111011111011110010011000111001011010111010011100111010111010001110101001111010111010011110110001111001111001100110010010111010111000000010000001111000111000001010101000111001111010001010001101111000101001000110111000111011001001010010101101111001101001111110011010111011001000111110011001111011001010110010000101111001111000110010110111111010111010101010000011111011001000001110001101111001101011001010000011111011001000011110110000111000111000010110001001111010011000000010111110 e799b2ec9295efbc98e5ae9ceba3a9eba7b1e79992eb8081e382a8e7a28de291b8ec94ade69f9aec8f99ecac85e78cb7ebaa83ec838de6b283ec87b0e38589e980be
UHC 癲쒕8宜룩맱癒뀁エ碍⑸씭柚쏙쬅猷몃샍沃쇰ㅉ逾 1110111110100110100111001110101110100011101110001110101111110001101101111110100010010000101110001110101110101000101100101110110010101011101010001110010011110100101010011110101110011101101111101110101011110110101111011110111110100110100111001110101110100011101110001110101110011000101110111110100010101010101111001110101110100100101110011110101110110101 efa69ceba3b8ebf1b7e890b8eba8b2ecaba8e4f4a9eb9dbeeaf6bdefa69ceba3b8eb98bbe8aabceba4b9ebb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)