To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟???ゆ?應??嚴щ?愉?い怨??? 100011001110010100111111001111110011111110000010111001000011111110011100111001000011111100111111100110101000111010000100100010110011111110010110111110010011111110000010101000101000100110000101001111110011111100111111 8ce53f3f3f82e43f9ce43f3f9a8e848b3f96f93f82a289853f3f3f
EUC-JP 悟??沅ゆ?應??嚴щ?愉?い怨??孼 10111000111001110011111100111111100011111100011011101001101001001110011000111111110110001110011000111111001111111101001111101110101001111110101100111111110011001111101100111111101001001010010010110001111001010011111100111111100011111011101011000011 b8e73f3f8fc6e9a4e63fd8e63f3fd3eea7eb3fccfb3fa4a4b1e53f3f8fbac3
UTF-8 悟뽯쉼沅ゆ룚應쇱쑓嚴щ벊愉잒い怨몄삖孼 1110011010000010100111111110101110111101101011111110110010001001101111001110011010110010100001011110001110000010100001101110101110100011100110101110011010000111100010011110110010000111101100011110110010010001100100111110010110011010101101001101000110001001111010111011001010001010111001101000010010001001111011001001111010010010111000111000000110000100111001101000000010101000111010111010101010000100111011001000001010010110111001011010110110111100 e6829febbdafec89bce6b285e38286eba39ae68789ec87b1ec9193e59ab4d189ebb28ae68489ec9e92e38184e680a8ebaa84ec8296e5adbc
UHC 悟뽯쉼沅ゆ룚應쇱쑓嚴щ벊愉잒い怨몄삖孼 1110011111110110100101101110101110111101101100001110101010110110101010101110011010001111100101101110101111101011101111001110110010011100101100101110010111110001101011001110101110010011101011011110101011110000100111111110100010101010101001001110101010110011101110001110110010011000100110101110010111101101 e7f696ebbdb0eab6aae68f96ebebbcec9cb2e5f1aceb93adeaf09fe8aaa4eab3b8ec989ae5ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)