To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤??飮??矣??夜??艤ゆ?掩??泣ε? 100011001110101100111111001111111001111101011010001111110011111111100001111000010011111100111111100101101110100100111111001111111110010001111110100000101110010000111111100010011000011000111111001111111000101110000011100000111100001100111111 8ceb3f3f9f5a3f3fe1e13f3f96e93f3fe47e82e43f89863f3f8b8383c33f
EUC-JP 誤??飮??矣??夜??艤ゆ?掩??泣ε? 101110001110110100111111001111111101110110111011001111110011111111100010111000110011111100111111110011001110101100111111001111111110011111011111101001001110011000111111101100011110011000111111001111111011010111100011101001101100010100111111 b8ed3f3fddbb3f3fe2e33f3fcceb3f3fe7dfa4e63fb1e63f3fb5e3a6c53f
UTF-8 誤곸룊飮닷쮦矣묒돭夜껋꼯艤ゆ뵽掩뽰궪泣ε뜄 1110100010101010101001001110101010110011101110001110101110100011100010101110100110100011101011101110101110001011101101111110110010101110101001101110011110011111101000111110101110101100100100101110101110001111101011011110010110100100100111001110101010111011100010111110101010111100101011111110100010001001101001001110001110000010100001101110101110110101101111011110011010001110101010011110101110111101101100001110101010110110101010101110011010110011101000111100111010110101111010111001110010000100 e8aaa4eab3b8eba38ae9a3aeeb8bb7ecaea6e79fa3ebac92eb8fade5a49ceabb8beabcafe889a4e38286ebb5bde68ea9ebbdb0eab6aae6b3a3ceb5eb9c84
UHC 誤곸룊飮닷쮦矣묒돭夜껋꼯艤ゆ뵽掩뽰궪泣ε뜄 111010001010011010000001111011001000111110001001111010111110011010110100111001011010100010000011111010111111100010010001111011001000100110110000111001011010100010000011111011001000010010001010111010111111101010101010111001101001010010111011111001011111001110010110111011001000001010111100111010111110100010100101111001011000110110001000 e8a681ec8f89ebe6b4e5a883ebf891ec89b0e5a883ec848aebfaaae694bbe5f396ec82bcebe8a5e58d88

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)