To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 永??揖??愉??蹂?????B 1000100101101001001111110011111110010111010010110011111100111111100101101111100100111111001111111110011011111000001111110011111100111111001111110011111101000010 89693f3f974b3f3f96f93f3fe6f83f3f3f3f3f42
EUC-JP 永??揖??愉??蹂?????B 1011000111001010001111110011111111001101101011000011111100111111110011001111101100111111001111111110110011111010001111110011111100111111001111110011111101000010 b1ca3f3fcdac3f3fccfb3f3fecfa3f3f3f3f3f42
UTF-8 永띕낑揖계뇾愉귞독蹂⑹돭銳잙뭳B 11100110101100001011100011101011100111011001010111101011100000101001000111100110100011111001011011101010101100111000010011101011100001111011111011100110100001001000100111101010101101111001111011101011100011111000010111101000101110011000001011100010100100011011100111101011100011111010110111101001100010101011001111101100100111101001100111101011101011011011001101000010 e6b0b8eb9d95eb8291e68f96eab384eb87bee68489eab79eeb8f85e8b982e291b9eb8fade98ab3ec9e99ebadb342
UHC 永띕낑揖계뇾愉귞독蹂⑹돭銳잙뭳B 11100111101101011011011011101011101100111010100111101011111001111011000011101000100001111001111111101010111100001000001011100111101101011011011011101011101100111010100111101100100010011011000011100111111001011001111111101011100100101000001001000010 e7b5b6ebb3a9ebe7b0e8879feaf082e7b5b6ebb3a9ec89b0e7e59feb928242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)