What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????A | 0011111100111111001111110011111100111111001111110011111101000001 | 3f3f3f3f3f3f3f41
SJIS-WIN | 宵メ?讖ェ宵イA | 1000111110101010110100100011111111100110101010011010101010001111101010101011001001000001 | 8faad23fe6a9aa8faab241
EUC-JP | 宵メ繐讖ェ宵イA | 10111110101011001000111011010010100011111101010011010011111011001010101110001110101010101011111010101100100011101011001001000001 | beac8ed28fd4d3ecab8eaabeac8eb241
UTF-8 | 宵メ繐讖ェ宵イA | 11100101101011101011010111101111101111101001001011100111101110011001000011101000101011101001011011101111101111011010101011100101101011101011010111101111101111011011001001000001 | e5aeb5efbe92e7b990e8ae96efbdaae5aeb5efbdb241
UHC | 宵??讖?宵?A | 1110000110110010001111110011111111110011110110010011111111100001101100100011111101000001 | e1b23f3ff3d93fe1b23f41

SJIS-Win, EUC-JP: Legacy character encodings for Japanese, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
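
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_row` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, and Python's codecs may not agree with this page byte for byte on every character. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
# These pairings are assumptions; the page's own converter may differ in detail.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "宵A"  # any single character or short string
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_row(sample, codec)
        print(label, bits, hexstr)
```

For example, `encode_row("A", "utf-8")` returns `("01000001", "41")`, matching the final byte of every row in the table.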