To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ??侑???重Ъ 0011111100111111100110001101000000111111001111110011111110001111011001001000010001011011 3f3f98d03f3f3f8f64845b
EUC-JP 潗?侑???重Ъ 10001111110010001101100000111111110100001101001000111111001111110011111110111101110001011010011110111100 8fc8d83fd0d23f3f3fbdc5a7bc
UTF-8 潗샹侑렾뤒쾀重Ъ 1110011010111101100101111110110010000011101110011110010010111110100100011110101110100000101111101110101110100100100100101110110010111110100000001110100110000111100011011101000010101010 e6bd97ec83b9e4be91eba0beeba492ecbe80e9878dd0aa
UHC 潗샹侑렾뤒쾀重Ъ 11110010111111001011110010100111111010101110001010001110110001101000111111000010110001001110011011110001111011001010110010111100 f2fcbca7eae28ec68fc2c4e6f1ecacbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)