To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 足皎??足皎??B 10010001101010111110000110100111001111110011111110010001101010111110000110100111001111110011111101000010 91abe1a73f3f91abe1a73f3f42
EUC-JP 足皎??足皎??B 11000010101011011110001010101001001111110011111111000010101011011110001010101001001111110011111101000010 c2ade2a93f3fc2ade2a93f3f42
UTF-8 足皎렫렲足皎렫렲B 11101000101101101011001111100111100110101000111011101011101000001010101111101011101000001011001011101000101101101011001111100111100110101000111011101011101000001010101111101011101000001011001001000010 e8b6b3e79a8eeba0abeba0b2e8b6b3e79a8eeba0abeba0b242
UHC 足皎렫렲足皎렫렲B 1111000011101011110011101110101110001110101110011000111010111111111100001110101111001110111010111000111010111001100011101011111101000010 f0ebceeb8eb98ebff0ebceeb8eb98ebf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)