To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 鴦???筌??徇 1110100111110001001111110011111100111111111000101010001100111111001111111001110001101101 e9f13f3f3fe2a33f3f9c6d
EUC-JP 鴦???筌??徇 1111001011110011001111110011111100111111111001001010010100111111001111111101011111001110 f2f33f3f3fe4a53f3fd7ce
UTF-8 鴦꾆뀀꼧筌딆뮂徇 111010011011010010100110111010101011111010000110111010111000000010000000111010101011110010100111111001111010110110001100111010111001010010000110111010111010111010000010111001011011111010000111 e9b4a6eabe86eb8080eabca7e7ad8ceb9486ebae82e5be87
UHC 鴦꾆뀀꼧筌딆뮂徇 11100100111011001000010011001110101100101110101110000100100001001110111110100111100010101110110010010010100100011110001011011111 e4ec84ceb2eb8484efa78aec9291e2df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)