To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 櫻??腋??猷 10011111010011100011111100111111111000111111110000111111001111111001011101010001 9f4e3f3fe3fc3f3f9751
EUC-JP 櫻??腋??猷 11011101101011110011111100111111111001101111111000111111001111111100110110110010 ddaf3f3fe6fe3f3fcdb2
UTF-8 櫻뗭걀腋잂뮪猷 111001101010101110111011111010111001011110101101111010101011000110000000111010001000010110001011111011001001111010000010111010111010111010101010111001111000110010110111 e6abbbeb97adeab180e8858bec9e82ebaeaae78cb7
UHC 櫻뗭걀腋잂뮪猷 1110010110100001100010111110110010110000101111111110010011111101100111111110001010010010101101001110101110100011 e5a18becb0bfe4fd9fe292b4eba3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)