To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 亦??揖ц?蟻?? 10010110100100100011111100111111100101110100101110000100100010000011111110001011011000010011111100111111 96923f3f974b84883f8b613f3f
EUC-JP 亦??揖ц?蟻?? 11001011111100100011111100111111110011011010110010100111111010000011111110110101110000100011111100111111 cbf23f3fcdaca7e83fb5c23f3f
UTF-8 亦껋뼲揖ц쫨蟻뚯쓢 1110010010111010101001101110101010111011100010111110101110111100101100101110011010001111100101101101000110000110111011001010101110101000111010001001111110111011111010111001101010101111111011001001001110100010 e4baa6eabb8bebbcb2e68f96d186ecaba8e89fbbeb9aafec93a2
UHC 亦껋뼲揖ц쫨蟻뚯쓢 111001101011001010000011111011001001011010110101111010111110011110101100111010001010011010000001111010111111110010001100111011001001110110000011 e6b283ec96b5ebe7ace8a681ebfc8cec9d83

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)