To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????A 00111111001111110011111100111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f41
SJIS-WIN 佯??泣??音??A 10011000110100010011111100111111100010111000001100111111001111111000100110111001001111110011111101000001 98d13f3f8b833f3f89b93f3f41
EUC-JP 佯??泣??音??A 11010000110100110011111100111111101101011110001100111111001111111011001010111011001111110011111101000001 d0d33f3fb5e33f3fb2bb3f3f41
UTF-8 佯뺤눦泣앭틫音쎌뎵A 11100100101111011010111111101011101110101010010011101011100010001010011011100110101100111010001111101100100101011010110111101101100010111010101111101001100111111011001111101100100011101000110011101011100011101011010101000001 e4bdafebbaa4eb88a6e6b3a3ec95aded8babe99fb3ec8e8ceb8eb541
UHC 佯뺤눦泣앭틫音쎌뎵A 11100101101110101001010111101100100001111011110111101011111010001001110111100101101110101001010111101011111001011011110111101100100010011000100001000001 e5ba95ec87bdebe89de5ba95ebe5bdec898841

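SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively. A character that a given charset cannot represent is shown as "?" (hex 3f).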
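
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Python's codecs may differ from this tool's in edge cases, so the output is not guaranteed to match the table byte for byte.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Characters the charset cannot represent are replaced by "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, zero-padded, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexa) in convert("あA").items():
        print(label, shown, binary, hexa)
```

Using `errors="replace"` mirrors the "?" substitution visible in the ISO-8859-1, SJIS-WIN and EUC-JP rows; with the default `errors="strict"` Python would raise `UnicodeEncodeError` instead.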