To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??泣??音?? 100010010100011100111111001111111000101110000011001111110011111110001001101110010011111100111111 89473f3f8b833f3f89b93f3f
EUC-JP 烏??泣??音?? 101100011010100000111111001111111011010111100011001111110011111110110010101110110011111100111111 b1a83f3fb5e33f3fb2bb3f3f
UTF-8 烏띾쪋泣ㅸ쵟音쎌돽 111001111000001110001111111010111001110110111110111011001010101010001011111001101011001110100011111000111000010110111000111011001011010110011111111010011001111110110011111011001000111010001100111010111000111110111101 e7838feb9dbeecaa8be6b3a3e385b8ecb59fe99fb3ec8e8ceb8fbd
UHC 烏띾쪋泣ㅸ쵟音쎌돽 111010001010000110001101111010111010010110000101111010111110100010100100111010001010110010100000111010111110010110111101111011001000100110111111 e8a18deba585ebe8a4e8aca0ebe5bdec89bf

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
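
The same kind of conversion can be approximated offline. The following is a minimal sketch in Python, not the implementation behind this page: the function name encode_table is hypothetical, and the codec names are Python's own, chosen on the assumption that "SJIS-WIN" corresponds to cp932 and "UHC" to cp949. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is assumed to match the 3f bytes shown in the table above. Results for rarely used characters may differ slightly from this page's output.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    The charset names are Python codec names; their mapping to the labels
    used on this page (SJIS-WIN -> cp932, UHC -> cp949) is an assumption.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("烏"):
        print(name, bits, hexa)
```

Each byte is formatted as exactly eight bits so that leading zeros are kept and the binary string lines up with the hexadecimal one, two hex digits per byte.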