To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??泣??矣?? 100010001010001100111111001111111000101110000011001111110011111111100001111000010011111100111111 88a33f3f8b833f3fe1e13f3f
EUC-JP 哀??泣??矣?? 101100001010010100111111001111111011010111100011001111110011111111100010111000110011111100111111 b0a53f3fb5e33f3fe2e33f3f
UTF-8 哀얜벝泣곫뿿矣몄댉 111001011001001110000000111011001001011010011100111010111011001010011101111001101011001110100011111010101011001110101011111010111011111110111111111001111001111110100011111010111010101010000100111010111000110010001001 e59380ec969cebb29de6b3a3eab3abebbfbfe79fa3ebaa84eb8c89
UHC 哀얜벝泣곫뿿矣몄댉 111001001110111010111110111010111001001110111000111010111110100010000001111001101001011110111111111010111111100010111000111011001000100010110010 e4eebeeb93b8ebe881e697bfebf8b8ec88b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)