What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??恝泣 0011111100111111001111111000101110000011001111110011111111111010101111001000101110000011 3f3f3f8b833f3ffabc8b83
EUC-JP ???泣??恝泣 001111110011111100111111101101011110001100111111001111111000111110111101111001111011010111100011 3f3f3fb5e33f3f8fbde7b5e3
UTF-8 囹덈뿰泣앮뮄恝泣 111011111010011010101001111010111000110110001000111010111011111110110000111001101011001110100011111011001001010110101110111010111010111010000100111001101000000110011101111001101011001110100011 efa6a9eb8d88ebbfb0e6b3a3ec95aeebae84e6819de6b3a3
UHC 囹덈뿰泣앮뮄恝泣 11100111101010101000100011101011100101111011000011101011111010001001110111100110100100101001001111001110101111111110101111101000 e7aa88eb97b0ebe89de69293cebfebe8

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
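
A conversion like the one in the table can be roughly reproduced with a short Python sketch. This is not the code behind this page. The codec names are Python's own, and the mapping from the charset labels to them is an assumption (`cp932` for SJIS-Win, `cp949` for UHC). The helper `encode_row` is a hypothetical name. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` seen in the table.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: Python's cp932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row("泣", codec)
        print(label, bits, hexa)
```

For the single character 泣, this gives `8b83` for CP932, `b5e3` for EUC-JP, and `e6b3a3` for UTF-8, the same byte values that appear for that character in the rows above.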