To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??諭??遙 00111111001111110011111110001011100000110011111100111111100101110100000000111111001111111110101010100001 3f3f3f8b833f3f97403f3feaa1
EUC-JP ???泣??諭??遙 00111111001111110011111110110101111000110011111100111111110011011010000100111111001111111111010010100011 3f3f3fb5e33f3fcda13f3ff4a3
UTF-8 囹덈냵泣섉븭諭꾩뒫遙 111011111010011010101001111010111000110110001000111010111000001110110101111001101011001110100011111011001000010010001001111010111011100010101101111010001010101110101101111010101011111010101001111010111001001010101011111010011000000110011001 efa6a9eb8d88eb83b5e6b3a3ec8489ebb8ade8abadeabea9eb92abe98199
UHC 囹덈냵泣섉븭諭꾩뒫遙 1110011110101010100010001110101110000110100001011110101111101000100110001110011010010101100101101110101110110001100001001110110010001010101001011110100110101011 e7aa88eb8685ebe898e69596ebb184ec8aa5e9ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)