What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 臾노ぞ維묐Л維롫 1110100010000111101111101110101110000101101110001110001110000001100111101110011110110110101011011110101110101100100100001101000010011011111001111011011010101101111010111010000110101011 e887beeb85b8e3819ee7b6adebac90d09be7b6adeba1ab
SJIS-WIN ??????????¶??¬????¶???? 0011111100111111001111110011111100111111001111110011111100111111001111110011111110000001111101110011111100111111100000011100101000111111001111110011111100111111100000011111011100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f81f73f3f81ca3f3f3f3f81f73f3f3f3f
EUC-JP è??ë?¸ã??ç¶?ë¬???ç¶?ë¡? 1000111110101011101100100011111100111111100011111010101110110011001111111000111110100010101100011000111110101011101010100011111100111111100011111010101110101110101000101111100100111111100011111010101110110011101000101100110000111111001111110011111110001111101010111010111010100010111110010011111110001111101010111011001110001111101000101100001000111111 8fabb23f3f8fabb33f8fa2b18fabaa3f3f8fabaea2f93f8fabb3a2cc3f3f3f8fabaea2f93f8fabb38fa2c23f
UTF-8 臾노ぞ維묐Л維롫 11000011101010001100001010000111110000101011111011000011101010111100001010000101110000101011100011000011101000111100001010000001110000101001111011000011101001111100001010110110110000101010110111000011101010111100001010101100110000101001000011000011100100001100001010011011110000111010011111000010101101101100001010101101110000111010101111000010101000011100001010101011 c3a8c287c2bec3abc285c2b8c3a3c281c29ec3a7c2b6c2adc3abc2acc290c390c29bc3a7c2b6c2adc3abc2a1c2ab
UHC ??¾??¸????¶­???Ð??¶­?¡? 00111111001111111010100011111010001111110011111110100010101011000011111100111111001111110011111110100010110100101010000110101001001111110011111100111111101010001010001000111111001111111010001011010010101000011010100100111111101000101010111000111111 3f3fa8fa3f3fa2ac3f3f3f3fa2d2a1a93f3f3fa8a23f3fa2d2a1a93fa2ae3f

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
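
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. It assumes Python's built-in codecs `cp932` and `cp949` are close enough stand-ins for SJIS-WIN and UHC (their mapping tables may differ from this tool's in edge cases), and it assumes unencodable characters should become "?" (0x3f), as the table's output suggests. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

For the single character "あ" (U+3042), for example, the UTF-8 row comes out as e38182 and the ISO-8859-1 row as 3f, since Latin-1 has no hiragana.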