What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 陷・霑手悋鮓戎 11101000100111001010010111101000101111111000111011101000100111001010010111101001101101101000111101011110 e89ca5e8bf8ee89ca5e9b68f5e
EUC-JP 陷・霑手悋鮓戎 1110111111111100100011101010010111110000110000011011110011101010110110001010011111110010101110001011110110111111 effc8ea5f0c1bcead8a7f2b8bdbf
UTF-8 陷・霑手悋鮓戎 111010011001100110110111111011111011110110100101111010011001110010010001111001101000100110001011111001101000001010001011111010011010111010010011111001101000100010001110 e999b7efbda5e99c91e6898be6828be9ae93e6888e
UHC 陷?霑手??戎 1111100111101000001111111110111111000101111000101010001000111111001111111110101111010100 f9e83fefc5e2a23f3febd4

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
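
The conversion this page performs can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the function name `encode_report` is hypothetical, and the Python codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is how the ISO-8859-1 row above appears to behave. The output need not match the table above byte for byte.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("あ"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.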