Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | 蔕駕讐茘器拾苒 | 1110010011110110100010011110110110001111010100011110010010101011100010101110110110001111010001011110010010010010 | e4f689ed8f51e4ab8aed8f45e492 |
| EUC-JP | 蔕駕讐茘器拾苒 | 1110100011111000101100101110111110111101101100101110100010101101101101001110111110111101101001101110011111110010 | e8f8b2efbdb2e8adb4efbda6e7f2 |
| UTF-8 | 蔕駕讐茘器拾苒 | 111010001001010010010101111010011010011110010101111010001010111010010000111010001000110010011000111001011001100110101000111001101000101110111110111010001000101110010010 | e89495e9a795e8ae90e88c98e599a8e68bbee88b92 |
| UHC | ?駕讐?器拾苒 | 001111111100101010111101111000101100001000111111110100001110111111100011101001101110011011111110 | 3fcabde2c23fd0efe3a6e6fe |
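
A conversion like the one tabulated above can be approximated in a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. The function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the codec names are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC. Characters a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row, though which characters count as unrepresentable may differ between Python's codecs and the converter.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the Python equivalent
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC taken to be Windows code page 949
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "蔕駕讐茘器拾苒"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

The binary column is the hexadecimal column written out bit by bit, so every two hex digits correspond to eight binary digits.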

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).