What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????V 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f56
SJIS-WIN シナシ・シト灼柴失芝シ・シト疾柴酌芝シ・V 1011110011000101101111001010010110111100110001001000111011011100100011101100010010001110101110001000111011000101101111001010010110111100110001001000111010111110100011101100010010001110110111101000111011000101101111001010010101010110 bcc5bca5bcc48edc8ec48eb88ec5bca5bcc48ebe8ec48ede8ec5bca556
EUC-JP シナシ・シト灼柴失芝シ・シト疾柴酌芝シ・V 1000111010111100100011101100010110001110101111001000111010100101100011101011110010001110110001001011110011011110101111001100011010111100101110101011110011000111100011101011110010001110101001011000111010111100100011101100010010111100110000001011110011000110101111001110000010111100110001111000111010111100100011101010010101010110 8ebc8ec58ebc8ea58ebc8ec4bcdebcc6bcbabcc78ebc8ea58ebc8ec4bcc0bcc6bce0bcc78ebc8ea556
UTF-8 シナシ・シト灼柴失芝シ・シト疾柴酌芝シ・V 11101111101111011011110011101111101111101000010111101111101111011011110011101111101111011010010111101111101111011011110011101111101111101000010011100111100000011011110011100110100111111011010011100101101001001011000111101000100010101001110111101111101111011011110011101111101111011010010111101111101111011011110011101111101111101000010011100111100101101011111011100110100111111011010011101001100001011000110011101000100010101001110111101111101111011011110011101111101111011010010101010110 efbdbcefbe85efbdbcefbda5efbdbcefbe84e781bce69fb4e5a4b1e88a9defbdbcefbda5efbdbcefbe84e796bee69fb4e9858ce88a9defbdbcefbda556
UHC ??????灼柴失芝????疾柴酌芝??V 0011111100111111001111110011111100111111001111111110110111000111111000111100001111100011111101111111001010111001001111110011111100111111001111111111001011110000111000111100001111101101110011001111001010111001001111110011111101010110 3f3f3f3f3f3fedc7e3c3e3f7f2b93f3f3f3ff2f0e3c3edccf2b93f3f56

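SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is replaced with "?" (0x3f), as in the ISO-8859-1 and UHC rows.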
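
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, and the names CHARSETS, encode_bits and convert are hypothetical. Python's codecs may differ from this page's converter for a few characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("V").items():
        print(label, binary, hexadecimal)
```

For the trailing "V" in the table, every charset above yields the same single byte, 0x56 (01010110), because all of them are ASCII-compatible for that character.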