To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????zh 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101001101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7a68
SJIS-WIN シナ示柴シゥシナ痔柴シュシナ痔柴シャzh 1011110011000101100011101010011010001110110001001011110010101001101111001100010110001110101001001000111011000100101111001010110110111100110001011000111010100100100011101100010010111100101011000111101001101000 bcc58ea68ec4bca9bcc58ea48ec4bcadbcc58ea48ec4bcac7a68
EUC-JP シナ示柴シゥシナ痔柴シュシナ痔柴シャzh 1000111010111100100011101100010110111100101010001011110011000110100011101011110010001110101010011000111010111100100011101100010110111100101001101011110011000110100011101011110010001110101011011000111010111100100011101100010110111100101001101011110011000110100011101011110010001110101011000111101001101000 8ebc8ec5bca8bcc68ebc8ea98ebc8ec5bca6bcc68ebc8ead8ebc8ec5bca6bcc68ebc8eac7a68
UTF-8 シナ示柴シゥシナ痔柴シュシナ痔柴シャzh 1110111110111101101111001110111110111110100001011110011110100100101110101110011010011111101101001110111110111101101111001110111110111101101010011110111110111101101111001110111110111110100001011110011110010111100101001110011010011111101101001110111110111101101111001110111110111101101011011110111110111101101111001110111110111110100001011110011110010111100101001110011010011111101101001110111110111101101111001110111110111101101011000111101001101000 efbdbcefbe85e7a4bae69fb4efbdbcefbda9efbdbcefbe85e79794e69fb4efbdbcefbdadefbdbcefbe85e79794e69fb4efbdbcefbdac7a68
UHC ??示柴????痔柴????痔柴??zh 0011111100111111111000111100011011100011110000110011111100111111001111110011111111110110110000001110001111000011001111110011111100111111001111111111011011000000111000111100001100111111001111110111101001101000 3f3fe3c6e3c33f3f3f3ff6c0e3c33f3f3f3ff6c0e3c33f3f7a68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)