To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????W 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f57
SJIS-WIN 偲耳篠タト失偲辞篠コトーナ汐篠宍篠竺W 10001110110000111000111010101000100011101100001011000000110001001000111010111000100011101100001110001110101010111000111011000010101110101100010010110000110001011000111010101100100011101100001010001110101100111000111011000010100011101011000101010111 8ec38ea88ec2c0c48eb88ec38eab8ec2bac4b0c58eac8ec28eb38ec28eb157
EUC-JP 偲耳篠タト失偲辞篠コトーナ汐篠宍篠竺W 10111100110001011011110010101010101111001100010010001110110000001000111011000100101111001011101010111100110001011011110010101101101111001100010010001110101110101000111011000100100011101011000010001110110001011011110010101110101111001100010010111100101101011011110011000100101111001011001101010111 bcc5bcaabcc48ec08ec4bcbabcc5bcadbcc48eba8ec48eb08ec5bcaebcc4bcb5bcc4bcb357
UTF-8 偲耳篠タト失偲辞篠コトーナ汐篠宍篠竺W 11100101100000011011001011101000100000001011001111100111101011111010000011101111101111101000000011101111101111101000010011100101101001001011000111100101100000011011001011101000101111101001111011100111101011111010000011101111101111011011101011101111101111101000010011101111101111011011000011101111101111101000010111100110101100011001000011100111101011111010000011100101101011101000110111100111101011111010000011100111101010111011101001010111 e581b2e880b3e7afa0efbe80efbe84e5a4b1e581b2e8be9ee7afa0efbdbaefbe84efbdb0efbe85e6b190e7afa0e5ae8de7afa0e7abba57
UHC ?耳篠??失??篠????汐篠?篠竺W 001111111110110010111100111000011100011000111111001111111110001111110111001111110011111111100001110001100011111100111111001111110011111111100000101100011110000111000110001111111110000111000110111101011110011101010111 3fecbce1c63f3fe3f73f3fe1c63f3f3f3fe0b1e1c63fe1c6f5e757

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)