What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????yB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111100101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7942
SJIS-WIN 偲而篠カト失偲示篠ホト示偲而篠叱篠モ」yB 10001110110000111000111010100111100011101100001010110110110001001000111010111000100011101100001110001110101001101000111011000010110011101100010010001110101001101000111011000011100011101010011110001110110000101000111010110110100011101100001011010011101000110111100101000010 8ec38ea78ec2b6c48eb88ec38ea68ec2cec48ea68ec38ea78ec28eb68ec2d3a37942
EUC-JP 偲而篠カト失偲示篠ホト示偲而篠叱篠モ」yB 10111100110001011011110010101001101111001100010010001110101101101000111011000100101111001011101010111100110001011011110010101000101111001100010010001110110011101000111011000100101111001010100010111100110001011011110010101001101111001100010010111100101110001011110011000100100011101101001110001110101000110111100101000010 bcc5bca9bcc48eb68ec4bcbabcc5bca8bcc48ece8ec4bca8bcc5bca9bcc4bcb8bcc48ed38ea37942
UTF-8 偲而篠カト失偲示篠ホト示偲而篠叱篠モ」yB 1110010110000001101100101110100010000000100011001110011110101111101000001110111110111101101101101110111110111110100001001110010110100100101100011110010110000001101100101110011110100100101110101110011110101111101000001110111110111110100011101110111110111110100001001110011110100100101110101110010110000001101100101110100010000000100011001110011110101111101000001110010110001111101100011110011110101111101000001110111110111110100100111110111110111101101000110111100101000010 e581b2e8808ce7afa0efbdb6efbe84e5a4b1e581b2e7a4bae7afa0efbe8eefbe84e7a4bae581b2e8808ce7afa0e58fb1e7afa0efbe93efbda37942
UHC ?而篠??失?示篠??示?而篠叱篠??yB 00111111111011001011101111100001110001100011111100111111111000111111011100111111111000111100011011100001110001100011111100111111111000111100011000111111111011001011101111100001110001101111001011101010111000011100011000111111001111110111100101000010 3fecbbe1c63f3fe3f73fe3c6e1c63f3fe3c63fecbbe1c6f2eae1c63f3f7942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
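
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation: the names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, so a few characters may encode differently than they do on the page. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary digits, hex digits.
    for label, (bits, hex_digits) in convert("偲yB").items():
        print(label, bits, hex_digits)
```

ASCII characters such as "yB" produce the same bytes (79 42) in all five charsets, which is why every row of the table ends in 7942.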