To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????A 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f41
SJIS-WIN 偲耳篠疾篠竺偲識篠漆篠イナ汐篠叱篠竺A 1000111011000011100011101010100010001110110000101000111010111110100011101100001010001110101100011000111011000011100011101010111110001110110000101000111010111101100011101100001010110010110001011000111010101100100011101100001010001110101101101000111011000010100011101011000101000001 8ec38ea88ec28ebe8ec28eb18ec38eaf8ec28ebd8ec2b2c58eac8ec28eb68ec28eb141
EUC-JP 偲耳篠疾篠竺偲識篠漆篠イナ汐篠叱篠竺A 10111100110001011011110010101010101111001100010010111100110000001011110011000100101111001011001110111100110001011011110010110001101111001100010010111100101111111011110011000100100011101011001010001110110001011011110010101110101111001100010010111100101110001011110011000100101111001011001101000001 bcc5bcaabcc4bcc0bcc4bcb3bcc5bcb1bcc4bcbfbcc48eb28ec5bcaebcc4bcb8bcc4bcb341
UTF-8 偲耳篠疾篠竺偲識篠漆篠イナ汐篠叱篠竺A 11100101100000011011001011101000100000001011001111100111101011111010000011100111100101101011111011100111101011111010000011100111101010111011101011100101100000011011001011101000101011011001100011100111101011111010000011100110101111001000011011100111101011111010000011101111101111011011001011101111101111101000010111100110101100011001000011100111101011111010000011100101100011111011000111100111101011111010000011100111101010111011101001000001 e581b2e880b3e7afa0e796bee7afa0e7abbae581b2e8ad98e7afa0e6bc86e7afa0efbdb2efbe85e6b190e7afa0e58fb1e7afa0e7abba41
UHC ?耳篠疾篠竺?識篠漆篠??汐篠叱篠竺A 001111111110110010111100111000011100011011110010111100001110000111000110111101011110011100111111111000111101101111100001110001101111011011010100111000011100011000111111001111111110000010110001111000011100011011110010111010101110000111000110111101011110011101000001 3fecbce1c6f2f0e1c6f5e73fe3dbe1c6f6d4e1c63f3fe0b1e1c6f2eae1c6f5e741

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)