To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????W 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f57
SJIS-WIN 偲悉篠軸篠セナ式篠フトクナ悉篠ワト汐W 1000111011000011100011101011101110001110110000101000111010110010100011101100001010111110110001011000111010101110100011101100001011001100110001001011100011000101100011101011101110001110110000101101110011000100100011101010110001010111 8ec38ebb8ec28eb28ec2bec58eae8ec2ccc4b8c58ebb8ec2dcc48eac57
EUC-JP 偲悉篠軸篠セナ式篠フトクナ悉篠ワト汐W 10111100110001011011110010111101101111001100010010111100101101001011110011000100100011101011111010001110110001011011110010110000101111001100010010001110110011001000111011000100100011101011100010001110110001011011110010111101101111001100010010001110110111001000111011000100101111001010111001010111 bcc5bcbdbcc4bcb4bcc48ebe8ec5bcb0bcc48ecc8ec48eb88ec5bcbdbcc48edc8ec4bcae57
UTF-8 偲悉篠軸篠セナ式篠フトクナ悉篠ワト汐W 11100101100000011011001011100110100000101000100111100111101011111010000011101000101110111011100011100111101011111010000011101111101111011011111011101111101111101000010111100101101111001000111111100111101011111010000011101111101111101000110011101111101111101000010011101111101111011011100011101111101111101000010111100110100000101000100111100111101011111010000011101111101111101001110011101111101111101000010011100110101100011001000001010111 e581b2e68289e7afa0e8bbb8e7afa0efbdbeefbe85e5bc8fe7afa0efbe8cefbe84efbdb8efbe85e68289e7afa0efbe9cefbe84e6b19057
UHC ?悉篠軸篠??式篠????悉篠??汐W 00111111111000111111101011100001110001101111010111101110111000011100011000111111001111111110001111010010111000011100011000111111001111110011111100111111111000111111101011100001110001100011111100111111111000001011000101010111 3fe3fae1c6f5eee1c63f3fe3d2e1c63f3f3f3fe3fae1c63f3fe0b157

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)