What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????}B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
SJIS-WIN 偲識篠疾篠ヲナ磁篠セト痔偲識篠漆篠爾}B 10001110110000111000111010101111100011101100001010001110101111101000111011000010101001101100010110001110101001011000111011000010101111101100010010001110101001001000111011000011100011101010111110001110110000101000111010111101100011101100001010001110101000100111110101000010 8ec38eaf8ec28ebe8ec2a6c58ea58ec2bec48ea48ec38eaf8ec28ebd8ec28ea27d42
EUC-JP 偲識篠疾篠ヲナ磁篠セト痔偲識篠漆篠爾}B 1011110011000101101111001011000110111100110001001011110011000000101111001100010010001110101001101000111011000101101111001010011110111100110001001000111010111110100011101100010010111100101001101011110011000101101111001011000110111100110001001011110010111111101111001100010010111100101001000111110101000010 bcc5bcb1bcc4bcc0bcc48ea68ec5bca7bcc48ebe8ec4bca6bcc5bcb1bcc4bcbfbcc4bca47d42
UTF-8 偲識篠疾篠ヲナ磁篠セト痔偲識篠漆篠爾}B 1110010110000001101100101110100010101101100110001110011110101111101000001110011110010110101111101110011110101111101000001110111110111101101001101110111110111110100001011110011110100011100000011110011110101111101000001110111110111101101111101110111110111110100001001110011110010111100101001110010110000001101100101110100010101101100110001110011110101111101000001110011010111100100001101110011110101111101000001110011110001000101111100111110101000010 e581b2e8ad98e7afa0e796bee7afa0efbda6efbe85e7a381e7afa0efbdbeefbe84e79794e581b2e8ad98e7afa0e6bc86e7afa0e788be7d42
UHC ?識篠疾篠??磁篠??痔?識篠漆篠爾}B 0011111111100011110110111110000111000110111100101111000011100001110001100011111100111111111011011011100011100001110001100011111100111111111101101100000000111111111000111101101111100001110001101111011011010100111000011100011011101100101100110111110101000010 3fe3dbe1c6f2f0e1c63f3fedb8e1c63f3ff6c03fe3dbe1c6f6d4e1c6ecb37d42

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
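
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and encode_table. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to match the behavior in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # "}B" is the ASCII tail shared by every row of the table above.
    for label, (bits, hex_string) in encode_table("}B").items():
        print(label, bits, hex_string)
```

Because "}" and "B" are ASCII, every charset in the sketch yields the same two bytes (7d 42) for them, as in the last four hex digits of each table row.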