To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 篠セト辞偲磁篠セト蒔篠式篠セト蒔篠質篠治篠鹿B 100011101100001010111110110001001000111010101011100011101100001110001110101001011000111011000010101111101100010010001110101010101000111011000010100011101010111010001110110000101011111011000100100011101010101010001110110000101000111010111111100011101100001010001110101000011000111011000010100011101010110101000010 8ec2bec48eab8ec38ea58ec2bec48eaa8ec28eae8ec2bec48eaa8ec28ebf8ec28ea18ec28ead42
EUC-JP 篠セト辞偲磁篠セト蒔篠式篠セト蒔篠質篠治篠鹿B 101111001100010010001110101111101000111011000100101111001010110110111100110001011011110010100111101111001100010010001110101111101000111011000100101111001010110010111100110001001011110010110000101111001100010010001110101111101000111011000100101111001010110010111100110001001011110011000001101111001100010010111100101000111011110011000100101111001010111101000010 bcc48ebe8ec4bcadbcc5bca7bcc48ebe8ec4bcacbcc4bcb0bcc48ebe8ec4bcacbcc4bcc1bcc4bca3bcc4bcaf42
UTF-8 篠セト辞偲磁篠セト蒔篠式篠セト蒔篠質篠治篠鹿B 11100111101011111010000011101111101111011011111011101111101111101000010011101000101111101001111011100101100000011011001011100111101000111000000111100111101011111010000011101111101111011011111011101111101111101000010011101000100100101001010011100111101011111010000011100101101111001000111111100111101011111010000011101111101111011011111011101111101111101000010011101000100100101001010011100111101011111010000011101000101100111010101011100111101011111010000011100110101100101011101111100111101011111010000011101001101110011011111101000010 e7afa0efbdbeefbe84e8be9ee581b2e7a381e7afa0efbdbeefbe84e89294e7afa0e5bc8fe7afa0efbdbeefbe84e89294e7afa0e8b3aae7afa0e6b2bbe7afa0e9b9bf42
UHC 篠????磁篠??蒔篠式篠??蒔篠質篠治篠鹿B 11100001110001100011111100111111001111110011111111101101101110001110000111000110001111110011111111100011110010001110000111000110111000111101001011100001110001100011111100111111111000111100100011100001110001101111001011110101111000011100011011110110101111011110000111000110110101101110001101000010 e1c63f3f3f3fedb8e1c63f3fe3c8e1c6e3d2e1c63f3fe3c8e1c6f2f5e1c6f6bde1c6d6e342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)