To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????a???????????aB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110000101000010 3f3f3f3f3f3f3f3f3f3f3f613f3f3f3f3f3f3f3f3f3f3f6142
SJIS-WIN 篠テォ篠ツト蒔篠ツト識a篠テォ篠ツト蒔篠ツト識aB 1000111011000010110000111010101110001110110000101100001011000100100011101010101010001110110000101100001011000100100011101010111101100001100011101100001011000011101010111000111011000010110000101100010010001110101010101000111011000010110000101100010010001110101011110110000101000010 8ec2c3ab8ec2c2c48eaa8ec2c2c48eaf618ec2c3ab8ec2c2c48eaa8ec2c2c48eaf6142
EUC-JP 篠テォ篠ツト蒔篠ツト識a篠テォ篠ツト蒔篠ツト識aB 1011110011000100100011101100001110001110101010111011110011000100100011101100001010001110110001001011110010101100101111001100010010001110110000101000111011000100101111001011000101100001101111001100010010001110110000111000111010101011101111001100010010001110110000101000111011000100101111001010110010111100110001001000111011000010100011101100010010111100101100010110000101000010 bcc48ec38eabbcc48ec28ec4bcacbcc48ec28ec4bcb161bcc48ec38eabbcc48ec28ec4bcacbcc48ec28ec4bcb16142
UTF-8 篠テォ篠ツト蒔篠ツト識a篠テォ篠ツト蒔篠ツト識aB 111001111010111110100000111011111011111010000011111011111011110110101011111001111010111110100000111011111011111010000010111011111011111010000100111010001001001010010100111001111010111110100000111011111011111010000010111011111011111010000100111010001010110110011000011000011110011110101111101000001110111110111110100000111110111110111101101010111110011110101111101000001110111110111110100000101110111110111110100001001110100010010010100101001110011110101111101000001110111110111110100000101110111110111110100001001110100010101101100110000110000101000010 e7afa0efbe83efbdabe7afa0efbe82efbe84e89294e7afa0efbe82efbe84e8ad9861e7afa0efbe83efbdabe7afa0efbe82efbe84e89294e7afa0efbe82efbe84e8ad986142
UHC 篠??篠??蒔篠??識a篠??篠??蒔篠??識aB 1110000111000110001111110011111111100001110001100011111100111111111000111100100011100001110001100011111100111111111000111101101101100001111000011100011000111111001111111110000111000110001111110011111111100011110010001110000111000110001111110011111111100011110110110110000101000010 e1c63f3fe1c63f3fe3c8e1c63f3fe3db61e1c63f3fe1c63f3fe3c8e1c63f3fe3db6142

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
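
The rows of the table above can be reproduced approximately with a short Python sketch. It is not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?` (hex `3f`), which matches the `3f` bytes seen in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("aB").items():
        print(label, bits, hex_string)
```

For pure ASCII input such as `aB`, every charset in the table yields the same bytes (`6142`). The encodings only diverge for characters outside ASCII.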