To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN シエ上眈シシエシウシエ上眈シシエシウB 1111000011100011101111001011010010001111111000111110000110111100101111001111000011100011101111001011010011110010111100011011110010110011111100001110001110111100101101001000111111100011111000011011110010111100111100001110001110111100101101001111001011110001101111001011001101000010 f0e3bcb48fe3e1bcbcf0e3bcb4f2f1bcb3f0e3bcb48fe3e1bcbcf0e3bcb4f2f1bcb342
EUC-JP ?シエ上眈シ?シエ?シウ?シエ上眈シ?シエ?シウB 00111111100011101011110010001110101101001011111011100101111000101011111010001110101111000011111110001110101111001000111010110100001111111000111010111100100011101011001100111111100011101011110010001110101101001011111011100101111000101011111010001110101111000011111110001110101111001000111010110100001111111000111010111100100011101011001101000010 3f8ebc8eb4bee5e2be8ebc3f8ebc8eb43f8ebc8eb33f8ebc8eb4bee5e2be8ebc3f8ebc8eb43f8ebc8eb342
UTF-8 シエ上眈シシエシウシエ上眈シシエシウB 11101110100000101010001011101111101111011011110011101111101111011011010011100100101110001000101011100111100111001000100011101111101111011011110011101110100000101010001011101111101111011011110011101111101111011011010011101110100010001010100011101111101111011011110011101111101111011011001111101110100000101010001011101111101111011011110011101111101111011011010011100100101110001000101011100111100111001000100011101111101111011011110011101110100000101010001011101111101111011011110011101111101111011011010011101110100010001010100011101111101111011011110011101111101111011011001101000010 ee82a2efbdbcefbdb4e4b88ae79c88efbdbcee82a2efbdbcefbdb4ee88a8efbdbcefbdb3ee82a2efbdbcefbdb4e4b88ae79c88efbdbcee82a2efbdbcefbdb4ee88a8efbdbcefbdb342
UHC ???上眈??????????上眈???????B 0011111100111111001111111101111110111110111101111010111100111111001111110011111100111111001111110011111100111111001111110011111100111111110111111011111011110111101011110011111100111111001111110011111100111111001111110011111101000010 3f3f3fdfbef7af3f3f3f3f3f3f3f3f3f3fdfbef7af3f3f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)