To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 阯・竊庭・シ蝣餓セ、阯・竊庭・シ蝣餓セ、B 11101000100101111010010111100010100001101001001011101011101001011011110011100101101000001000100111101100101111101010010011101000100101111010010111100010100001101001001011101011101001011011110011100101101000001000100111101100101111101010010001000010 e897a5e28692eba5bce5a089ecbea4e897a5e28692eba5bce5a089ecbea442
EUC-JP 阯・竊庭・シ蝣餓セ、阯・竊庭・シ蝣餓セ、B 1110111111110111100011101010010111100011111001101100010011101101100011101010010110001110101111001110101010100010101100101110111010001110101111101000111010100100111011111111011110001110101001011110001111100110110001001110110110001110101001011000111010111100111010101010001010110010111011101000111010111110100011101010010001000010 eff78ea5e3e6c4ed8ea58ebceaa2b2ee8ebe8ea4eff78ea5e3e6c4ed8ea58ebceaa2b2ee8ebe8ea442
UTF-8 阯・竊庭・シ蝣餓セ、阯・竊庭・シ蝣餓セ、B 11101001100110001010111111101111101111011010010111100111101010111000101011100101101110101010110111101111101111011010010111101111101111011011110011101000100111011010001111101001101001001001001111101111101111011011111011101111101111011010010011101001100110001010111111101111101111011010010111100111101010111000101011100101101110101010110111101111101111011010010111101111101111011011110011101000100111011010001111101001101001001001001111101111101111011011111011101111101111011010010001000010 e998afefbda5e7ab8ae5baadefbda5efbdbce89da3e9a493efbdbeefbda4e998afefbda5e7ab8ae5baadefbda5efbdbce89da3e9a493efbdbeefbda442
UHC ??竊庭???餓????竊庭???餓??B 001111110011111111101111101111001110111111010100001111110011111100111111111001001011101100111111001111110011111100111111111011111011110011101111110101000011111100111111001111111110010010111011001111110011111101000010 3f3fefbcefd43f3f3fe4bb3f3f3f3fefbcefd43f3f3fe4bb3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)