To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 艤??毅??厓??}艤??毅??厓??{^ 111001000111111000111111001111111000101101000010001111110011111111111010100011010011111100111111011111011110010001111110001111110011111110001011010000100011111100111111111110101000110100111111001111110111101101011110 e47e3f3f8b423f3ffa8d3f3f7de47e3f3f8b423f3ffa8d3f3f7b5e
EUC-JP 艤??毅??厓??}艤??毅??厓??{^ 1110011111011111001111110011111110110101101000110011111100111111100011111011010011000111001111110011111101111101111001111101111100111111001111111011010110100011001111110011111110001111101101001100011100111111001111110111101101011110 e7df3f3fb5a33f3f8fb4c73f3f7de7df3f3fb5a33f3f8fb4c73f3f7b5e
UTF-8 艤㎩룘毅볦쑇厓붿삌}艤㎩룘毅볦쑇厓붿삌{^ 111010001000100110100100111000111000111010101001111010111010001110011000111001101010111110000101111010111011001110100110111011001001000110000111111001011000111010010011111010111011011010111111111011001000001010001100011111011110100010001001101001001110001110001110101010011110101110100011100110001110011010101111100001011110101110110011101001101110110010010001100001111110010110001110100100111110101110110110101111111110110010000010100011000111101101011110 e889a4e38ea9eba398e6af85ebb3a6ec9187e58e93ebb6bfec828c7de889a4e38ea9eba398e6af85ebb3a6ec9187e58e93ebb6bfec828c7b5e
UHC 艤㎩룘毅볦쑇厓붿삌}艤㎩룘毅볦쑇厓붿삌{^ 111010111111101010100111111001011000111110010100111010111111011010010011111011001001110010100111111001001110110110010100111011001001100010010011011111011110101111111010101001111110010110001111100101001110101111110110100100111110110010011100101001111110010011101101100101001110110010011000100100110111101101011110 ebfaa7e58f94ebf693ec9ca7e4ed94ec98937debfaa7e58f94ebf693ec9ca7e4ed94ec98937b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)