To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????J}??????J{^ 0011111100111111001111110011111100111111001111110100101001111101001111110011111100111111001111110011111100111111010010100111101101011110 3f3f3f3f3f3f4a7d3f3f3f3f3f3f4a7b5e
SJIS-WIN 張??欲??J}張??欲??J{^ 100100101010001100111111001111111001011101111110001111110011111101001010011111011001001010100011001111110011111110010111011111100011111100111111010010100111101101011110 92a33f3f977e3f3f4a7d92a33f3f977e3f3f4a7b5e
EUC-JP 張??欲??J}張??欲??J{^ 110001001010010100111111001111111100110111011111001111110011111101001010011111011100010010100101001111110011111111001101110111110011111100111111010010100111101101011110 c4a53f3fcddf3f3f4a7dc4a53f3fcddf3f3f4a7b5e
UTF-8 張㏆슭欲ㅿ슛J}張㏆슭欲ㅿ슛J{^ 1110010110111100101101011110001110001111100001101110110010001010101011011110011010101100101100101110001110000101101111111110110010001010100110110100101001111101111001011011110010110101111000111000111110000110111011001000101010101101111001101010110010110010111000111000010110111111111011001000101010011011010010100111101101011110 e5bcb5e38f86ec8aade6acb2e385bfec8a9b4a7de5bcb5e38f86ec8aade6acb2e385bfec8a9b4a7b5e
UHC 張㏆슭欲ㅿ슛J}張㏆슭欲ㅿ슛J{^ 1110110111100101101001111110111110111101101111101110100110110000101001001110111110111101101110000100101001111101111011011110010110100111111011111011110110111110111010011011000010100100111011111011110110111000010010100111101101011110 ede5a7efbdbee9b0a4efbdb84a7dede5a7efbdbee9b0a4efbdb84a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)