To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 張θ?欲??箋??墻 100100101010001110000011110001100011111110010111011111100011111100111111111000101011001100111111001111111001101011010100 92a383c63f977e3f3fe2b33f3f9ad4
EUC-JP 張θ?欲??箋??墻 110001001010010110100110110010000011111111001101110111110011111100111111111001001011010100111111001111111101010011010110 c4a5a6c83fcddf3f3fe4b53f3fd4d6
UTF-8 張θ뻗欲꿩꼇箋묊즯墻 1110010110111100101101011100111010111000111010111011101110010111111001101010110010110010111010101011111110101001111010101011110010000111111001111010111010001011111010111010110010001010111011001010011010101111111001011010001010111011 e5bcb5ceb8ebbb97e6acb2eabfa9eabc87e7ae8bebac8aeca6afe5a2bb
UHC 張θ뻗欲꿩꼇箋묊즯墻 1110110111100101101001011110100010111011101110001110100110110000101100101110011010110010101110111110111110101000100100011110011110100011100000011110110111011111 ede5a5e8bbb8e9b0b2e6b2bbefa891e7a381eddf

SJIS-win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
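
The conversion shown in the table can be approximated with a short Python sketch. This is only an illustration, not the tool's actual implementation: the function name `encode_bits` is hypothetical, the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, and `errors="replace"` is assumed in order to mirror the `3f` ("?") bytes that appear where a character cannot be represented.

```python
def encode_bits(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(charset, errors="replace")
    # Each byte becomes eight binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits("張θ", codec)
        print(label, binary, hexadecimal)
```

For the single character "張", this sketch yields `e5bcb5` in UTF-8 and `3f` in ISO-8859-1, which agrees with the leading bytes of the corresponding rows above.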