To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????}???????????{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 才?鷹?竣?咀棒???}才?鷹?竣?咀棒???{^ 1000110111001011001111111001000111101001001111111000111101110110001111111001100111110000100101100101111100111111001111110011111101111101100011011100101100111111100100011110100100111111100011110111011000111111100110011111000010010110010111110011111100111111001111110111101101011110 8dcb3f91e93f8f763f99f0965f3f3f3f7d8dcb3f91e93f8f763f99f0965f3f3f3f7b5e
EUC-JP 才?鷹?竣?咀棒???}才?鷹?竣?咀棒???{^ 1011101011001101001111111100001011101011001111111011110111010111001111111101001011110010110010111100000000111111001111110011111101111101101110101100110100111111110000101110101100111111101111011101011100111111110100101111001011001011110000000011111100111111001111110111101101011110 bacd3fc2eb3fbdd73fd2f2cbc03f3f3f7dbacd3fc2eb3fbdd73fd2f2cbc03f3f3f7b5e
UTF-8 才렱鷹렓竣렕咀棒렮당떼}才렱鷹렓竣렕咀棒렮당떼{^ 111001101000100110001101111010111010000010110001111010011011011110111001111010111010000010010011111001111010101110100011111010111010000010010101111001011001001010000000111001101010001110010010111010111010000010101110111010111000101110111001111010111001011010111100011111011110011010001001100011011110101110100000101100011110100110110111101110011110101110100000100100111110011110101011101000111110101110100000100101011110010110010010100000001110011010100011100100101110101110100000101011101110101110001011101110011110101110010110101111000111101101011110 e6898deba0b1e9b7b9eba093e7aba3eba095e59280e6a392eba0aeeb8bb9eb96bc7de6898deba0b1e9b7b9eba093e7aba3eba095e59280e6a392eba0aeeb8bb9eb96bc7b5e
UHC 才렱鷹렓竣렕咀棒렮당떼}才렱鷹렓竣렕咀棒렮당떼{^ 1110111010100110100011101011111011101011111011011000111010101000111100011110001010001110101010101110111010111010110111001110101010001110101110111011010011100111101101101011110001111101111011101010011010001110101111101110101111101101100011101010100011110001111000101000111010101010111011101011101011011100111010101000111010111011101101001110011110110110101111000111101101011110 eea68ebeebed8ea8f1e28eaaeebadcea8ebbb4e7b6bc7deea68ebeebed8ea8f1e28eaaeebadcea8ebbb4e7b6bc7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)