Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????}???????????{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 才?鷹?躁?咀棒???}才?鷹?躁?咀棒???{^ 1000110111001011001111111001000111101001001111111110011101001110001111111001100111110000100101100101111100111111001111110011111101111101100011011100101100111111100100011110100100111111111001110100111000111111100110011111000010010110010111110011111100111111001111110111101101011110 8dcb3f91e93fe74e3f99f0965f3f3f3f7d8dcb3f91e93fe74e3f99f0965f3f3f3f7b5e
EUC-JP 才?鷹?躁?咀棒???}才?鷹?躁?咀棒???{^ 1011101011001101001111111100001011101011001111111110110110101111001111111101001011110010110010111100000000111111001111110011111101111101101110101100110100111111110000101110101100111111111011011010111100111111110100101111001011001011110000000011111100111111001111110111101101011110 bacd3fc2eb3fedaf3fd2f2cbc03f3f3f7dbacd3fc2eb3fedaf3fd2f2cbc03f3f3f7b5e
UTF-8 才렱鷹렓躁렕咀棒렮당떼}才렱鷹렓躁렕咀棒렮당떼{^ 111001101000100110001101111010111010000010110001111010011011011110111001111010111010000010010011111010001011101010000001111010111010000010010101111001011001001010000000111001101010001110010010111010111010000010101110111010111000101110111001111010111001011010111100011111011110011010001001100011011110101110100000101100011110100110110111101110011110101110100000100100111110100010111010100000011110101110100000100101011110010110010010100000001110011010100011100100101110101110100000101011101110101110001011101110011110101110010110101111000111101101011110 e6898deba0b1e9b7b9eba093e8ba81eba095e59280e6a392eba0aeeb8bb9eb96bc7de6898deba0b1e9b7b9eba093e8ba81eba095e59280e6a392eba0aeeb8bb9eb96bc7b5e
UHC 才렱鷹렓躁렕咀棒렮당떼}才렱鷹렓躁렕咀棒렮당떼{^ 1110111010100110100011101011111011101011111011011000111010101000111100001110001010001110101010101110111010111010110111001110101010001110101110111011010011100111101101101011110001111101111011101010011010001110101111101110101111101101100011101010100011110000111000101000111010101010111011101011101011011100111010101000111010111011101101001110011110110110101111000111101101011110 eea68ebeebed8ea8f0e28eaaeebadcea8ebbb4e7b6bc7deea68ebeebed8ea8f0e28eaaeebadcea8ebbb4e7b6bc7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
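
The conversion shown in the table can be roughly reproduced offline. The following is a minimal sketch in Python, not this page's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) are assumed mappings to Python's built-in codecs, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` runs in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("才}").items():
        print(label, bits, hexstr)
```

Results for characters outside a charset's repertoire may differ from the table, since different converters can choose different substitution characters.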