To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????m}????????m{^ 001111110011111100111111001111110011111100111111001111110011111101101101011111010011111100111111001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f3f3f3f3f3f6d7d3f3f3f3f3f3f3f3f6d7b5e
SJIS-WIN 才?蜘?粧?醍?m}才?蜘?粧?醍?m{^ 1000110111001011001111111001001001110111001111111000111111001111001111111001000111100111001111110110110101111101100011011100101100111111100100100111011100111111100011111100111100111111100100011110011100111111011011010111101101011110 8dcb3f92773f8fcf3f91e73f6d7d8dcb3f92773f8fcf3f91e73f6d7b5e
EUC-JP 才?蜘?粧?醍?m}才?蜘?粧?醍?m{^ 1011101011001101001111111100001111011000001111111011111011010001001111111100001011101001001111110110110101111101101110101100110100111111110000111101100000111111101111101101000100111111110000101110100100111111011011010111101101011110 bacd3fc3d83fbed13fc2e93f6d7dbacd3fc3d83fbed13fc2e93f6d7b5e
UTF-8 才렭蜘렲粧렰醍렕m}才렭蜘렲粧렰醍렕m{^ 1110011010001001100011011110101110100000101011011110100010011100100110001110101110100000101100101110011110110010101001111110101110100000101100001110100110000110100011011110101110100000100101010110110101111101111001101000100110001101111010111010000010101101111010001001110010011000111010111010000010110010111001111011001010100111111010111010000010110000111010011000011010001101111010111010000010010101011011010111101101011110 e6898deba0ade89c98eba0b2e7b2a7eba0b0e9868deba0956d7de6898deba0ade89c98eba0b2e7b2a7eba0b0e9868deba0956d7b5e
UHC 才렭蜘렲粧렰醍렕m}才렭蜘렲粧렰醍렕m{^ 11101110101001101000111010111010111100101011101110001110101111111110110111110010100011101011110111110000101101011000111010101010011011010111110111101110101001101000111010111010111100101011101110001110101111111110110111110010100011101011110111110000101101011000111010101010011011010111101101011110 eea68ebaf2bb8ebfedf28ebdf0b58eaa6d7deea68ebaf2bb8ebfedf28ebdf0b58eaa6d7b5e
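
The conversion shown in the table can be reproduced roughly as follows. This is only a sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names (`CODECS`) and the helper `bit_strings` are assumptions made for illustration. Unencodable characters are replaced with "?" (0x3f), which matches the "?" entries in the ISO-8859-1 row.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = bit_strings("才m}", codec)
        print(label, binary, hexadecimal)
```

The binary column is always eight times as long as the byte count and four times as long as the hexadecimal column, which is a quick way to sanity-check a row.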

SJIS-Win, EUC-JP: Classic Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
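
As an illustration of how the two Japanese charsets relate, the first two bytes of the SJIS-WIN row (8dcb) and of the EUC-JP row (bacd) in the table can be decoded back to text. This sketch assumes that Python's `cp932` and `euc_jp` codecs correspond to the page's SJIS-WIN and EUC-JP labels. The variable names are illustrative only.

```python
# Byte values taken from the SJIS-WIN and EUC-JP rows of the table.
sjis_bytes = bytes.fromhex("8dcb")
euc_bytes = bytes.fromhex("bacd")

# Assumption: "cp932" and "euc_jp" are the Python codecs that correspond
# to the page's SJIS-WIN and EUC-JP labels.
from_sjis = sjis_bytes.decode("cp932")
from_euc = euc_bytes.decode("euc_jp")

# Report whether both byte sequences decode to the same character,
# along with that character's Unicode code point.
print(from_sjis == from_euc, f"U+{ord(from_sjis):04X}")
```

The byte values differ between the two charsets even when the decoded character is the same, so the charset must be known before bytes can be interpreted.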