What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 專??攪衰汁?????專??攪衰汁?????^ 10011011100100110011111100111111100111011001100010010000100010101000111101100000001111110011111100111111001111110011111110011011100100110011111100111111100111011001100010010000100010101000111101100000001111110011111100111111001111110011111101011110 9b933f3f9d98908a8f603f3f3f3f3f9b933f3f9d98908a8f603f3f3f3f3f5e
EUC-JP 專??攪衰汁?????專??攪衰汁?????^ 11010101111100110011111100111111110110011111100010111111111010101011110111000001001111110011111100111111001111110011111111010101111100110011111100111111110110011111100010111111111010101011110111000001001111110011111100111111001111110011111101011110 d5f33f3fd9f8bfeabdc13f3f3f3f3fd5f33f3fd9f8bfeabdc13f3f3f3f3f5e
UTF-8 專쭸렫攪衰汁흗렩쾡렲춈專쭸렫攪衰汁흗렩쾡렲쵱^ 11100101101100001000100011101100101011011011100011101011101000001010101111100110100101001010101011101000101000011011000011100110101100011000000111101101100111011001011111101011101000001010100111101100101111101010000111101011101000001011001011101100101101101000100011100101101100001000100011101100101011011011100011101011101000001010101111100110100101001010101011101000101000011011000011100110101100011000000111101101100111011001011111101011101000001010100111101100101111101010000111101011101000001011001011101100101101011011000101011110 e5b088ecadb8eba0abe694aae8a1b0e6b181ed9d97eba0a9ecbea1eba0b2ecb688e5b088ecadb8eba0abe694aae8a1b0e6b181ed9d97eba0a9ecbea1eba0b2ecb5b15e
UHC 專쭸렫攪衰汁흗렩쾡렲춈專쭸렫攪衰汁흗렩쾡렲쵱^ 111011101111011011000010111001101000111010111001110011101110011011100001111100011111000111110000110010001110100110001110101101111100010011101001100011101011111111000011110111101110111011110110110000101110011010001110101110011100111011100110111000011111000111110001111100001100100011101001100011101011011111000100111010011000111010111111110000111101110001011110 eef6c2e68eb9cee6e1f1f1f0c8e98eb7c4e98ebfc3deeef6c2e68eb9cee6e1f1f1f0c8e98eb7c4e98ebfc3dc5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
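
A table like the one above can be approximated offline with a minimal Python sketch. The codec names in `CODECS` are assumptions about which Python codecs correspond to the charset labels used here (for example `cp932` for SJIS-Win and `cp949` for UHC); the converter itself may use different mapping tables, so results for some characters could differ. The helper name `encode_all` is hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which matches the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_all("A^").items():
        print(label, bits, hexstr)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.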