What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 偲痔蔀丈厲、蔀承ン偲痔蔀丈厲、蔀承ン^ 100011101100001110001110101001001000111011000001100011111110010011111010100011101010010010001110110000011000111110110011110111011000111011000011100011101010010010001110110000011000111111100100111110101000111010100100100011101100000110001111101100111101110101011110 8ec38ea48ec18fe4fa8ea48ec18fb3dd8ec38ea48ec18fe4fa8ea48ec18fb3dd5e
EUC-JP 偲痔蔀丈厲、蔀承ン偲痔蔀丈厲、蔀承ン^ 101111001100010110111100101001101011110011000011101111101110011010001111101101001101000010001110101001001011110011000011101111101011010110001110110111011011110011000101101111001010011010111100110000111011111011100110100011111011010011010000100011101010010010111100110000111011111010110101100011101101110101011110 bcc5bca6bcc3bee68fb4d08ea4bcc3beb58eddbcc5bca6bcc3bee68fb4d08ea4bcc3beb58edd5e
UTF-8 偲痔蔀丈厲、蔀承ン偲痔蔀丈厲、蔀承ン^ 11100101100000011011001011100111100101111001010011101000100101001000000011100100101110001000100011100101100011101011001011101111101111011010010011101000100101001000000011100110100010011011111111101111101111101001110111100101100000011011001011100111100101111001010011101000100101001000000011100100101110001000100011100101100011101011001011101111101111011010010011101000100101001000000011100110100010011011111111101111101111101001110101011110 e581b2e79794e89480e4b888e58eb2efbda4e89480e689bfefbe9de581b2e79794e89480e4b888e58eb2efbda4e89480e689bfefbe9d5e
UHC ?痔?丈???承??痔?丈???承?^ 00111111111101101100000000111111111011011101101100111111001111110011111111100011101011110011111100111111111101101100000000111111111011011101101100111111001111110011111111100011101011110011111101011110 3ff6c03feddb3f3f3fe3af3f3ff6c03feddb3f3f3fe3af3f5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
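
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name `encode_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex 3f), which resembles the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) of text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "痔^"  # one character from the table plus the trailing '^'
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_bits(sample, codec)
        print(label, sample, bits, hexstr)
```

For instance, `encode_bits("^", "utf-8")` yields `("01011110", "5e")`, which matches the final byte of every row in the table.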