To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 額?????凹?????鍮?????轅??^ 1000101001111010001111110011111100111111001111110011111110001001100110100011111100111111001111110011111100111111111010000100101000111111001111110011111100111111001111111110011101110110001111110011111101011110 8a7a3f3f3f3f3f899a3f3f3f3f3fe84a3f3f3f3f3fe7763f3f5e
EUC-JP 額?????凹?????鍮?????轅??^ 1011001111011011001111110011111100111111001111110011111110110001111110100011111100111111001111110011111100111111111011111010101100111111001111110011111100111111001111111110110111010111001111110011111101011110 b3db3f3f3f3f3fb1fa3f3f3f3f3fefab3f3f3f3f3fedd73f3f5e
UTF-8 額ㅻ퉬溜곕젳凹좉엥栒롥젮鍮꾨젾溜㏝뙛轅ⓦ뀕^ 11101001101000011000110111100011100001011011101111101101100010011010110011101111101001111000101111101010101100111001010111101100101000001011001111100101100001111011100111101100101000101000100111101100100101111010010111100110101000001001001011101011101000011010010111101100101000001010111011101001100011011010111011101010101111101010100011101100101000001011111011101111101001111000101111100011100011111001110111101011100110011001101111101000101111011000010111100010100100111010011011101011100000001001010101011110 e9a18de385bbed89acefa78beab395eca0b3e587b9eca289ec97a5e6a092eba1a5eca0aee98daeeabea8eca0beefa78be38f9deb999be8bd85e293a6eb80955e
UHC 額ㅻ퉬溜곕젳凹좉엥栒롥젮鍮꾨젾溜㏝뙛轅ⓦ뀕^ 11100100111111101010010011101011101110011000010011101010111111101011000011101011101000001010011111101000111010101010000011101010101111111010100011100010111000111000111011100101101000001010010011101011101110011000010011101011101000001011000011101010111111101010011111101001100011001010000011101010101111111010100011100011100001011000111001011110 e4fea4ebb984eafeb0eba0a7e8eaa0eabfa8e2e38ee5a0a4ebb984eba0b0eafea7e98ca0eabfa8e3858e5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
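
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The function name encode_bits and the CHARSETS mapping are hypothetical, and the Python codec names cp932, euc_jp and cp949 are assumed to be close stand-ins for SJIS-Win, EUC-JP and UHC; a few characters may map differently than on this page. Characters that a charset cannot represent are replaced with "?" (3f), which matches the runs of 3f in the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-Win, euc_jp ~ EUC-JP, cp949 ~ UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "額^"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Replacing rather than raising an error keeps one row per charset even when the input cannot be represented, at the cost of losing the original character in that row.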