To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???耀??松?お液???ら?譽???ら?^ 0011111100111111001111111001011101110011001111110011111110001111101111000011111110000010101010001000100101110100001111110011111100111111100000101110011100111111111001101010001100111111001111110011111110000010111001110011111101011110 3f3f3f97733f3f8fbc3f82a889743f3f3f82e73fe6a33f3f3f82e73f5e
EUC-JP 轝??耀??松?お液???ら?譽???ら?^ 10001111111000011010101000111111001111111100110111010100001111110011111110111110101111100011111110100100101010101011000111010101001111110011111100111111101001001110100100111111111011001010010100111111001111110011111110100100111010010011111101011110 8fe1aa3f3fcdd43f3fbebe3fa4aab1d53f3f3fa4e93feca53f3f3fa4e93f5e
UTF-8 轝뚮젶耀붻꽋松㎫お液ㅶ쵊溜ら쪛譽쎻닱囹ら턄^ 11101000101111011001110111101011100110101010111011101100101000001011011011101000100000001000000011101011101101101011101111101010101111011000101111100110100111011011111011100011100011101010101111100011100000011000101011100110101101101011001011100011100001011011011011101100101101011000101011101111101001111000101111100011100000101000100111101100101010101001101111101000101011011011110111101100100011101011101111101011100010111011000111101111101001101010100111100011100000101000100111101101100001001000010001011110 e8bd9deb9aaeeca0b6e88080ebb6bbeabd8be69dbee38eabe3818ae6b6b2e385b6ecb58aefa78be38289ecaa9be8adbdec8ebbeb8bb1efa6a9e38289ed84845e
UHC 轝뚮젶耀붻꽋松㎫お液ㅶ쵊溜ら쪛譽쎻닱囹ら턄^ 11100110101011001000110011101011101000001010101011101001101001011001010011101000100001001001101111100001111001101010011111100111101010101010101011100100111110111010010011100110101011001000110011101010111111101010101011101001101001011001010011100111111000101001101111100010100010001010011111100111101010101010101011101001101101011010000001011110 e6ac8ceba0aae9a594e8849be1e6a7e7aaaae4fba4e6ac8ceafeaae9a594e7e29be288a7e7aaaae9b5a05e

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
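
A row of the table above can be approximated offline with a short Python sketch. This is not the page's own implementation, which is not shown here. The helper names `encode_row` and `encode_table` are hypothetical, and the codec names are Python's. Mapping SJIS-WIN to `cp932` follows the note above, while mapping UHC to `cp949` is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the 3f bytes in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# "cp949" for UHC is an assumption; "cp932" for SJIS-WIN follows the note above.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ^").items():
        print(label, bits, hex_string)
```

For example, `encode_row("お", "utf-8")` yields the hex string `e3818a`, which matches the `e3818a` run in the UTF-8 row above, and `encode_row("お", "cp932")` yields `82a8`, which appears in the SJIS-WIN row.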