To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???昻??松?お液?????苡???ら?^ 00111111001111110011111111111010110100000011111100111111100011111011110000111111100000101010100010001001011101000011111100111111001111110011111100111111111001001000111100111111001111110011111110000010111001110011111101011110 3f3f3ffad03f3f8fbc3f82a889743f3f3f3f3fe48f3f3f3f82e73f5e
EUC-JP ??????松?お液?????苡???ら?^ 001111110011111100111111001111110011111100111111101111101011111000111111101001001010101010110001110101010011111100111111001111110011111100111111111001111110111100111111001111110011111110100100111010010011111101011110 3f3f3f3f3f3fbebe3fa4aab1d53f3f3f3f3fe7ef3f3f3fa4e93f5e
UTF-8 凉붾젧昻뽰쒼松덄お液ㅵ폇溜잌떀苡겸늿囹ら턄^ 11101111101001011011100111101011101101101011111011101100101000001010011111100110100110001011101111101011101111011011000011101100100100101011110011100110100111011011111011101011100011011000010011100011100000011000101011100110101101101011001011100011100001011011010111101101100011111000011111101111101001111000101111101100100111101000110011101011100101101000000011101000100010111010000111101010101100101011100011101011100010101011111111101111101001101010100111100011100000101000100111101101100001001000010001011110 efa5b9ebb6beeca0a7e698bbebbdb0ec92bce69dbeeb8d84e3818ae6b6b2e385b5ed8f87efa78bec9e8ceb9680e88ba1eab2b8eb8abfefa6a9e38289ed84845e
UHC 凉붾젧昻뽰쒼松덄お液ㅵ폇溜잌떀苡겸늿囹ら턄^ 11100101101111001001010011101011101000001001111111100100111010011001011011101100101111101011000011100001111001101000100011100111101010101010101011100100111110111010010011100101101111001001010011101010111111101001111111100101100010111001011011101100101111101011000011100010100010001000100011100111101010101010101011101001101101011010000001011110 e5bc94eba09fe4e996ecbeb0e1e688e7aaaae4fba4e5bc94eafe9fe58b96ecbeb0e28888e7aaaae9b5a05e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
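
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's own, and mapping them to the table's labels is an assumption. The helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    sample = "松お^"
    for label, (bits, hex_string) in encode_table(sample).items():
        print(label, sample, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.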