Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 辱??裔??鹽??n}辱??裔??鹽??n{^ 1001000001001010001111110011111111100101111000010011111100111111111010100110010000111111001111110110111001111101100100000100101000111111001111111110010111100001001111110011111111101010011001000011111100111111011011100111101101011110 904a3f3fe5e13f3fea643f3f6e7d904a3f3fe5e13f3fea643f3f6e7b5e
EUC-JP 辱??裔??鹽??n}辱??裔??鹽??n{^ 1011111110101011001111110011111111101010111000110011111100111111111100111100010100111111001111110110111001111101101111111010101100111111001111111110101011100011001111110011111111110011110001010011111100111111011011100111101101011110 bfab3f3feae33f3ff3c53f3f6e7dbfab3f3feae33f3ff3c53f3f6e7b5e
UTF-8 辱양즯裔볢뮅鹽양ㅊn}辱양즯裔볢뮅鹽양ㅊn{^ 1110100010111110101100011110110010010110100100011110110010100110101011111110100010100011100101001110101110110011101000101110101110101110100001011110100110111001101111011110110010010110100100011110001110000101100010100110111001111101111010001011111010110001111011001001011010010001111011001010011010101111111010001010001110010100111010111011001110100010111010111010111010000101111010011011100110111101111011001001011010010001111000111000010110001010011011100111101101011110 e8beb1ec9691eca6afe8a394ebb3a2ebae85e9b9bdec9691e3858a6e7de8beb1ec9691eca6afe8a394ebb3a2ebae85e9b9bdec9691e3858a6e7b5e
UHC 辱양즯裔볢뮅鹽양ㅊn}辱양즯裔볢뮅鹽양ㅊn{^ 1110100110110100101111101110011110100011100000011110011111100000100100111110100010010010100101001110011110100100101111101110011110100100101110100110111001111101111010011011010010111110111001111010001110000001111001111110000010010011111010001001001010010100111001111010010010111110111001111010010010111010011011100111101101011110 e9b4bee7a381e7e093e89294e7a4bee7a4ba6e7de9b4bee7a381e7e093e89294e7a4bee7a4ba6e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
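
A row like the ones in the table above can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the `CHARSETS` mapping and the `encode_row` helper are hypothetical names, the codec choices (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and characters that a charset cannot represent are replaced with `?` (0x3f), which may not match this page's substitution behavior exactly.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec."""
    # errors="replace" substitutes '?' for any character the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "辱n"
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_row(sample, codec)
        print(label, bits, hexstr)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.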