What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 節??狎??壯??絶??絶???ヮ?鼇??^ 1001000011011111001111110011111111100000101111100011111100111111100110101110000100111111001111111001000011100010001111110011111110010000111000100011111100111111001111111000001110001110001111111110101010000111001111110011111101011110 90df3f3fe0be3f3f9ae13f3f90e23f3f90e23f3f3f838e3fea873f3f5e
EUC-JP 節??狎??壯??絶??絶???ヮ?鼇??^ 1100000011100001001111110011111111100000110000000011111100111111110101001110001100111111001111111100000011100100001111110011111111000000111001000011111100111111001111111010010111101110001111111111001111100111001111110011111101011110 c0e13f3fe0c03f3fd4e33f3fc0e43f3fc0e43f3f3fa5ee3ff3e73f3f5e
UTF-8 節억풃狎숅쎁壯섋찓絶놅슭絶뗥넍樂ヮ굹鼇쇽풌^ 11100111101011111000000011101100100101101011010111101101100100101000001111100111100010111000111011101100100010001000010111101100100011101000000111100101101000111010111111101100100001001000101111101100101100001001001111100111101101011011011011101011100001101000010111101100100010101010110111100111101101011011011011101011100101111010010111101011100001001000110111101111101001101011111111100011100000111010111011101010101101011011100111101001101111001000011111101100100001111011110111101101100100101000110001011110 e7af80ec96b5ed9283e78b8eec8885ec8e81e5a3afec848becb093e7b5b6eb8685ec8aade7b5b6eb97a5eb848defa6bfe383aeeab5b9e9bc87ec87bded928c5e
UHC 節억풃狎숅쎁壯섋찓絶놅슭絶뗥넍樂ヮ굹鼇쇽풌^ 11101111101111011011111011101111101111101000101111100100111001001001100111101001100110111010101111101101111000001001100011101000101010011001010011101111101111101000011011101111101111011011111011101111101111101000101111100101100001101001100111101000111110011010101111101110100000101001100011101000101010001011110011101111101111101001000101011110 efbdbeefbe8be4e499e99babede098e8a994efbe86efbdbeefbe8be58699e8f9abee8298e8a8bcefbe915e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
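
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names (`cp932` for SJIS-Win, `euc_jp`, `cp949` for UHC) are Python's own, the helper names `encode_row` and `encode_table` are made up for illustration, and replacing unencodable characters with `?` (0x3f) is an assumption based on the 3f bytes visible in the table.

```python
# Map the table's charset labels to Python codec names (an assumption:
# the page may use a different conversion library with other mappings).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters that the
    # target charset cannot represent.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("節^").items():
        print(label, bits, hex_string)
```

For the input 節^, this gives 3f5e for ISO-8859-1 (節 has no Latin-1 code point, so it becomes `?`), 90df5e for SJIS-WIN, c0e15e for EUC-JP, and e7af805e for UTF-8, which agrees with the first and last byte groups of the corresponding rows above.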