What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????n}??????????n{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 諸?????艇???n}諸?????艇???n{^ 1000111110010100001111110011111100111111001111110011111110010010111110000011111100111111001111110110111001111101100011111001010000111111001111110011111100111111001111111001001011111000001111110011111100111111011011100111101101011110 8f943f3f3f3f3f92f83f3f3f6e7d8f943f3f3f3f3f92f83f3f3f6e7b5e
EUC-JP 諸?????艇???n}諸?????艇???n{^ 1011110111110100001111110011111100111111001111110011111111000100111110100011111100111111001111110110111001111101101111011111010000111111001111110011111100111111001111111100010011111010001111110011111100111111011011100111101101011110 bdf43f3f3f3f3fc4fa3f3f3f6e7dbdf43f3f3f3f3fc4fa3f3f3f6e7b5e
UTF-8 諸쇘렍렰樂렭艇肋렰렕n}諸쇘렍렰樂렭艇肋렰렕n{^ 1110100010101011101110001110110010000111100110001110101110100000100011011110101110100000101100001110111110100110101111111110101110100000101011011110100010001001100001111110111110100101100100111110101110100000101100001110101110100000100101010110111001111101111010001010101110111000111011001000011110011000111010111010000010001101111010111010000010110000111011111010011010111111111010111010000010101101111010001000100110000111111011111010010110010011111010111010000010110000111010111010000010010101011011100111101101011110 e8abb8ec8798eba08deba0b0efa6bfeba0ade88987efa593eba0b0eba0956e7de8abb8ec8798eba08deba0b0efa6bfeba0ade88987efa593eba0b0eba0956e7b5e
UHC 諸쇘렍렰樂렭艇肋렰렕n}諸쇘렍렰樂렭艇肋렰렕n{^ 111100001011001110111100111001111000111010100011100011101011110111101000111110011000111010111010111011111111001111010010111100011000111010111101100011101010101001101110011111011111000010110011101111001110011110001110101000111000111010111101111010001111100110001110101110101110111111110011110100101111000110001110101111011000111010101010011011100111101101011110 f0b3bce78ea38ebde8f98ebaeff3d2f18ebd8eaa6e7df0b3bce78ea38ebde8f98ebaeff3d2f18ebd8eaa6e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
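
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's built-in codecs `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` are close enough stand-ins for ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC, and that unencodable characters should be replaced with "?" as in the table. The function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical names chosen for illustration.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These codecs are assumed equivalents; edge cases may differ from the page.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "\u8af8n}"  # U+8AF8 followed by the ASCII characters "n}"
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(label, bits, hexstr)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column, which matches the layout of the table above.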